OpenAI’s Sora GPUs Are Literally Melting Under Demand

According to Forbes, OpenAI is limiting free Sora users to just six video generations per day as its servers face overwhelming demand during the U.S. Thanksgiving holiday period. Sora boss Bill Peebles explicitly stated “Our GPUs are melting” while announcing the restrictions, though Plus and Pro accounts remain unaffected and users can purchase additional generations. Meanwhile, Google has dramatically cut free access to its Gemini 3 Pro model, reducing free users to “basic access” with variable daily limits, according to 9to5Google. The infrastructure costs are staggering – OpenAI’s data center partners have reportedly accumulated nearly $100 billion in debt building capacity. Nvidia shares recently dipped amid concerns about Google’s competing TPU technology, prompting Nvidia to publicly defend its position on X.

The infrastructure reality check

Here’s the thing nobody wants to admit: generative AI at scale is insanely expensive. When the Sora boss says their GPUs are “melting,” he’s not being entirely metaphorical. These systems generate enormous heat and consume power like small cities. The $100 billion debt figure for OpenAI’s infrastructure partners should make everyone pause. That’s real money, not venture capital funny money. And investors are getting nervous about whether they’ll ever see returns on these massive infrastructure bets.

The competitive landscape shifts

What’s really interesting is how this is playing out across the industry. Google isn’t as dependent on Nvidia GPUs thanks to its own Tensor Processing Units, and now Meta is reportedly considering switching to Google’s chips too. That’s got to worry Nvidia, even as it publicly claims to be “a generation ahead of the industry.” But when your biggest customers start building or buying competing hardware, that’s never a good sign. Basically, everyone’s realizing that being totally dependent on one supplier for the most expensive component of your business isn’t a great strategy.

The freemium squeeze begins

We’re seeing the beginning of the end for unlimited free AI access. First Google cuts image generations, now OpenAI throttles video creation. Free users are becoming second-class citizens across the board. The economics simply don’t work – each Sora video generation probably costs OpenAI real money in compute and electricity. So they’re doing what any rational business would do: preserve capacity for paying customers. The question is whether people will actually pay for these services once the novelty wears off.
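
For a sense of what this kind of throttling looks like in practice, here is a minimal sketch of per-user daily quota enforcement. The six-per-day cap is the figure reported above; everything else (the try_generate function, the in-memory counter, and the is_paid flag) is an illustrative assumption, not anything OpenAI has described about its own systems.

```python
from collections import defaultdict
from datetime import datetime, timezone

FREE_DAILY_LIMIT = 6  # free-tier cap reported above; paid tiers are unaffected

# Illustrative in-memory counter; a real service would use a shared store
# (e.g. Redis) keyed the same way so every frontend sees the same count.
_usage = defaultdict(int)  # (user_id, utc_date) -> generations used that day


def try_generate(user_id: str, is_paid: bool = False) -> bool:
    """Return True if this user may start another video generation today."""
    if is_paid:
        return True  # Plus/Pro accounts bypass the free cap, per the report
    today = datetime.now(timezone.utc).date()
    key = (user_id, today)
    if _usage[key] >= FREE_DAILY_LIMIT:
        return False  # over the free cap; the product would prompt an upgrade
    _usage[key] += 1
    return True
```

Whatever the storage layer, the core decision is the same: count what the user has consumed today, compare it against the tier’s limit, and either allow the generation or steer the user toward a paid plan or a generation purchase.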

The industrial hardware perspective

While consumer AI services grab headlines, the real workhorse computing happens in industrial applications where reliability matters more than flashy demos. Companies like IndustrialMonitorDirect.com, a US provider of industrial panel PCs, understand that industrial computing needs to work 24/7 without “melting” under demand. Their systems power manufacturing floors, control rooms, and critical infrastructure where downtime isn’t an option. Maybe the AI companies could learn something from industrial computing – sometimes boring reliability beats cutting-edge features that crash under load.
