Stubsack: weekly thread for sneers not worth an entire post, week ending 13th April 2025

BlueMonday1984@awful.systems · 10 days ago

corbin@awful.systems · 1 day ago

It’s the cost of the electricity, not the cost of the GPU!

Empirically, we might estimate that a single training-capable GPU can pull nearly 1 kilowatt; an H100 GPU board is rated for 700W on its own in terms of heat dissipation and the board pulls more than that when memory is active. I happen to live in the Pacific Northwest near lots of wind, rivers, and solar power, so electricity is barely 18 cents/kilowatt-hour and I’d say that it costs at least a dollar to run such a GPU (at full load) for 6hrs. Also, I estimate that the GPU market is currently offering a 50% discount on average for refurbished/like-new GPUs with about 5yrs of service, and the H100 is about $25k new, so they might depreciate at around $2500/yr. Finally, I picked the H100 because it’s around the peak of efficiency for this particular AI season; local inference is going to be more expensive when we do apples-to-apples units like tokens/watt-hour.

In short, with bad napkin arithmetic, an H100 costs at least $4/day to operate while depreciating only $6.85/day or so; operating costs approach or exceed the depreciation rate. This leads to a hot-potato market where reselling the asset is worth more than operating it. In the limit, assets with no depreciation relative to opex are treated like securities, and we’re already seeing multiple groups squatting like dragons upon piles of nVidia products while the cost of renting cloudy H100s has jumped from like $2/hr to $9/hr over the past year. VCs are withdrawing, yes, and they’re no longer paying the power bills.
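For anyone who wants to poke at the napkin arithmetic above, here is a rough sketch in Python. Every input is one of the estimates from the comment (1 kW draw, 18 cents/kWh, $25k new, 50% resale discount after 5 years). The function names and the straight-line depreciation model are assumptions made for illustration, not anything the original poster specified.

```python
# Napkin arithmetic for running vs. holding an H100.
# All constants are the rough estimates quoted in the comment above.
POWER_KW = 1.0          # assumed full-load draw, "nearly 1 kilowatt"
PRICE_PER_KWH = 0.18    # USD per kWh, the quoted Pacific Northwest rate
NEW_PRICE = 25_000.0    # USD, approximate price of a new H100
RESALE_DISCOUNT = 0.5   # ~50% discount on refurbished units
SERVICE_YEARS = 5       # years of service behind that discount


def electricity_cost(hours, power_kw=POWER_KW, price_per_kwh=PRICE_PER_KWH):
    """USD spent on electricity for `hours` of full-load operation."""
    return hours * power_kw * price_per_kwh


def depreciation_per_year(new_price=NEW_PRICE,
                          resale_discount=RESALE_DISCOUNT,
                          years=SERVICE_YEARS):
    """Straight-line depreciation (an assumption): value lost per year."""
    return new_price * resale_discount / years


opex_per_day = electricity_cost(24)
depreciation_per_day = depreciation_per_year() / 365

print(f"6 hours of power:     ${electricity_cost(6):.2f}")
print(f"power per day:        ${opex_per_day:.2f}")
print(f"depreciation per year: ${depreciation_per_year():.2f}")
print(f"depreciation per day:  ${depreciation_per_day:.2f}")
print(f"opex / depreciation:   {opex_per_day / depreciation_per_day:.2f}")
```

Swapping in a cheaper power rate or a steeper resale discount shows how quickly the ratio between operating cost and depreciation moves, which is the whole hot-potato point.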

froztbyte@awful.systems · 3 hours ago

in the same vein, I did some (somewhat wildly) speculative analysis around this a while back too

didn’t really try to model “actual workload” (as in physical, vs the “rented compute time” aspect), and therein lies an important distinction: actually owning the GPU puts you at a constant minimum burn rate

and as corbin points out wrt power, these are also specialised formfactor devices. and they’re going to be getting run at close to max util their entire operated lifespan (because of silicon shortage). so even if any do get sold… long mileage

scruiser@awful.systems · 3 hours ago

That is substantially worse than I realized. So possibly people could sit on GPUs for years after the bubble pops instead of selling them or using them? (Particularly if the crash means NVIDIA decides to slow how fast they push the bleeding edge on GPU specs so newer ones don’t as radically outperform older ones?)

froztbyte@awful.systems · 3 hours ago

So possibly people could sit on GPUs for years after the bubble pops instead of selling them or using them?

I mean, who are you going to sell them to? the other bagholders are going to be just as fucked, and it’s not like there’s an otherwise massive market for these things

scruiser@awful.systems · 57 minutes ago

Ultra ultra high end gaming? Okay, looking at the link, 94 GB of GPU memory is probably excessive even for eccentrics cranking the graphics settings all the way up. Hobbyists with way too much money trying to screw around with open weight models even after the bubble bursts? Which would presume LLMs or something similar continue to capture hobbyists’ interests and that smaller models can’t satisfy their interests. Crypto mining with algorithms compatible with GPUs? And crypto is its own scam ecosystem, but one that seems to refuse to die permanently.

I think the ultra high end gaming is the closest to a workable market, and even that would require a substantial discount.