Instead of time, it should be energy: what's the best model you can train with a given budget in joules? Then the MBP and the H100 are on a more even footing.
Also, my laptop runs Linux, and its outputs are probably mine and private. If I use cloud GPUs, I need to be a lawyer to be sure what they can or can't do with my data or models.
There are also no overages or hidden charges with a laptop, beyond simply breaking it. Even then, you know the replacement cost ahead of time.
H100s are almost instantly available to anyone with a credit card and internet access, without even having to lift their butt from the seat. And you get plenty more than five minutes of compute for the price of an M4.
While I love cloud computing, you're comparing the cost of renting a GPU for a fixed amount of time to the purchase of an asset which can be used for years.
Not a useful comparison IMHO.
Disagree, equity of access matters a lot. Not everyone benefits from exposure to the entire hardware lifecycle, the same way that buying housing is not the best financial decision for everyone regardless of affordability. I might have an unlimited budget, but if I only need access to state-of-the-art hardware intermittently or under irregular circumstances, renting may be the more efficient choice for my needs. Also consider the cost of supporting hardware you fully own: if you own the hardware but underutilize it, that inefficiency is a cost the owner bears. And the unusual way silicon depreciates means the value of your “asset” is not static; it loses value rapidly as silicon manufacturing improves.
For the orgs where I've worked, the important thing isn't availability of compute, it's security. Using what we have on our local network is much easier from a governance and approval standpoint than whatever is available on the internet.
Many orgs have no problem using cloud environments for most things. The usual suspects offer compute environments that are just as secure as everything else they host.
Anyway, I was assuming personal use, like the messing-around experimenting that the article is about. (Or who knows, maybe it was part of the author’s job.)
And yet just about any intro-to-programming tutorial gets something running on your local machine, and local machine development continues to be the default for most people, even though devving on a cloud machine is eminently reasonable.
"Pull out credit card, sign up for some thing and pay a bit of money" is a non-trivial bit of friction! Extremely non-trivial!
Especially in a corporate context - you have to get the expense approved. It's not clear if you can put company data onto the machine. Whereas generally running local things on corporate laptops is far less controversial.
"Download this tool and run it." is still an extremely powerful pitch. Pretty much the only thing that beats it is "go to this website which you can use without any signup or payment".
Yeah, all you need is a large server rack to run those H100s. But realistically, the majority of people have a PC with a consumer-grade GPU, or more likely a laptop with... a laptop-grade GPU.
Cloud H100s don't count because you need a lawyer to review the ToS and other agreements.
Frankly, I think a lot of full-time-employed technical people are largely experimenting for fun in the context of things that might eventually be useful to their employer. AI is cool and fascinating stuff, and when I have a few idle minutes at the end of my workweek I love catching up and experimenting with the latest and greatest, but with an eye towards company problems, on company time, and sometimes using company datasets. That means company vendor approval and company financing of my efforts.
In my personal life, when it's time for fun, I close the laptop and go do some gardening.
Maybe not to buy one, but to rent one. Like how barista-made coffee is an everyday product even though most people can't afford a fancy professional coffee machine.
Reasonably high quality coffee machines are very widespread. Or you can do pour-over. I don’t think the cost of a machine is a limiting factor for many people; it is just convenience.
Maybe an analogy could be made to espresso; nice espresso machines get costlier. But you can still get quite good results out of a manual machine like a Flair.
I think this is why the suggestion to rent a machine is not too helpful. In this analogy we’re on BaristaNews: we all know about the industrial machines, and lots of folks use them at work. But the topic of what sort of things you can do on your manual machine at home has come up.
> Reasonably high quality coffee machines are very widespread. Or you can do pour-over. I don’t think the cost of a machine is a limiting factor for many people
No, reasonably priced coffee machines are an enabling factor for many people.
If coffee machines weren't reasonably priced, they would not be "very widespread".
The Mac is more competitive on power consumption, though, since it's never pulling as much as an Nvidia GPU, as I understand it.
On that note, you can rent an H100 for an hour for under $10, which might make for a slightly more interesting test: what's the best model you can train in under an hour?
It depends. If you're bottlenecked by memory speed, the Mac typically comes out on top.
In terms of compute efficiency, though, Nvidia still has Apple beat. Nvidia wouldn't have the datacenter market on a leash if Apple were putting up a real fight.
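For anyone who wants to sanity-check that claim on their own hardware, a rough roofline-style calculation is enough: compare a workload's arithmetic intensity (FLOPs per byte moved) against the machine's compute-to-bandwidth ratio. A minimal sketch, with placeholder spec numbers that you should swap for your own hardware's published figures:

```python
# Back-of-the-envelope roofline check: is a given matmul memory-bound or
# compute-bound on a given machine? The spec numbers below are illustrative
# placeholders, NOT exact figures.

def matmul_arithmetic_intensity(m: int, n: int, k: int, bytes_per_elem: int = 2) -> float:
    """FLOPs per byte for an (m x k) @ (k x n) matmul, assuming each matrix
    is moved to/from memory exactly once (no cache reuse)."""
    flops = 2 * m * n * k                                   # multiply + add per term
    bytes_moved = bytes_per_elem * (m * k + k * n + m * n)  # A, B, and the output C
    return flops / bytes_moved

def machine_balance(peak_tflops: float, mem_bw_tbps: float) -> float:
    """FLOPs per byte the hardware can sustain at peak; workloads with lower
    arithmetic intensity than this are memory-bandwidth-bound."""
    return (peak_tflops * 1e12) / (mem_bw_tbps * 1e12)

# Illustrative placeholder specs: (peak half-precision TFLOPS, memory bandwidth in TB/s)
machines = {
    "laptop-class GPU (assumed)": (30.0, 0.5),
    "H100-class GPU (assumed)": (1000.0, 3.3),
}

workloads = {
    "batch-1 decode step (1x4096 @ 4096x4096)": matmul_arithmetic_intensity(1, 4096, 4096),
    "training matmul (4096x4096 @ 4096x4096)": matmul_arithmetic_intensity(4096, 4096, 4096),
}

for wname, ai in workloads.items():
    print(f"{wname}: {ai:.1f} FLOPs/byte")
    for mname, (tflops, bw) in machines.items():
        bound = "compute-bound" if ai > machine_balance(tflops, bw) else "memory-bound"
        print(f"  {mname}: {bound}")
```

With these rough numbers, batch-1 decode-style matmuls land well below either machine's balance point (bandwidth-bound), while big training matmuls land well above it (compute-bound), which is roughly why the two camps end up talking past each other.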
> Instead of time it should be energy (...) Then the MBP and H100 are on a more even footing.
What exactly is your point? That instead of expressing workloads in terms of what a laptop could do, you prefer to express them in terms of what a MacBook Pro could do?
The point is that "best model you can train in 5 minutes" is hardware-dependent; the answer will be different depending on the hardware available. So it's necessarily a single-player game.
"Best model you can train with X joules" is a fairer contest that multiple people could take part in even if they have different hardware available. It's not completely fair, but it's fair enough to be interesting.
Training models with an energy limit is an interesting constraint that might lead to advances. Currently, LLMs implement online learning by having increasingly large contexts that we then jam "memories" into, so there is a strict demarcation between information learned during pre-training and information learned during use. New, more efficient approaches to training could perhaps inform new approaches to memory that are less heterogeneous.