Apple M1 Ultra (apple.com)
1168 points by davidbarker on March 8, 2022 | 828 comments



I think the GPU claims are interesting. According to the graph's footer, the M1 Ultra was compared to an RTX 3090. If the performance/wattage claims are correct, I'm wondering if the Mac Studio could become an "affordable" personal machine learning workstation (which also won't make the electricity bill skyrocket).

If Pytorch becomes stable and easy to use on Apple Silicon [0][1], it could be an appealing choice.

[0]: https://github.com/pytorch/pytorch/issues/47702#issuecomment... [1]: https://nod.ai/pytorch-m1-max-gpu/
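
To make that concrete, here's a minimal sketch of what using it could look like, assuming the Metal/MPS backend discussed in [0] ships roughly as proposed (the "mps" device name and the availability check are assumptions from that thread, not something you can rely on today):

    import torch

    # Assumes a PyTorch build that includes the proposed Metal/MPS backend.
    has_mps = getattr(torch.backends, "mps", None) and torch.backends.mps.is_available()
    device = torch.device("mps") if has_mps else torch.device("cpu")

    model = torch.nn.Linear(512, 512).to(device)
    x = torch.randn(64, 512, device=device)
    y = model(x)  # runs on the Apple GPU only when the backend is present
    print(y.device)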


The GPU claims on the M1 Pro & Max were, let's say, cherry picked to put it nicely. The M1 Ultra claims already look suspicious since the GPU graph tops out at ~120W & the CPU graph tops out at ~60W yet the M1 Studio is rated for 370W continuous power draw.

Since you mention ML specifically, looking at some benchmarks out there (like https://tlkh.dev/benchmarking-the-apple-m1-max#heading-gpu & https://wandb.ai/tcapelle/apple_m1_pro/reports/Deep-Learning... ), even if the M1 Ultra is 2x the performance of the M1 Max (so perfect scaling), it would still be far behind the 3090. Like completely different ballpark behind. Of course there is that price & power gap, but the primary strength of the M1 GPUs really seems to be the very large effective VRAM amount. So if your working set doesn't fit in an RTX GPU of your desired budget, then the M1 is a good option. If, however, you're not VRAM limited, then Nvidia still offers far more performance.

Well, assuming you can actually buy any of these, anyway. The M1 Ultra might win "by default" by simply being purchasable at all unlike pretty much every other GPU :/


The 3090 can also do fp16 while the M1 series only supports fp32, so the M1 chips basically need more RAM for the same batch sizes. So it isn't an oranges-to-oranges comparison.

Back when that M1 MAX vs 3090 blog post was released, I ran those same tests on the M1 Pro (16GB), Google Colab Pro, and free GPUs (RTX4000, RTX5000) on the Paperspace Pro plan.

To make a long story short, I don't think buying any M1 chip makes sense if your primary purpose is Deep Learning. If you are just learning or playing around with DL, Colab Pro and the M1 Max provide similar performance. But Colab Pro is ~$10/month, and upgrading any laptop to the M1 Max is at least $600.

The "free" RTX5000 on Paperspace Pro (~$8 month) is much faster (especially with fp16 and XLA) than M1 Max and Colab Pro, albeit the RTX5000 isn't always available. The free RTX4000 is also a faster than M1 Max, albeit you need to use smaller batch sizes due to 8GB of VRAM.

If you assume that M1-Ultra doubles the performance of M1-Max in similar fashion to how the M1-Max seems to double the gpu performance of the M1-Pro, it still doesn't make sense from a cost perspective. If you are a serious DL practitioner, putting that money towards cloud resources or a 3090 makes a lot more sense than buying the M1-Ultra.


> The 3090 also can do fp16 and the M1 series only supports fp32

Apple Silicon (including base M1) actually has great FP16 support at the hardware level, including conversions. So it is wrong to say it only supports FP32.


I'm not sure if he was talking about the ML engine, the ARM cores, the microcode, the library or the OS. But it does indeed have FP16 in the Arm cores.


FP16 is supported in M1 GPU's and Neural Engines through the CoreML framework. From https://coremltools.readme.io/docs/typed-execution :

> The Core ML runtime dynamically partitions the network graph into sections for the Apple Neural Engine (ANE), GPU, and CPU, and each unit executes its section of the network using its native type to maximize its performance and the model’s overall performance. The GPU and ANE use float 16 precision, and the CPU uses float 32.

Also, this exploration (https://tlkh.dev/benchmarking-the-apple-m1-max#heading-neura...) reports FP16 performance in the 5.1-5.3 TFLOPS ballpark.
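
If you want to see the FP16 path explicitly, coremltools exposes it at conversion time; a rough sketch (the tiny traced conv is just a stand-in model):

    import torch
    import coremltools as ct

    # Stand-in model; any traced torch.nn.Module converts the same way.
    example = torch.randn(1, 3, 224, 224)
    traced = torch.jit.trace(torch.nn.Conv2d(3, 8, 3), example)

    # FLOAT16 precision lets the GPU/ANE execute in their native type,
    # per the typed-execution doc quoted above.
    mlmodel = ct.convert(
        traced,
        inputs=[ct.TensorType(shape=example.shape)],
        convert_to="mlprogram",
        compute_precision=ct.precision.FLOAT16,
    )
    mlmodel.save("conv_fp16.mlpackage")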


I should have been more clear. I didn't mean the hardware, but the speedup you get from using mixed precision in something like Tensorflow with an NVIDIA GPU.


Thanks. At least when I ran the benchmarks with Tensorflow, using mixed precision resulted in the CPU being used for training instead of the GPU on the M1 Pro. So if the hardware is there for fp16 and they implement the software support in DL frameworks, that will be great.
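
For anyone curious what "using mixed precision" refers to here, it's the standard Keras global policy switch; this is roughly what was being enabled when training fell back to the CPU (just for reference, not a workaround):

    import tensorflow as tf

    # Compute in float16, keep variables in float32.
    tf.keras.mixed_precision.set_global_policy("mixed_float16")

    model = tf.keras.Sequential([
        tf.keras.layers.Dense(256, activation="relu", input_shape=(784,)),
        # Keep the final layer in float32 for numerical stability.
        tf.keras.layers.Dense(10, activation="softmax", dtype="float32"),
    ])
    model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")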


Yes, unfortunately, the software is to blame for the time being, and I also ran into issues myself. :\ Hope they catch up to what the hardware delivers well, including both the GPU and the Neural Engine.


At some point can we finally admit that Apple's GPU claims just aren't....true? Like, every Apple keynote they put up incredible performance claims, and every time people actually get their hands on the product, it doesn't even come close to holding water in any domain where GPU performance matters (Game performance, ML training perf)


> GPU performance matters (Game performance, ML training perf)

No one plays games on a Mac.

And it has nothing to do with GPU performance but rather the fact that the audience simply isn't interested in gaming on it and so there is no economic incentive to target them.

So the GPU performance that matters to Mac users and is relevant to Apple is not games but rather content creation, production etc.


> No one plays games on a Mac.

My wife does (I play games on Linux).

I have friends who own Macs who reluctantly dual boot to Windows just to play some games -- they would completely ditch Windows if they could just play every game on Mac.

I see there are Mac games on Steam.

All of this points to the situation being more nuanced than "no one plays games on Mac".


I interpret these sorts of statements to be short for "a statistically insignificant number of people play games on macos (or linux)" rather than the literal case where as long as a single person does, it is false.

According to the steam hardware survey, Windows is 95%, Macos is 4%, and Linux is 1%. And to dig deeper you'd need to see what games that 5% of the non-Windows is playing - are they simpler games that don't need graphics acceleration (e.g. puzzle games, roguelikes, etc) or ones that do?

My desktop is an Intel NUC running Ubuntu, and yeah I play games on it. Slay the Spire, Spacechem, even some older MMOs like DDO or LoTRO (which run but at 15-20 fps since that system just has Intel Iris). I'm unable to even start many others (e.g. Grim Dawn) due to not having dedicated graphics.

So yeah it's nuanced but lots of games that need a graphics card don't run or even display on that system.

That's why I have a windows gaming system too. I'm realistic, the market just isn't there. I used to have a Mac (dropped it in 2018) and if I still did I'd subscribe to Geforce Now or just do console gaming.


You’re right, but I think there’s an immensely valuable lesson here for people who communicate with engineers as part of their job: be mindful of casual precision mistakes. “No one” and “a statistically insignificant number of” are colloquial synonyms to civilians, but they are black and white to engineers, especially software people. They are the difference, for example, between a safe code path and an edge-case bug. In my experience, people who use casual imprecision in technical conversations are sometimes seen as inexperienced or not fully understanding a problem, which may actually not be the case. Learning to speak with precision without being pedantic is an excellent soft skill that can be developed over time.


I understood it was about statistical significance, but my point remains:

Who says Mac users aren't gamers? It's a self perpetuating vicious cycle: gamers use Windows because that's where the majority of games are, so developers keep targeting Windows, so gamers use Windows, and so ad infinitum.

But people who use Macs enjoy games as well. They would rather not dual boot to Windows.

It's not true that Mac users don't play games. Rather, it's that most games are on Windows, which is a shame.


If you want to get a bit more into significant numbers, Apple has the largest number of games for any platform and it has the largest number of gamers. They're mobile games, but on the M1 people can play them on their desktop.

What I think you mean is competitive gamers and AAA gamers do not play on Apple hardware. This is mostly true today ofc, but keep in mind that's actually not the majority of the gaming market. Apple is raking it in from its gaming market.


Grim Dawn is just poorly implemented. I have a Ryzen 3700x with a 2070 super and Grim Dawn runs like crap.


The interesting question would be: out of all Mac users, how many of them play games on their Macs?

I have no idea what the answer is. Personally I have run a number of games on macOS, via Steam, and via Boot Camp and virtualization. Some popular MMOs like Final Fantasy XIV, World of Warcraft, and Eve Online have macOS clients, though Guild Wars 2 discontinued theirs.

Apple Arcade apparently has enough Mac users that Apple has a reason to support it on macOS as well as iOS.

And Apple apparently has some reason to support iOS/iPadOS games on Apple Silicon Macs as well (though it could just be a side effect of a future iOS-macOS merger or hybrid device.)


> The interesting question would be: out of all Mac users, how many of them play games on their Macs?

...and how many would play more games on their Macs if they were available?

I refuse to believe Mac users are less fond of videogames. Based on my personal observation of the Mac users I know, they enjoy games as much as anyone.

At least casual games on iTunes for the iPad have a vast library, with many genuine good games (source: me, my iPad 2 was mainly a gaming platform for me, never found other uses for it).

I do realize more complex games are a different beast. But casual? I'd say Apple fans love them.


M1's blender rendering performance also wasn't very good, though. Assuming the scene was small enough to fit in the smaller VRAM of competitive consumer GPUs anyway.

Video content creation specifically is where it mostly achieves what the graphs indicate, and that's mostly a "well yeah, video decoder ASICs are really efficient. I'll take things we already knew for $100, Alex"


Blender didn’t even run on Metal until a few months ago. It’s in beta now and the performance has increased substantially.

If the software is optimized, the graphics hold up fine.


The other elephant in the room is OptiX. Are Apple seriously indicating that they can outperform that with an iGPU?


Unless you specifically configure Blender for GPU rendering, it renders on the CPU(s).


Huh? I know lots of people who play games on a mac.


I thought that would be a given by now.

Won't even get into that it can't run most things you'd want that kind of hardware for.


Even their graphs don't look legit. They look like what you see on boxes of vitamins found on random websites. Got to wait for independent benchmarks with well-explained scenarios. Anything can run twice as fast as anything with some specific tweaks to the program being run.


100W of that is probably for the USB ports; afaik TB4 ports are required to support 15W, and I don’t think there’s been a Mac that didn’t support full power simultaneously across all ports (if that’s even allowed?).


I suppose given that this is two M1 Max dies glued together, assuming cooling is a solved problem, the max SoC power consumption is just twice as high as usual, plus interconnect overhead. Right? Based on the thermal and power consumption characteristics of previous chips I would not be surprised if, say, ~120W is the max power draw of this thing.

edit: Of course the M1 max only shipped in laptops, so... who knows.


The M1 Max hits 100w in a laptop form factor with 'real' workloads when hitting CPU and GPU simultaneously (or at least not parasitic ones like prime95 & furmark). So this is probably >200w, unless it's been power limited and thus performs worse than 2x M1 Max's do anyway.


Does the M1 Max throttle when hitting CPU & GPU at the same time? The 14-inch would be most interesting to me.


> assuming cooling is a solved problem

I’d assume that’s what most of the chonk is about, no?

> Based on the thermal and power consumption characteristics of previous chips I would not be surprised if say ~120W is the max power draw of this thing.

The Max could be brought up to 90W or so.


I wouldn't assume heat is solved, as it's been Apple's weak point in the past. The Cube would crack, G5 iMacs would melt capacitors, MacBooks would burn users' laps.


Those issues were all almost 20 years ago.

If you think this is still a problem, you haven't used any recent Macs. The current MB Air and MB Pro both run very cool even under prolonged heavy loads.

Apple's management of any and all heat issues has been far better than any competitors for a while now.


> Apple's management of any and all heat issues has been far better than any competitors for a while now.

Only if you define "for a while" as "since a year ago with the introduction of the M1".

Apple refused to make a thicker laptop or one with better ventilation to adequately cool the CPUs & GPUs they were sticking in them. They were among the worst, if not the worst, of them all at handling the heat of the components they were using. Until the M1 Pro & Max rolled around, anyway, and suddenly they got thicker, with feet that raise it farther off the desk, and an absolutely massive amount of vents all over 3 sides of the machine. Curious timing on that...


The last Intel Macs still run way too hot. They get up over 80°C. The 2014 ones would hard reboot when building large Java projects.


To be fair the current Dells do the same.


Haha, what are you smoking? My 2018 MacBook Pro was constantly throttling under heavy load; its thermal management was terrible.


I didn't say that particular model was perfect. I said Apple has been doing far better than competitors. Which is true. I set up and used hundreds of Macs and hundreds of PCs in 2018 as part of my job. I am pretty confident as to which performed better overall thermally.

And of course, Apple has made huge progress since then (M1, better thermal designs, new fan designs which are quieter and more efficient) whereas PC makers have made basically zero progress.


You said "Those issues were all almost 20 years ago."

The "huge progress since then" was a year ago.

Your timeline is a little bit made up.


Two thirds of the volume seems to be dedicated to cooling. Assuming they’re not complete idiots, they must be doing something!



I'm confident the marketing oversells it, but it's likely very good in comparison. 3090 is about 28B transistors on Samsung 8nm, with some budgeted for raytracing. This is on TSMC 5nm, a process with 3-4x the density, and the 114B transistor count could potentially allow for similar GPU size - although I'd wait for Locuza or someone to analyze it. It should be very competitive in performance, and the winner by far in perf/watt, at least until RDNA3 and Lovelace GPUs release towards the end of the year.


You are comparing the power source (370W) to the CPU/SOC (120W). The power supply provides power for USB-C/Thunderbolt ports and it’s never a good idea to spec a power supply too low and run it too close to capacity.


I am sure it has great GPU performance for what it is, but comparing it to a top Nvidia chip just seems ridiculous on Apple's part. I think Apple is going to have trouble winning back the semi-pro workstation market they abandoned not that many years ago if they do not start offering M1 chips along with Nvidia GPUs.

Once again we get a Mac Semi-Pro Mini (seems like the Studio is more like a replacement for the Trashcan) that their marketing implies is maybe as good as a Mac Pro but is obviously not. It does look a lot better this time around - at least it has more ports :-D


Just a note: they explicitly said the Mac Pro is coming.


I'm interested to see if they shoot themselves in the foot and try to make it all M1 architecture or partner with AMD or Nvidia


Watching the keynote I was almost thinking that Nvidia missed the boat when they chose not to sign whatever they had to to make OSX drivers.

Thank you for recalibrating me to actual reality and not Apple Reality (tm)


nVidia missed the boat in releasing a bunch of "replace the whole laptop logic board" chips that died in the 2008-2012 timeframe and annoyed a whole host of OEMs:

https://www.techpowerup.com/64683/nvidia-admits-to-selling-f...

Apple specifically: https://support.apple.com/en-us/HT203254


Nvidia switched to lead free solder while retaining the same potting material. This led to a mismatch in thermal expansion coefficients which caused strain with repeated thermal cycles and eventually some of the solder bumps just broke. You could use a toaster to melt the solder again and reconnect them but that didn't fix the underlying problem.


My 2011 MBP 15” suffered from the Nvidia GPU issue. It was so bad my computer wouldn’t boot properly and the Apple Geniuses kept denying the claim because it couldn’t finish a test.

Anyway, it was still my longest lived Laptop. My Sony VAIOs were great but I liked that Mac better.


I pulled the logic board out and baked it in the oven and it lasted another 7 months before needing another bake. By then I moved onto a newer macbook.


By 2011 it was AMD(ATI) not Nvidia. I had one that failed but Apple did the replacement for free.


I am certain AMD like most responsible vendors (Seagate) also worked with Apple to correct the issue. Nvidia's issue was that it told all of the laptop vendors to just deal with it. It's why they are hated by other vendors and AMD/Intel worked hard to keep them from creating x86 cpus.


> Well, assuming you can actually buy any of these, anyway. The M1 Ultra might win "by default" by simply being purchasable at all unlike pretty much every other GPU :/

Can we stop it with the meme that these GPUs are unobtainable? Yes, they are still overpriced compared to their supposed original prices and they'll likely never return to that price given that the base prices of manufacturing, materials and such have increased for multiple reasons.

But stock has been generally available for many months now and it's possible to get them as long as you can afford them.


Where are these available? I just checked all the links on Nvidia's webpage for a 3060 and they are all out of stock...


> 370W continuous power draw.

Don't know if it's the same, these days, but when I was designing electronic stuff, we were always told to spec the power supply at twice the maximum draw.


> The M1 Ultra claims already look suspicious since the GPU graph tops out at ~120W & the CPU graph tops out at ~60W yet the M1 Studio is rated for 370W continuous power draw.

And that is expected, a lot has to be reserved for USB devices.


Yeah, can’t say I’ve ever seen a computer rated for the exact amount it’ll draw. They have to leave room.


The SSD and those two large fans use a fair amount of power too.


Those fans are unlikely to break 5w combined.


Guess we will see once one is available to be tested, but I doubt that’s the case. I was guessing 5-8W each, but I could be wrong of course.


Most PC fans are 1-2w. Given the claims of "near silent", I think it's plenty safe to say these are not 5,000+ RPM rippers that are going to be in the 5-8W each range.


Given the M1 Ultra has a 2lb heavier heat sink just to dissipate the extra heat, not sure that’s a safe bet. There are other ways to reduce noise than just using lower RPM fans.


> There are other ways to reduce noise than just using lower RPM fans

No, not really. You can try adding sound dampening, like BeQuiet does, but it's not as effective as just having more lower RPM fans (although it does help with coil whine). But Apple has historically never used sound dampening, and this doesn't look like it changes that. With how "open" it is anyway (the entire back being just a bunch of holes), sound dampening wouldn't be all that effective.

I'm not really sure what you think the heavier heat sink has to do with either the fan RPM or the noise profile. The bigger heatsink is if anything evidence of larger, lower-RPM fans. They're using more surface area, so they can spread the air movement out over larger fan blades. Which means they don't need to use as high an RPM fan.


A massive (by weight) heatsink can be used as thermal mass. This delays the need to increase fan speed and allows spreading that increase over time. One thing that is more noticeable than fan speed is rapid fan speed changes -- avoiding those makes the entire system seem quieter.


Yeah, there are a lot more parameters here than just 'there are two chips and one is the bestestst', like the availability you pointed out.

There is raw performance, but there is also performance per watt, availability and scalability (which is both good and bad - M1 is available, but there is no M1 Ultra cloud available). If you want a multi-use setup, an RTX makes more sense than most other options, if you can get one and at a reasonable price. If you want a Mac, the M1U is going to give you the best GPU. In pretty much all other setups there are so many other variables it's hard to recommend anything.


For the market this is aimed at, performance per watt is really irrelevant. Performance per dollar or just outright performance are far more important the vast majority of the time. That's how we ended up with 125w+ CPUs and 300w GPUs in the first place.


There are dedicated ML cards from Nvidia for that, far most powerful than a 3090, so that is indeed true. But PPW is never irrelevant when someone is doing things at scale, so the question becomes: who is doing this for money but somehow not at scale?


These aren't rack mount products aimed at cloud providers, they are essentially mini workstations. What are you calling "at scale" for this? You're basically always pairing one of these machines to one physical person sitting at it whose time is being paid for as well (even for a solo creator, time is still money). It's a terrible tradeoff to save pennies per hour on power to turn around and pay the person dollars more per hour waiting on results.


That seems like a really bad way to spend money. Why limit a person to a single workstation if the workstation is the limiting factor? This is where we get clouds or if it must be done locally, rack mounted systems with many cards.

If you are doing it solo, with "just the hardware you happen to have", it matters a bit less. If you are doing it constantly to make money, and you need a lot of power, buying a one-person desk-machine makes no sense.


The cost of powering my 3090 for a year is now more than the cost (RRP) of a 3090.


Where do you live that power is anywhere close to that expensive? And are you overclocking your 3090?

Even assuming literal 24hr/day usage at a higher, "factory overclocked" 450w sustained, at a fairly high $0.30/kWh that's $1200/yr. Less than half the retail price of a 3090. And you can easily drop the power limit slider on a 3090 to take it down to 300-350w, likely without significantly impacting your workload performance. Not to mention in most countries the power cost is much less than $0.30/kWh.

At a more "realistic" 8 hours a day with local power pricing I'd have to run my 3090 for nearly 10 years just to reach the $2000 upgrade price an M1 Ultra costs over the base model M1 Max.
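
The arithmetic, if you want to plug in your own numbers (450W sustained and $0.30/kWh are the assumptions from above):

    # Rough annual electricity cost of running a GPU at constant draw.
    watts = 450            # assumed sustained draw
    usd_per_kwh = 0.30     # assumed electricity price
    hours = 24 * 365

    kwh = watts / 1000 * hours
    print(f"{kwh:.0f} kWh/year -> ${kwh * usd_per_kwh:.0f}/year")
    # ~3942 kWh/year -> ~$1183/year, i.e. the ~$1200 figure above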


UK electricity is $0.28/kWh but will be $0.36/kWh from the end of the month - my business is quoted at $0.60/kWh fixed for the next 12 months.

At $0.36/kWh, the card alone at 450W running around the clock costs roughly the RRP of a 3090 ($1,499) per year.

Yes, you can power it down to be more efficient; however, that effectively agrees with the previous comment that PPW matters.


Wholesale electricity price in some EU states is €550/MWh today. Most EU states are above €250/MWh.


prices in Europe right now are considerably higher than $0.30/kWh


It seems safe to say prices right now are not the norm due to, you know, that whole war thing going on that is impacting one of the EU's primary power supplies. The 2021 EU average was otherwise $0.22/kWh.


Yeah, but let's see how fast we get those "normal" prices again; maybe this is the new normal, who knows.


The war is a factor going forward, but we got notified of the price increases before the war started, they relate more to the piss poor planning of our rulers than any external factors. Unless the people in charge are going to suddenly start planning for the future this will be the new normal.


But do you really run it all year round?


> performance per watt is really irrelevant.

Watts are dollars that you'll continue spending over the system life. It matters because you can only draw so many amps per rack and there will be a point when, in order to get more capacity, you'll need to build another datacenter.


The market is not data center use.


You'll still spend another Mac Studio on energy in order to run a comparable PC for the next five years. To say nothing about not wanting to be in the same room as its fans.


What are you talking about? This is not for cloud datacenters trying to squeeze every bit of compute per resource.

These machines are commonly used by professionals in industries like movie and music production. They don't care what the power bill is, it's insignificant compared to the cost of the employee using the hardware.


> "They don't care what the power bill is"

Oh... They do. At least, they should. If a similar PC costs $500 less but you spend $700 more in electricity per year because of it, at the end of the year, your profits will show the difference.


As I said, these numbers are so insignificant that they don't matter. The cost of the employee and their productivity is several orders of magnitude more.

I ran a visual effects company a decade ago. We bought the fastest machines we could because saving time for production was important. The power draw was never a factor; a few catered lunches alone would dwarf the power bill.


Note that Geforce RTX on cloud is prohibited by Nvidia.


Yep, that's true. You have to use the DC SKUs which (IIRC) aren't the same silicon either. Worse: some of the server SKUs are restricted for market segmentation where your ML and hashing performance is bad but video is good (and the other way around).

The silly thing about it is that most of the special engines can now be flashed into an FPGA which is becoming more common in the big clouds so special offload engines aren't that big of a deal when they are missing. So in some cases you can have your cake and eat it too; massive parallel processing and specialised processing in the same server box without resorting to special tricks (as long as it's not suddenly getting blocked in future software updates).


The part of the EULA which is supposed to enforce that is not enforceable in Germany. It is complicated. There might be other ways you can circumvent agreeing to the EULA based on your location.


How do they functionally do that? I googled and found this?: https://www.nvidia.com/en-us/data-center/rtx-server-gaming/

Honestly asking because I’m kind of out of the nvidia loop at the moment.


Technically: the driver detects if it is run in a virtualization environment, it is at least able to detect KVM and VMware. On the upside, it's relatively easy to bypass the check.

Legally: I assume no cloud provider will assume the legal risk of telling their customers "and here you have to break the EULA of the NVIDIA driver in that way to use the service". In Europe where the legal environment is more focused on interoperability, this might not be as much of a problem, but still it may be too much risk.


They disallow such usage for "GeForce" by proprietary driver's EULA, and they limit open source driver performance (IIRC they require signed binary blob).


An M1 Ultra is $2000 incrementally over a M1 Max, so there is no price gap, even with the inflated prices 3090s actually go for today.


To be fair that $2000 also gets you +32GB RAM, +512GB storage, and +10 CPU cores. It's not just the GPU. Although yeah you can definitely fit a 3090-equipped PC into a $4k budget even with ebay pricing if pure GPU performance is all you really want.


> To be fair that $2000 also gets you +32GB RAM, +512GB storage, and +10 CPU cores.

Even though it's not an apples to apples comparison, keep in mind that a 1x32GB DIMM sells for less than 150$, and you can buy 1TB SSDs for less than 100$.


M1 Ultra is 64GB extra, not 32.

> keep in mind that a 1x32GB DIMM sells for less than 150$

Keep in mind that M1 Pro/Max/Ultra is LPDDR5 6400 (https://www.anandtech.com/show/17024/apple-m1-max-performanc...) connected by a 512 bit memory controller.

Whereas a kit of 2x 32 GB DDR5 4800 (I could not easily locate a quote for a 1x 64 GB DDR5 4800 DIMM, let alone 6400) retails for USD 548 (https://www.newegg.com/crucial-64gb-288-pin-ddr5-sdram/p/N82...).

I could not locate a reliable source on the type of the SSD employed in M1 Pro/Max/Ultra, so I will refrain from remarking on the comparison.


I remember a slide mentioning, I think, “up to 7.4GB/sec SSD read/write speed” or similar, which drastically reduces the pool of comparison SSDs. Intel Optane meets those specs, as do a few other brands I hadn’t heard of previously. In the latter case, a 2TB version seemed to be $350-400. Take this for what it’s worth, but the SSD is going to be more than $100 extra cost imho.


That's pretty much the speed that Samsung advertises for the 980 Pro: https://www.samsung.com/us/computing/memory-storage/solid-st...

And what WD advertises for the Black SN850: https://www.westerndigital.com/products/internal-drives/wd-b...

And what Seagate advertises for the FireCuda 530: https://www.seagate.com/products/gaming-drives/pc-gaming/fir...

And what Gigabyte advertises for the Gen4 7000s: https://www.gigabyte.com/Solid-State-Drive/AORUS-Gen4-7000s-...

etc...

They aren't $100 for 1TB, no, but a lot of them are around $150. Which would be a lot less than +$100 to go from 512GB to 1TB, too. It's $40 to go from the 500GB SN850 to the 1TB SN850, for example.


I checked the first two and it’s 7,000 read / 5,000 write. Pretty sure Apple said read AND write, which would be a lot faster than those. I might go back and rewatch the keynote, but I still think we are arguing pointlessly as Apple has always overcharged for SSD and RAM upgrades vs the price you’d pay elsewhere. Thanks for the DV though, even when what I stated was right! ;)


It achieves this speed by not insisting on flushing to disk when requested: https://nitter.net/marcan42/status/1494213855387734019

When configured to ensure data integrity in the case of power loss (more important in this new M1 Studio machine unless it comes with integrated battery), then it's a lot worse.
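
For anyone who wants to see the distinction being made there: on macOS a plain fsync() may only push data as far as the drive's cache, and you have to ask for F_FULLFSYNC to force it to stable storage, which is the slow path. A minimal macOS-only illustration (assuming your Python build exposes fcntl.F_FULLFSYNC, which it does on macOS):

    import fcntl, os

    fd = os.open("data.bin", os.O_WRONLY | os.O_CREAT, 0o644)
    os.write(fd, b"important bytes")

    os.fsync(fd)                        # may stop at the drive's cache on macOS
    fcntl.fcntl(fd, fcntl.F_FULLFSYNC)  # ask the drive to flush to stable storage
    os.close(fd)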


I have a 8TB M1 Max. It pretty consistently has a max write speed of ~7.3 GB/s and max read of ~5.4 GB/s. No idea why or how write is faster, but that's not a typo.


Their website just specifies read speeds with no mention of write speeds. It's also "up to" and tested on the 8TB models. Assuming it's like other SSDs smaller capacities are usually slower.


HBM2 is a tad more expensive, but yes.


Apple doesn't use HBM2, so not really relevant


Correct, my bad. GDDR6X is still more expensive than the GDDR4 you get with Threadripper.


Apple isn't using GDDR6X, either, nor does threadripper use GDDR4.


Wait, you can get a 3090?


3090 is literally the easiest card to get at RRP, at least here in UK. Set up discord alerts, the FE cards stay in stock for hours, the last one in February was in stock for the entire day so anyone who wanted to buy one at RRP(£1399) could do so without any issue at all.


This.. The FE is in stock every month for RRP. The 3080Ti too, I got mine that way.


It's still 2400 EUR in Germany, for example.


I'm not sure who sells the FE cards in Germany actually. I know it's LDLC in Netherlands, France and Spain.

edit: just found out - it's NBB:

https://www.notebooksbilliger.de/

There's probably a German discord somewhere to have alert for drops.


Yeah. They've been in stock for months here (though the retailers are charging the inflated prices too), e.g. https://www.computeruniverse.net/en/c/hardware-components/nv...


I mean at that price…yeah it’s “in stock” but it sure as hell ain’t available


All of those listed are WAY above MSRP! MSRP is $1,499 in the U.S. and £1,399 in the U.K.


The question was if they were available, not if they were available at MSRP (thanks downvoters, the comment even called out the price was inflated...). My understanding is that until recently in the US you basically had to buy from scalpers as the stocks had none at any price unless you stalked for restocks.

They're overpriced for sure, and that's the only reason the M1U pricing looks equivalent rather than exorbitant


> The question was if they were available, not if they were available at MSRP (thanks downvoters, the comment even called out the price was inflated...).

I wasn’t a DVer, but they’ve always been available if you were willing to pay a scalper. The only thing that has changed is that more retailers find it appropriate to rip off their customers. It’s kinda like a liquor store I’ve done business with for years now wanting $3,800 for a bottle of 23yr PVW. MSRP is $299.99. Should the owner be able to make extra profit on it? Of course they should. But >12x MSRP is just predatory imho.


What does "overpriced" mean, other than "more than you are willing to pay?"


* Higher than historical norms due to very unusual market conditions

* Much higher than MSRP, which was reduced compared to the previous generation because that attempt at raising prices killed market demand for said previous generation.

* No longer affordable by the traditional customer base but sustained by a new market with questionable longevity in its demand

Take your pick.

Sure, in a pure rational economic sense the market price has risen because supply has fallen at the same time a new market of buyers became very interested in the product, but we're talking consumer expectations and historical trends here, not the current price in a vacuum.


A 16" macbook with an M1 only uses around 100w and that's when maxing all the things. It runs at about 40w for CPU and 60w for GPU. Based on those numbers, 120w seems totally expected for two chips at the same frequency.


> look suspicious since the GPU graph tops out at ~120W & the CPU graph tops out at ~60W yet the M1 Studio is rated for 370W continuous power draw.

Interested to know what you think a reasonable PSU would be for a machine that was consuming close to 200W for processing...


Gigabyte and Strix 3090s are routinely in stock at Newegg at MSRP. The shortage is over.


3090 is still 2400 EUR in Germany, pretty sure that's not MSRP


Cursory look gives you a ~$3500 price tag for a gaming PC with a 3090 [1], vs. at least $4k for a Mac Studio with an M1 Ultra. Roughly the same ballpark, but I wouldn't call the M1 Ultra more affordable given those numbers.

1. https://techguided.com/best-rtx-3090-gaming-pc/#:~:text=With....


> Cursory look gives you a ~$3500 price tag for a gaming PC with a 3090

That 3500 is for a DIY build. So, sure, you can always save on labor and hassle, but prebuilt 3090 rigs commonly cost over 4k. And if you don't want to buy from Amazon because of their notorious history of mixing components from different suppliers and reselling used returns, oof, good luck even getting one.


You mean I get to save AND have fun building my own PC?


Not to mention if you build your own PC you can upgrade the parts as and when, unlike with the new Mac where you'll eventually just be replacing the whole thing.


I believed that until I realized I couldn't individually upgrade my CPU or RAM because I have a mobo with LGA1150 socket and only supports DDR3 (and it's only 6 years old).

So eventually you still have to "replace everything" to upgrade a PC.


You were unlucky to buy ddr3 near its end of life then (like someone buying ddr4 now), but you could still upgrade stuff like your GPU or drives independently. My first SSD (a 240gb Samsung 840) is still in service after 9 years with its smart metrics indicating only 50% of its expected lifetime cycles have been used, for example.

You could also put a 4790k, 16gb of ddr3 and a modern gpu in that system to get a perfectly functional gaming system that will do most titles on 1080p high. Though admittedly we've passed the point where that's financially sensible vs upgrading to a 12400 or something as both devil's canyon CPUs and ddr3 are climbing back up in price as supplies diminish


Right now there are not many DDR5 boards. In fact, none for AMD.


> I believed that until I realized I couldn't individually upgrade my CPU or RAM because I have a mobo with LGA1150 socket and only supports DDR3 (and it's only 6 years old).

DDR4 was released in 2014, which would suggest you purchased your mobo two full years after DDR3 was already deemed legacy technology and being phased out.

Also LGA1150 was succeeded by LGA1151 in 2015, which means you bought your mobo one full year after it was already legacy hardware.


Yes, they entered the market around those years, but what does that change? DDR3 and LGA1150 were not deemed "legacy" the day DDR4 and LGA1151 motherboards entered the market. They were 2-3x the price, and DDR3 dominated RAM sales until at least 2017. In fact, the reason DDR4 took so long to enter the market was incompatibility with existing hardware, and higher costs to upgrade. [1] I didn't go out of my way to buy "legacy hardware" because they weren't, at the time.

Point being, PC-building makes it easier to replace and repair individual components, but in time, upgrading to newer generations means spending over 50% of the original cost on motherboard, CPU, PSU, RAM. Not too different than dropping $3K on a new Mac.

[1] https://web.archive.org/web/20101219085440/http://www.xbitla...


> Yes, they entered the market around those years, but what does that change?

It means the hardware was purchased after it started to be discontinued.

It's hardly a reasonable take, and makes little sense, to complain how you can't upgrade hardware that was already being discontinued before you bought it.

> DDR3 and LGA1150 were not deemed "legacy" the day DDR4 and LGA1151 motherboards entered the market.

I googled for LGA1150 before I posted the message, and one of the first search results is a post on Linus Tech Tips dating way back to 2015 asking whether LGA1150 was already dead.

And you purchased the Mobo one year after that.


I think you are forgetting the context of my replies. I'm not saying it's unreasonable to have to upgrade discontinued hardware, even if you have to do it all at once. My take is that it's not too different from having to replace a Mac when the new generation comes in (which is usually every ~5 years for Apple, not too far from my own system's lifetime). Being able to upgrade individual parts through generations is a pipe dream.

Also, we must have a different interpretation of "discontinued", because DDR3 and LGA1150 were still produced, sold, and dominated sales long after I bought that system. At the time (and for the next 1-2 years), consumer DDR4 was a luxury component that almost no existing hardware supported.


You can still buy DDR3 new for not that much? 16GB is about $50 from numerous brands on Amazon at the moment. I bought some for an old laptop a couple months ago.

To do CPU upgrades you eventually have to replace the motherboard, but you can keep using whatever GPU/storage/other parts you have. Sometimes that also means a RAM upgrade, but it's still better than the literal nothing of modern Macs.


AMD has never disappointed me in this regard.


Zen 4 will be using a new socket; I wouldn’t go buying a Zen 3 with plans to upgrade the CPU down the road.


Well, you should still get one last upgrade out of an AM4 socket in the form of the upcoming 5800X3D ( https://www.amd.com/en/products/cpu/amd-ryzen-7-5800x3d )


This has been known for a long time already. You would have to actively choose not to listen to AMD news to not know.


I understand; just making sure no one jumps on Zen 3 now with a promise of forward compatibility.


8 years isn't a bad run for a CPU socket.


Since the context here is using these machines for work, a mid-level engineer will easily cost an extra $1000* in his own time to put that together :)

EDIT: I’m quite confident this is not at all an exaggeration. Unless you have put together PCs for a living. $100/h (total employment cost, not just salary), 1-2 hours of actual build & setup, 8 more hours of speccing out parts, buying, taking delivery, installing stuff and messing around with windows/Linux (I’ve probably spent 40 hours+ in the past couple years just fixing stuff in my windows gaming pc. At least 1 of those looking for a cabled keyboard so I could boot it up the first time, ended up having a friend drive over with his :D)


1000 bucks for 45 mins work? Maybe 1.5hrs tops? I didn't realise their wage was >500 an hour?


To be fair here, there is more to it than just assembly.

You have to spec out the parts, ensuring compatibility. Manage multiple orders and deliveries. Assemble it. Install drivers/configuration specific packages.

All of these things are easier today than ten or twenty years ago - but assigning it to a random mid-level engineer and I'd set my project management gamble on half a day for the busiest, most focused engineers least likely to take the time to fuss over specs, or one day for the majority.

ofc. to get to $1000 for that they'd still have to be on $230k to $460k.


Given that the last time I put together a PC computer was 2006, it'd probably take me DAYS to spec out a machine because of all the rabbit holes I'd be exploring, esp with all the advances in computer tech.


PC part picker will do the heavy lifting for you. There are also management tools that will let you install software bundles easily, no real extra time investment.


Just knowing about services like PC Part Picker and the management tools you mention requires time and expertise that people generally do not have before they build a computer, so "no real extra time investment" may only be true for someone who can amortize those upfront costs across many builds.

In my case I have built a couple PCs before, but it was so long ago that I'd have to re-learn which retailers are trustworthy, what the new connection standards are these days, etc. It's just not worth it to me to spend a dozen hours learning, specing, ordering, assembling, installing, configuring, etc to save a few hundred bucks.


It's a lot closer than you might think.

A senior engineer in the Bay can easily pull down $400k/year in total comp, which is $200/hour. The rule of thumb I've always heard is that a fully-loaded engineer costs roughly 2x their comp in taxes/insurance/facilities/etc.

When someone costs the company north of $3k/day, it's cheaper all round to just plonk a brand new $6k MacBook Pro on their desk if they have a hardware issue.


It would take me over 1.5 hours just to figure out what parts I need to buy.


FSVO fun if you use Newegg


FSVO: For Some Value Of

(I've been accused of overuse of acronyms, but that one's rare!)


At MicroCenter you would be hard pressed to pay more than $250 for their PC building service; you'll even get water cooling installed and tested for that price. https://www.microcenter.com/site/service/instore-custom-pc-b...


The 3090 claims are overstated. There are multiple competitors in that space, and all of them need the TDP.

Performance per watt? I could see that being disrupted, but an iGPU in 2022 will be orders of magnitude less powerful than a dGPU, if wattage is ignored.


They are still a year+ ahead of 3090 on process node. Max was about equivalent to a 2080, so 2X max does line up with a 3090. A big difference is no ray tracing hardware, which takes up a lot of die space. Same process node and no ray tracing hardware and nvidia would come in at far less die space (3090 is 628.4 mm ^2, M1 ultra is 850mm^2).

If Nvidia were on the same node and increased die space to match M1 (ignoring the CPU portion of the die size), they would then be able to run at a lower clock with more compute units and probably match the TDP discrepancy.

An iGPU isn't necessarily slower if the system ram is fast, and M1 was one of the first consumer CPUs to move to DDR5. 3090 has 936.2 GB/s with GDDR6X, M1 Ultra with DDR5 memory controllers on both dies gets 800GB/s.


Having had my M1 MacBook Pro from work freeze and stutter, I'm just not buying it. Your theory is great; I just never expected this BS in practice.

For the record: I was the first M1 recipient (temporary 16gb MB, stock issues). I needed an Intel MBP because Rosetta ain't all that. I opted for, and was upgraded to, the 32gb M1 MBP. I chose M1 over Intel because it was unbelievably faster for the form-factor. My original comment does not concern laptops. My PC is orders of magnitude more powerful.

TDP is physics. You all might perceive Apple as perfect and infallible and all so lovely, but physics is physics.

I use AMD, not NVIDIA. And "what if" is irrelevant. It's like intentionally neutering Zen2 by comparing it to Intel single-core (as was done all the time). The reality is absolute, not relative. Comparing effective performance, not per-TDP, is what matters to the user. And my network/gpu/audio drops on both my 16gb M1 MB and 32gb M1 MBP under load.

Seriously not buying that "Apple can do nothing wrong" bias.

Take those #Ff6600-colored glasses off. The M1 has unbeatable value proposition in a pretty wide market, but Apple couldn't be further from a universally good machine.


Prebuilt 3090 builds can often be found for less than the cost of the corresponding parts.


And there is usually a premium for small form factor prebuilts


I'm not being snarky, but I don't believe Mac people would know how to build a PC given their history of non-modifiable hardware and no way to repair it.


Hahaha good luck getting your hands on a 30xx series card though.

Here in Australia, 3090’s go for close to 3k on their own.


And cheapest Mac Studio with M1 ultra is A$6000 so yes....

    20-Core CPU, 48-Core GPU, 32-Core Neural Engine
    64GB unified memory
    1TB SSD storage¹
    Front: Two Thunderbolt 4 ports, one SDXC card slot
    Back: Four Thunderbolt 4 ports, two USB-A ports, one HDMI port, one 10Gb Ethernet port, one 3.5-mm headphone jack

A$6,099.00


Here in UK it's not a big deal. Subscribe to discord alerts for FE series drops, last 3090FE drop in February the cards were in stock for a full day, at RRP(£1399). I got a 3080 drop at RRP this way too(£649).

But even ignoring the FE series, the prices have already crashed massively, you can get a 3080 AIB for less than £1000, and 3090s frequently appear around £1500-1600.


I can right now (ok, in the morning actually) walk into a computer store across the road here and buy 3090 off the shelf for 2299..2499€ (different makes and models). Those are in stock and physically on the shelf. Same for lesser cards of same series or AMD RX6000.


Those are scalper prices. Anyone can get a 3090 tomorrow for that price.


Yeah, they can fuck right off with those prices. Ethereum’s proof of stake switch can’t come too soon.


I'm seeing 3080s, in stock in stores I might consider buying from, sub-1800 AUD. It is heading back towards RRP (still about 50% over I guess). 3090s are twice that, yep.


Don't know about Australia but in my area(Asia) the prices are now going back near MSRP.


I bought two on ebay no problem


We've been buying tons of 3090s at work for about 1.6k-2k USD without too much trouble


> tons


Hey, at 5lbs each a ton is only 400 cards!


You also need to compare the right CPU. The M1 Ultra CPU is the equivalent of the fastest Threadripper, which costs $3990. So a PC with similar performance would be $7500.


Not the top-of-the-line threadripper (which can go up to 32c/64t), but probably similar to the 5950x (16c/32t), which costs like 1000$.

But you’re comparing apples to oranges, because the real advantage of M1 chips is the unified memory - almost no CPU-GPU communication overhead, and that the GPU can use ginormous amounts of memory.


A threadripper also has many more PCIe lanes than a Ryzen. It's a bit of a different usecase I think although there's overlap.


I can't put a 3090 into a 3.5 liter case. Even a 10 liter case is really pushing it. That's before mentioning power savings. 3090 real world power when in use is something like 4x as high.


They are also absolutely massive and probably much more expensive long-term because of the massively increased electricity usage.


Unless you're running a farm of these, the power cost difference is going to be largely unnoticeable. Even in a country with very expensive power, you're talking a ~$0.10 per hour premium to have a 3090 at full bore. And that's assuming the M1 Ultra manages to achieve the same performance as the 3090, which is going to be extremely workload dependent going off of the existing M1 GPU results.


It shocks me how much payroll and cap-ex is spent on the M1 and how little is invested in getting TensorFlow/Pytorch to work on it. I could 10x my M1 purchases for our business if we could reliably run TensorFlow on it. Seems pretty shortsighted.

The GPU claims wouldnt even need to be on parity with NVIDIA, it would just need to offer a vertically integrated alternative to having to use EC2.


Having beaten my head on this for a while (and shipped the first reasonably complete ML framework that runs on Metal) Apple's opinion as expressed by their priorities is that it's just not important.


> reliably run TensorFlow

What reliability issues are you having with TensorFlow on M1 Macs?


We've followed five different instructional and documentation pages to make it happen and none seem to consistently install. Throw in a corporate system where you need IT for root access to make changes, and it is game over. So I've got a fully loaded M1 Max and can't get TF running on it.

Now I've got a team of data scientists in a fully-MBP shop and we're holding off upgrades to M1 until this all gets resolved.

On my personal M1, I managed to make it work, but it's hard to know the layers of changes made and what exactly allowed it to work.
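
For anyone debugging the same thing: the combination Apple documents is the tensorflow-macos package plus the tensorflow-metal PluggableDevice plugin, and the quickest sanity check for whether the GPU plugin actually registered is something like this (just a check, not a fix for the install headaches):

    import tensorflow as tf

    print(tf.__version__)
    # If tensorflow-metal registered correctly, a GPU device shows up here;
    # if the list is empty, training silently runs on the CPU.
    print(tf.config.list_physical_devices("GPU"))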


You can get off this GPU circus and simply go with purpose-built AI solutions.

You can buy single tensor accelerators from Google: https://www.coral.ai/products/

You can buy a bunch of those integrated into a single PCI-E card. https://iot.asus.com/products/AI-accelerator/AI-Accelerator-...

Cheap too. Some of these work with Mac. More of them work for PC, because the hardware interface is outside of Apple's thin vertical slice/garden.
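
For reference, driving one of those Coral accelerators from Python looks roughly like this (tflite_runtime plus the Edge TPU delegate; the model path is a placeholder and the delegate library name varies by OS, e.g. libedgetpu.so.1 on Linux):

    import numpy as np
    import tflite_runtime.interpreter as tflite

    # Load a model compiled for the Edge TPU and attach the delegate.
    interpreter = tflite.Interpreter(
        model_path="model_edgetpu.tflite",
        experimental_delegates=[tflite.load_delegate("libedgetpu.so.1")],
    )
    interpreter.allocate_tensors()

    inp = interpreter.get_input_details()[0]
    out = interpreter.get_output_details()[0]

    interpreter.set_tensor(inp["index"], np.zeros(inp["shape"], dtype=inp["dtype"]))
    interpreter.invoke()
    print(interpreter.get_tensor(out["index"]))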


These are devices for TensorFlow Lite, which is more appropriate for IoT etc., not for the intensive initial training of a complex model.


Could be worth tracking what you did, making a new set of instructions, and trying to reproduce with a fresh install.


This is something Apple should pay people for.


Deep learning support for Mac is not going to happen at a level of quality you can rely on for research & dev work (like PyTorch + TensorFlow). The underlying problem is no big company cares about Mac platform and the work to maintain framework support for a specific piece of hardware is way beyond a hobby project. If you want your own on-prem hardware just buy Nvidia.


Neural Engine cores are not accessible for third party developers, so it'll be severely constrained for practical purposes. Currently the M1 Max is no match for even last generation mid-tier Nvidia GPU.


They are accessible to third party developers, only they have to use CoreML.


xD


Huh? Neural engine is certainly usable by developers. You just use the CoreML framework.


Apple loves to compare incomparable stuff. The G5, "world's fastest personal computer", etc. It's easy to claim GPU performance when you don't support modern OpenGL or Vulkan, so nobody can just run modern games and verify, and you end up with a "relative performance" graph, whatever that means.


I'm very skeptical of this because until the CUDA stranglehold is gone, it will be a pain to develop on. Even if the frameworks themselves support the M1's GPU, there are still lots and lots of CUDA kernels that won't run.

I really hope I'm wrong (as someone who owns an M1 Pro chip) but I find it hard to imagine things changing significantly in the next ~2 years unless someone is able (legally and technically) to release a CUDA compatibility layer.


Here's AMD's attempt: https://github.com/ROCm-Developer-Tools/HIPIFY

Naturally, the HIP tooling doesn't support M1 GPUs at this time. We'll see if anyone else tries.


The most important detail here is 128GB of RAM for GPU computation! This makes it possible to train monster models, e.g. 1B-parameter GPT-series models, on a single M1 Ultra. This is quite unprecedented. Unfortunately, it is also about 3.5x slower than a 3090.
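
Back-of-the-envelope on why the memory headroom matters, assuming plain FP32 training with Adam and no sharding tricks:

    # Rough training-memory estimate for a 1B-parameter model.
    params = 1e9
    bytes_per_param = 4 + 4 + 8   # weights + gradients + Adam moments (m, v)

    print(f"~{params * bytes_per_param / 1e9:.0f} GB before activations")
    # ~16 GB just for model state; activations push this much higher,
    # which is where 128GB of GPU-visible memory becomes comfortable.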


The GPU claims for M1 Pro and M1 Max were wildly above their actual performance in real life (as opposed to CPU performance) so maybe don't put all that much faith in Apple marketing here either.


Note the label on the y-axis. "relative performance" from "0-200" seems like marketing bullshit to me.

"M1 Ultra has a 64-core GPU, delivering faster performance than the highest-end PC GPU available, while using 200 fewer watts of power."

Note that they say "faster performance" not "more performance". What does "faster" mean? Who knows!


I have heard this argument before, if it’s identical workloads you get faster output but the same total work. Thus “faster performance” seems correct for fixed workloads and “more performance” is correct on games or benchmarks where you get more FPS.

I still think “faster performance” sounds odd, but I understand their point.


Anyway, any first-party benchmark should be taken with a mountain of salt.


I always take such claims with a grain of salt anyway. It's usually based on one specific benchmark. I always wait for better benchmarks instead of trusting the marketing.


Even if their claims are accurate, it usually has the asterisk of *Only with Apple Metal 2. I honestly cannot understand why Apple decided they needed to write their own graphics API when the rest of the world is working hard to get away from the biggest proprietary graphics API.


Because of vendor lock-in and full control over their APIs, which has always been an Apple staple, but especially now.

5-10 years ago they were still serious about open standards, like OpenCL.. Now it's all locked in.


I'd like to note that Microsoft is doing no better with DirectX. If it weren't for the drivers on Windows being distributed and supported by the GPU manufacturers themselves, Vulkan would only be a thing for Linux (incl. Android) and custom niche devices now.


Because when they wrote the API no real standard existed to suit their needs.


Yes, and instead of writing an open standard or adapting Vulkan when it came around, they instead decided to double down on proprietary and moved to Metal 2.


I mean, Metal is a good API and they designed it first. They could've decided to abandon it when Vulkan came out, but I guess they just didn't want to make their API worse. Not ideal, yes, but I don't really blame them for the decision.


I know Nvidia has never really cared much about TDP, but this still seems unbelievable to me. How could a relatively new design beat a 3090 with 200W less power, while having to share a die with a CPU? It just doesn't seem possible.


Unless the M1 Ultra is actually magic I don't think it is possible.

My guess is they're putting a lot of weight on the phrase "relative power" and hoping you assume it means "relative to each other" and not "relative to their previous generation" (i.e. M1 Ultra -> M1 Max and RTX 3090 -> RTX 2080Ti) or "relative to the stock power profile".

Put bluntly, if the M1 Ultra was capable of achieving performance parity with an RTX 3090 for any GPU-style benchmark then Nvidia (who are experts in making GPUs) would have captured this additional performance. Bear in mind the claim seems to be (on the surface) that the M1 Ultra is achieving with 64 GPU cores and 800GB/s memory bandwidth what the RTX 3090 is achieving with 10,496 GPU cores and 936.2GB/s memory bandwidth.


It's actually kinda magic but in the opposite way. The M1 Ultra is absolutely massive: 114 billion transistors. The 3090? A measly 28 billion. Now granted, the M1 also has some CPU cores on there, but even so, it seems safe to say that the M1 Ultra has more transistors spent on the GPU than a 3090 does. More transistors often does mean more speed when it comes to GPUs.

But you'll all but certainly see the 3090 win more benchmarks (and by a landslide) than the M1 Ultra does. Because Nvidia is really, really fucking good at this, and they spend an absurd amount of money working with external projects to fix their stuff. Like contributing a TensorFlow backend for CUDA. Or tons of optimizations in the driver to handle game-specific issues.

Meanwhile Apple is mostly in the camp of "well we built Metal 2, what's taking y'all so long to port it to our tiny-marketshare platform that historically had terrible GPU drivers?"


Given that things like the AMD Epyc chips sit at around 40 billion transistors, I would assume a (roughly) even split.

It is also worth noting that the M1 Ultra is an SoC so it'll have more than just CPU/GPU on it, by the looks of things it has some hefty amounts of cache, it'll also have a few IP blocks like a PCIe controller, memory controller, SSD controller (the current "SSDs" look to just be raw storage modules).

All told it likely still has somewhere in the region of 30-40 billion transistors for the GPU. Each GPU core being physically bigger than a 3090 core is probably pretty good for some workflows and not so good for others. Generally GPUs benefit from having a huge number of tiny cores for processing in parallel, rather than a small number of massive cores.

Current benchmarks put it at roughly the performance of an RTX 3070, which is good for its power consumption, but not even close to the 3090. As I mentioned in the previous post, it just doesn't have the cores or memory bandwidth needed for the types of workloads that GPUs are built for (although unified memory being physically closer can help here ofc.), certainly not enough to make it a competitor for something like a 3090.

Edit: Oh also, for massively parallel workloads (like what GPUs do), more cores and better bandwidth to feed those cores will be one of the biggest performance drivers. You can get more performance by making those cores bigger (and therefore faster) but you need to crank the transistor count up a _lot_ to match the kinds of throughput that many tiny cores can do.


For some definitions of "affordable."


And hopefully not make you deaf with their buzzing fans


Doesn't "electricity bill" dominate server/DL/nining datacenter workload costs these days - so perf/W is what really counts? The long-prophecised ARM apocalypse may finally be at hand.


GPU claims at carefully chosen tasks that don't need GDDR6 to get top performance. It wouldn't game like a 3090 for instance, but then not many people are gaming on macs anyway.


This is insane. They claim that its GPU performance tops the RTX 3090, while using 200W less power. I happen to have this GPU in my PC, and not only does it cost over $3,000, but it's also very power-hungry and loud.

Currently, you need this kind of GPU performance for high-resolution VR gaming at 90 fps, but it's just barely enough. This means that the GPU will run very loudly and heat up the room, and running games like HL Alyx on max settings is still not possible.

It seems that Apple might be the only company who can deliver a proper VR experience. I can't wait to see what they've been cooking up.


Ehh, I wouldn't put too much stock in graphs of "relative performance" on a 0-200 scale like these. Marketing can cook up whatever they like when they want to make a product look good. Wait for actual benchmarks before trying to judge the product. Their base claim is >2x the M1 Max, which is still nowhere close to the performance of a 3090.

Apple's footnotes don't even pretend to explain what these charts are.

> Performance was measured using select industry‑standard benchmarks.


Remember last time Apple put out these weird relative performance charts, and we all thought they were hiding something?

The M1 announcement. They turned out to be pretty accurate.

So I’ll wait and see real benchmarks, but it wouldn’t surprise me if this does have incredible performance.


Their GPU claims weren’t accurate though. The M1 Max doesn’t have real-world performance anything like the 3070, as claimed.


On raw Metal performance, yes it does get the same performance. The benchmarks showed this.

The problem was the implication that you'd get 3070 gaming performance. That was never going to be true because of the un-optimisation tax for games on Mac.

There doesn't exist an AAA game built for Metal and the Mac. The closest are games like World of Warcraft and Divinity: Original Sin 2 – and even they are just "good ports," not originally designed for the Mac (and are far from AAA graphics). This is why, on Intel Macs, games under Boot Camp always ran 30%-50% faster, even though the hardware was the same.

Games on M1 Max run as you'd expect – about 30% slower than a 3070 for the same old reasons (and some new ones, like not being compiled for Apple Silicon at all). The GPU is about the same speed as a 3070 and it's doing what you'd expect, given the 30% unoptimization-tax workload.


Pushing all this to an "un-optimization tax" is an easy pass on Apple.

- Nvidia really is a software company; it's the running joke in the industry. When you buy an Nvidia GPU, you pay for the drivers & the frameworks (CUDA, DLSS, OptiX, ...). Apple does close to nothing there; they support Metal and CoreML and call it a day. You can decently lay some of the blame at their feet.

- The workloads in games can vary a lot: vertex/fragment shader imbalance, parallel compute pipelines, mixed precision (which the M1 GPU does not do), etc. So another explanation is that you can get 3070 parity on a cherry-picked game, like a broken clock being right twice a day, but that does not make it generally true. Objective benchmarks have put the M1 GPUs way slower than a 3070 on average, and software support seems like an easy but false distraction given the Proton tax on Linux (which is not 30-50%).

- The M1 GPUs are lacking a ton of hardware: matrix multiply, fp16 again, ray tracing, probably VRR (not sure about this last one). These are used by modern games and applications; you may find a benchmark which skips them, but in the grand scheme of things they are something the M1 GPU will have to emulate more often than not, and this has a cost.

Waving all that away as "the GPU is about the same speed" is technically wrong, or not really backed by facts at the very least.


I know that Baldur’s Gate 3 looks pretty frigging awesome on my M1 Max MBP; it has a native ARM binary, so no need for Rosetta, and it uses Metal 2.


For what it's worth, they were comparing the M1 Max to Nvidia's mobile 3070, which is a completely different card than the desktop 3070.

My understanding was that it was a reasonable comparison in some benchmarks, though when plugged in to power the mobile 3070 still had more headroom.


_Some_ of the claims were pretty accurate.

A lot of them, particularly the GPU benchmarks, were misleading because they only looked at performance that they had dedicated silicon for.


What we're seeing right now with Apple's M1 chips for desktop computing is on the same level of revolutionary as what the original iPhone did for mobile phones.

The advancements in such a short period of time in computing power, low power usage, size, and heat output of these chips are unbelievable and game-changing.


Don't get me wrong, the benefits of using ARM architectures for general purpose vs. x86 are really compelling.

The performance of x86 has been a leader for a while mostly because of the sheer amount of optimization work that has gone into it, but the cruft of the x86 instruction set and the architectural work you have to do to make the instruction set perform is really showing its age.

That being said, the GPU performance claims are incredibly misleading. The previous "relative performance" benchmarks that were done on the M1 Max for GPU performance were misleading as well, they definitely cannot keep up with a mid-tier modern discrete GPU.

The GPU claim isn't an ARM/x86 comparison like the CPU performance would be. This is comparing a 64 core 800GB/s GPU with a 10k core 900GB/s GPU and trying to make them look equivalent through misleading marketing.

None of this is to say that the M1 Ultra is bad necessarily, even if it performs roughly the same as a mobile GPU or powerful iGPU it would still be a very good chip, and I'd love to use one if I could use it in my environment properly. I'm just saying don't put too much faith in the GPU performance measurements provided here.


> The performance of x86 has been a leader for a while mostly because of the sheer amount of optimization work that has gone into them

Without denying some good work and engineering having gone into some x86 chips, they are not the reason x86 became a leader. The duopoly of Intel and Microsoft – coupled with Intel's aggressive strategy of undercutting competitors on pricing and the sheer production volume they could quickly ramp up – squeezed every other viable competitor out of the market, relegated the very few remaining to niche players (e.g. POWER), and entrenched the duopoly as an unfortunate leader. And then complacency and arrogance set in for years to come, until recently.


GP was talking about performance not marketshare.


It really feels like this is all in the name of their AR/VR efforts. The killer device as far as I can think would be a simple headset that packs the capabilities of full-blown workstations. Apple Silicon seems like it could totally be on that track in some way.


I don't think so. I think it stemmed from Apple's desire to make their SoC fully in-house for their iPhones. They realized they were onto something, and took it to a whole new level by focusing on desktop-class performance.

This was probably further fueled by their soured relationship with Intel, which was responsible for thermal issues on MBPs for years, poor performance increases across their entire Mac lineup, and poor cellular radio performance on some iPhone models -- forcing them to settle with Qualcomm and ditch Intel for mobile radios.


True, but I can only imagine the cost of that headset. It will surely be for the 1% of the 1% :'(


Why would you believe that it tops the RTX 3090? It's just 2 of their prior chips in the same package? This is purely marketing nonsense with nothing to back it up.

At least in the US, you can get a 3090 for $2,200 even with markup, and an almost-as-good 3080 for $1,150. If you wait until it's in stock at a big-box store you can get one for less. A machine could be built with a 3080 for $2,000.

Meanwhile a system based on the 64-core GPU will run you $5,000 and as such is affordable to nearly nobody, so few will get a chance to see it drastically underperform in the gaming arena on any of the games that don't support Mac on ARM.

With an essentially invisible market share in desktop gaming, there will never be any incentive for anyone to change this as far as direct support goes, leaving you reliant on translation from x86 and from Windows executables, paying doubly in compatibility and performance from an already very expensive and lackluster starting point.


Too bad that historically Apple has not given any attention to Mac gaming.


Part of Apple's historic MO has been to not invest in areas they don't see themselves having a competitive advantage in. Now that they can make gaming happen with a very low wattage budget they may well try to enter that space in earnest.


The difference between an Apple TV and a Mac Mini is essentially how powerful of Apple silicon it has, whether it runs tvOS or macOS, and whether it has HDMI out or not.

The Studio is a more compact form factor than any modern 4K gaming console. If they chose to ship something in that form factor with tvOS, HDMI, and an M1 Max/Ultra, it would be a very competitive console on the market — if game developers could be persuaded to implement for it.

How would it compare to the Xbox Series X and PS5? That’s a comparison I expect to see someday at WWDC, once they’re ready. And once a game is ported to Metal on any Apple silicon OS, it’s a simple exercise to port it to all the rest; macOS, tvOS, ipadOS, and (someday, presumably) vrOS.

Is today’s announcement enough to compel large developers like EA and Bungie to port their games to Metal? I don’t know. But Apple has two advantages with their hardware that Windows can’t counter: the ability to boot into a signed/sealed OS (including macOS!), load a signed/sealed app, attest this cryptographically to a server, and lock out other programs from reading a game’s memory or display. This would end software-only online cheating in a way that PCs can’t compete with today. It would also reduce the number of GPU targets to support to one, Apple Metal 2, which drastically decreases the complexity of testing and deploying game code.

I look forward to Apple deciding to play ball with gaming someday.


This all makes sense, and in that context it’s unfortunate that Apple’s relationship with the largest game tools company, Epic, is... strained, to say the least.

They could always choose to remedy that with a generous buyout offer.


Won't there be a Pluton-based anti-cheating solution? Seems like a natural opportunity for Microsoft.

Edit: https://arstechnica.com/information-technology/2022/01/pluto... says

> Microsoft already used Pluton to secure Xbox Ones and Azure Sphere microcontrollers against attacks that involve people with physical access opening device cases and performing hardware hacks that bypass security protections. Such hacks are usually carried out by device owners who want to run unauthorized games or programs for cheating.

So initially you could have Pluton-only servers and down the line non-Pluton hardware will simply be obsolete.


Yep. On the plus side, anything with Apple silicon or a T2 chip has this available today already in macOS, so that's every shipping Mac starting in what looks like 2018: https://support.apple.com/en-us/HT208862

They won't have the Ultra GPU, but Apple's been shipping for years and Microsoft is just now bringing Pluton to market. I do wish them luck, but that's a lot of PC gamer hardware to deprecate.


Seems unlikely that the general population would spend 2-4k on a console.


If you could spend 2-4k on a special playstation or xbox with double/triple/quadruple graphics capability, it would sell. The games will work fine on the cheapest m1 mac mini, not everyone and every game will need max settings on 4k at 144hz to be a great experience.


Well, only the macOS users would be spending a thousand dollars or more for their console-capable Macs, which are general purpose computers with absurd amounts of memory. TV users could spend a lot less for an Apple TV 4K with M1 inside, assuming Apple released it with less of this or that.


The Apple TV (A12 chip) is currently $199. The new iPad Air with the M1 chip costs $599.

They also don't need the display, camera, microphone. And could sell it at a loss and make the margins with TV+ and game sales.

But they would need their own bundled controller/accessories and get serious about AAA gaming.


Apple has never sold a hardware product at a loss. Never.


Don’t gamers spend tons of money on gaming PCs?

Also, might be cheaper a couple years down the line.


PC gamers do. Console gamers don't. And there are a lot more of the latter than the former.


> I look forward to Apple deciding to play ball with gaming someday.

I wish.

But playing ball is more than hardware. It is spending billions to buy Activision or Bungie. And I can't honestly imagine Apple having the cultural DNA or leader aspiration to make a game like The Last of Us where the player is brutally beating zombies to bloody clumps.

In video games the business side demands having exclusives, or timed exclusives, sponsoring Twitch streamers playing your game, and cutting special deals with studios. This is very different from the App Store, where Apple emphasizes their role as a neutral arbiter and every dev having the same deal as any other dev. Can you imagine the complaints here on HN if Epic Games got a special deal just because they are a bigger fish and Fortnite is popular?


A recent keynote compared an Apple series chip to an Xbox One S, can’t remember which though.


Pippin atmark again!


The gaming performance of the existing M1 GPUs is, well, crap (like far behind the other laptop competition, to say nothing of desktop GPUs). The Ultra probably isn't changing that, since it's very unlikely to be a hardware problem and instead a software ecosystem & incentives problem.


I don’t think the performance is that bad. On a whim I tried running WoW on a 16” M1 Pro MBP and it consistently got FPS higher than the refresh rate (120hz) with the game rendering at 1x scale and most effects maxed out. Granted, that’s not as good as what you’d get with a mobile RTX 3080 or something, but it’s nothing to sneeze at for a laptop that doesn’t get scorching hot and doesn’t sound like a leaf blower when being pushed.

I could definitely see the Max and Ultra with a beefier cooling system (like the Studio’s) having pretty respectable performance.


https://www.anandtech.com/show/17024/apple-m1-max-performanc...

M1 Max struggles to keep up with an RTX 3060 mobile.

Now that's with the overhead of Rosetta 2 and all that, so it's of course "not fair" for the M1. But that's also the current reality of the market, so ya know.


WoW isn't exactly the pinnacle of demanding performance. It's like saying you can run counter strike at 200FPS: congrats, so can everyone else, without paying $2000


Yes, it's "crap" not because GPU is slow but rather that most triple A title games are optimized for Windows. So it's difficult to expect a good performance from a game that runs with API conversion layer(s) (Windows -> Mac) and then CPU emulation (x86 -> arm).

So publishers/developers need to make more native games. Even though every Mac port will probably make 1/10th the revenue of a Windows title, I guess Mac users would be happy to pay more for better games. I certainly would.


It is a video memory problem. Unified memory is nice, but you need GDDR6 to feed a powerful GPU, and you don't want GDDR6 for the CPU.


$4000-$6000 is a toy for a tiny number of rich people or a work machine for a well paid professional.

In the PC space the average spend is $800 and a PS5 is $500.


This has always been very strange to me. Apple chips used in smartphones were very consistently near the top of the pack even in terms of GPU performance, and unlike their Android counterparts, they rarely throttled and could deliver said perf with decent battery life.

Yet the iOS 'gaming' scene, despite being one of the major revenue drivers, consists mostly of low-quality F2P games.


Every goddamn child already has an iPad to keep them entertained, and the hardware is plenty good enough. Going one step further up the age ranges to capture big game revenue, like the Switch does, will help them capture more.


Or maybe Jobs was just salty about the Halo buyout?

In any case, it's a moot point. Apple clearly doesn't care about desktop gaming and it shows in both their hardware and software.


Apple now has too much money and is running out of core business areas. Expect more investment in non-Apple areas like gaming, cars, etc.

Though every video game company on the planet hates them because of App Store terms.


"every video game company" = Epic Games, and mostly because they don't like to pay overhead for their Fortnite loot boxes that most parents wish they could have some control over. I don't have allegiance to either company mostly because they don't care about me, just my money.


> I don't have allegiance to either company mostly because they don't care about me, just my money.

This criticism is something that is a positive to me. Competing companies are often dependent on advertising money, and the things that leads to are a whole lot worse in my view.


Epic doesn’t do lootboxes, next.


In the spirit of his argument, they do use time-gated content and other methods to psychologically exploit their users into buying things.


They used to? At least Rocket League had them for a long time. Unsure how long after Epic purchased it they had them though.


> Expect more investing in non-Apple areas like gaming, cars, etc.

I remember people saying this about phones in 2006.


The first half of the event was old wine in new bottles - I reckon that’s the main growth area they are squeezing.


Or was it last seasons wine in old bottles?


The "new" display is an 8 year monitor whose updates were a better webcam & integrated speakers, and they're charging more for it than ever before. More like rotten wine in old bottles.


Except when Bungie was going to release Halo on the Mac[0] and Microsoft swooped in, bought them, and made it an Xbox thing.

[0] https://www.youtube.com/watch?v=Tzrme9yWens


The Marathon days were really great. That jump was a huge loss to me, even if it wasn’t to Apple.


Once they are confident of their graphics advantage, I think they will enter the console market against Microsoft and Sony with an Apple TV console.


Apple is the world's biggest and most profitable gaming company. For every AAA gamer, there are a hundred casual gamers (one reason why Nintendo consoles run circles around Sony and Microsoft).

Apple has invested billions into their gaming division. The big thing they need right now is a new version of Metal that gets feature parity with Vulkan or DX.

Also of note, there are very persistent rumors of an upcoming VR headset. Their M1 alone would blow away competition like the Quest. A Pro or Max chip with some disabled CPU cores wouldn't cost a ton, since it would use salvaged dies, and would positively stomp the competition.


Where developer productivity is concerned, it is Vulkan that needs to get feature parity with Metal.


HL Alyx is actually quite well optimized, and you can definitely run it on Ultra with super-sampling on a 3090.

[1] https://www.youtube.com/watch?v=kjNaC0-hiPE


I didn't see the display resolution (or the model of HMD) mentioned in the video. I'm using a Varjo Aero, which has 2880x2720 per eye, roughly 3.4x the pixels (per eye) of a Valve Index. I think this resolution is enough for a good VR experience; pixels are almost invisible, and even small text is readable. However, HL Alyx doesn't run at 90 fps at full resolution.


Uh, Varjo Aero is a clear outlier in terms of resolution and definitely does not represent the current typical VR headset experience (which in early 2022 would be around Quest 2 or Valve Index). If you went out of your way to get an expensive niche high-end headset, it struggling on a 3090 is your problem, not HL:A's.


I'd say that it's just a couple of years ahead of the mainstream curve. My thinking is that Apple probably wants to make a very high resolution VR headset (similar or close to the Varjo Aero), and it seems that with their new chips, they might be able to pull it off.


How is the foveated rendering? Is it noticeable?


There's no software support for foveated rendering in any games yet, afaik. It works in desktop mode and it's fast enough to be unnoticeable.


The author of that video is using the valve index, which is 1440×1600 per eye


That HMD looks pretty badass. Didn't realize hardware support for eye tracking/foveation already existed. Does any software support it? That would fix your perf issue.


Yes, that would probably fix it, but there's no software support yet in any games, afaik. It works only in desktop mode.


But it's still impossible to replace an RTX 3090 with this new Mac Studio, because games just will not run on macOS.


Maybe Valve can port proton to mac


That would not help. The reason Proton works so well is that Linux on x86 and Windows on x86... are both on x86.

Proton on ARM Macs would involve Rosetta, and while that does a surprisingly good job of running x86 on ARM, I'm not sure it's up to the job of running games at high speed...


Games seem playable running through Microsoft’s inferior x86 compatibility layer in a Windows ARM VM on M1 Pro/Max, so I don’t see how running games through Rosetta would be any worse.


If you have a monster like the 3090, you expect it to run games smoothly on “ultra” settings. In reality, some games push its limits even on “medium” settings (Cyberpunk 2077 @ 2K, RDR2 @ 4K). Any emulation layer will hurt performance, so if you want top-of-the-line performance you still need a Windows PC with an RTX 3090. And it won't even be more expensive than a Mac Studio ;)


I want my game to vsync with my monitor. That is all I care about.


Your 4k monitor?


I have a 5k monitor, but yes. That is literally all I care about.


Congratulations, no video card in the world is able to keep up at 5K resolution at 60 FPS while doing even slightly demanding things. Nor will the M1 Ultra be able to.


My 3090 can drive 4k100hz for most demanding games at their highest settings.

Some games are a stable 4k120 and others are more like 4k75.

I feel the 3090 could feasibly drive 5k60 as a result.


They are not the most demanding games, that's why :)


Real-world example on an M1 Max:

GTAV on a Win11 ARM VM - okay, but not great.

GTAV on CrossOver - much, much better; lack of joystick support (but that's a CrossOver issue).


They used to have a Proton port for Mac, and discontinued it.


The 3090 should have no problem playing all but the most poorly optimized VR games at max or near-max settings. Often times the difference between "very high" and "ultra" is indistinguishable btw, with "ultra" just being shit that wasn't well optimized to begin with.


The relative graphics claims are particularly dubious. The M1 Ultra has about 15% less memory throughput than a 3090. I somehow doubt that Nvidia has left performance on the table that Apple is picking up, even with the power savings of being on a smaller node. On the balance of things this just seems wrong. The intuitive range of performance relative to a 3090 in flat-out single-precision vector workloads should top out at 90%, but more likely 75% or less.


The problem is, last time I checked, you're not able to really make use of it for gaming. Obviously macOS doesn't run all that many games compared to Windows and now Linux (with Proton, or to be honest quite possibly even without), and something like Parallels can't take advantage of the M1's power.

Not to say it'll never happen, but it's not a done deal basically, and to my knowledge the process hasn't yet started.


> you need this kind of GPU performance for high resolution VR gaming at 90 fps, but its just barely enough

I run VR games on the index at 144hz with high settings without issue on a 3060 Ti.

I've been on the market for a 3070-3090, but only because I want a card for which a water block is available, not because I need more power for any extant game.


I play Rust (the game, not the language) with decent graphics on an M1 Air with no issues other than heat, which an external fan quickly mitigates.

Really looking forward to Apple's VR offering after seeing the performance of their compact SoC


But Rust runs even on intel integrated graphics without any problem (HD4000)


You're right, it is insane, but this is Apple.

So expect that what that graph actually means is some extremely specific, cherry-picked benchmarks.


After their comparison claims on the M1 Pro/Max and Nvidia GPUs, I would take these comparisons with a huge grain of salt.


I wonder how it compares for crypto mining, and if that group would be buying these up


Poorly compared to an ASIC which has the SHA-256 algorithm encoded as hardware circuits. Maybe better than GPUs when it comes to mining Ethereum or others where ASICs aren't as prevalent.
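For a ballpark on how lopsided that is, here's a rough single-core sketch using Python's hashlib (the ASIC figure in the comment is an approximate number for a modern SHA-256 miner, and the measured rate will of course vary by machine):

    import hashlib, os, time

    header = os.urandom(80)  # a Bitcoin block header is 80 bytes
    n = 200_000
    start = time.perf_counter()
    for _ in range(n):
        # Bitcoin mining hashes the header with double SHA-256
        hashlib.sha256(hashlib.sha256(header).digest()).digest()
    elapsed = time.perf_counter() - start
    print(f"~{n / elapsed / 1e6:.2f} MH/s on one CPU core")
    # A modern SHA-256 ASIC does on the order of 100 TH/s,
    # i.e. roughly 8 orders of magnitude more than this.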


If you're mining crypto you want 95% of your capex to go to graphics cards/ASICs, not to RAM+NICs+CPU+case+etc.


if it isn't marketing straight from the Apple HQ I will be wrong


unless it has its own gddr6 it won’t be anything like a 3090 for games


Is it actually more powerful than a top of the line threadripper[0] or is that not a "personal computer" CPU by this definition? I feel like 64 cores would beat 20 on some workloads even if the 20 were way faster in single core performance.

[0]https://www.amd.com/en/products/cpu/amd-ryzen-threadripper-3...


My workstation has a 3990x.

Our "world" build is slightly faster on my M1 Max.

https://twitter.com/kiratpandya/status/1457438725680480257

The 3990x runs a bit faster on the initial compile stage but the linking is single threaded and the M1 Max catches up at that point. I expect the M1 Ultra to crush the 3990x on compile time.


Are both cases compiling to the same target architecture? If not, you may well be comparing the relative performance of different compiler backends instead of comparing the performance of your CPUs.

(+ now I see it's rust: how parallel is your build, really?)


> now I see it's rust: how parallel is your build, really?

Not the OP but I install a lot of Rust projects with Cargo and recently did some benchmarking on DigitalOcean's compute-optimized VMs. Going from 8 cores to 32 cores was a little disappointing:

Bat (~40 crates): 68s -> 61s

Nushell (486 crates): 157s -> 106s

Compilation starts out highly parallel and then quickly drops down to a small number of cores.
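Backing out the parallel fraction with Amdahl's law from those numbers (rough; it assumes the only change between runs is the core count):

    # Amdahl's law: speedup = 1 / ((1 - p) + p / k), where p is the fraction
    # of the 8-core runtime that scales with cores and k is the core multiplier.
    def parallel_fraction(t_old, t_new, k):
        speedup = t_old / t_new
        return (1 - 1 / speedup) / (1 - 1 / k)

    # 8 -> 32 cores is k = 4
    print(f"Bat:     ~{parallel_fraction(68, 61, 4):.0%} of runtime still parallel beyond 8 cores")
    print(f"Nushell: ~{parallel_fraction(157, 106, 4):.0%} of runtime still parallel beyond 8 cores")

That works out to roughly 14% for Bat and 43% for Nushell, which matches the "drops down to a small number of cores" observation.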


The target arch doesn’t matter for the change-build-test loop devs do. All that matters is how fast you can compile your code to test it.

If the final x86 production build takes longer it doesn’t matter - that happens on the cloud anyway.

Edit: Rust builds are very parallel until linking. No different than any other LLVM build.


> The target arch doesn’t matter for the change-build-test loop d vs do.

It matters when comparing CPU performance, which is what this benchmark is being used for.


That the ARM backend is faster to compile for is a legit advantage for the developer though, even if it's distinct from CPU perf.


> The 3990x runs a bit faster on the initial compile stage but the linking is single threaded and the M1 Max catches up at that point.

Isn't linking IO-bound?


https://github.com/rui314/mold would suggest otherwise. Massive speedups by multithreading the linker. I think traditional linkers just aren't highly optimised.


> https://github.com/rui314/mold would suggest otherwise.

Does it, though?

I mean, if you read that link you'll notice it boasts the linker's performance by comparing it with cp and how it's "so fast that it is only 2x slower than cp on the same machine."

Is cp supposed to be CPU-bound?


Yes, but that's for mold, which is multithreaded. The original context of this thread being the question of whether a linker would see speedups from multithreading. Most people are using traditional single-threaded linkers which are an order of magnitude slower than mold. The fact that mold is so much faster suggests that a linker does indeed see big speedups from multithreading.


The data the linker is running on will generally largely be in memory (in their posted example, which is a warm compile, completely in memory).


Exposing my limited understanding of that level of the computing stack - it is but Apple seems to have very very good caching strategies - filesystem and L1/2/3.

https://llvm.org/devmtg/2017-10/slides/Ueyama-lld.pdf

There is a breakdown in those slides discussing what parts of lld are single threaded and hard to parallelize so I suspect single thread performance plays a big role too. I generally observe one core pegged during linking.


> Exposing my limited understanding of that level of the computing stack - it is but Apple seems to have very very good caching strategies - filesystem and L1/2/3.

That would mean that these comparisons between Threadripper and the M1 Ultra do not reflect CPU performance but instead showcase whatever choice of SSD they've been using.


L1/2/3 are CPU caches, not SSD. Though there is a good chance these are mostly firmware optimizations, not hardware. So still not an apples-to-apples comparison of cpu design.


By firmware you mean microcode, but I don't think either of those actually use microcode to control this.


> L1/2/3 are CPU caches, not SSD.

Why did you omit the reference to "file system"?

Are we supposed to ignore the fact that a linker's main job is reading object files and writing the output to a file?

I find this sort of argument particularly comical given a very old school technique to speed up compilation is to use a RAM drive to store the build's output.


For a clean build and a reasonably specced machine, all the intermediate artifacts will still be in the cache during linking.


Do you have any benchmarks of the two to share against a mean to ensure it's built optimally? Maybe something like https://opendata.blender.org ? It's painful hearing random anecdotes only to learn the person didn't apply thermal paste to their CPU.


From the tweet reply you used sccache with hot cache, which would probably be mostly single-threaded since it's just fetching and copying things from cache.


I’m not trying to compare the details of SOC performance.

Just that with the same hot caches, the average change-build-test loop that developers do 100+ times a day is just faster on the M1 Max.


Try mold.


>Try mold

Curiosity got the better of me:

https://github.com/rui314/mold


We plan to move to it once MacOS support lands (for the laptops).


Intel was the single-threaded king until Zen3, so that's no real surprise.

Try the same thing with mold.


My 1st gen 16 core Threadripper is barely faster than an M1 Pro/Max at kernel builds, so a 64 core TR3 should handily double the M1 Ultra performance.

But you know, I'm still happy to double my current build perf in a small box I can stick in my closet. Ordered one :-)


How many threads are actually getting utilized in those kernel builds? I don't work on the kernel enough to have intuition in mind but people make wildly optimistic assumptions about how compilation stresses processors.

Also 1st gen threadrippers are getting on a bit now, surely. It's a ~6 year old microarchitecture.


Kernel has thousands of compilation units. Each of them is compiled by a separate compiler process. Only the linking at the end doesn't parallelise, however it should take a much smaller part of the time. The proportions change of course, if you develop kernel and do incremental builds lots of times. Then the linking stage might become a bottleneck.

The above statement should also relate to most other C/C++ projects.


The kernel is C code though, which doesn’t require as much CPU to compile as, say, C++. It could be more IO-bound. There should be a lot more legit data available on this than my armchair speculations.


C++ adds some time-consuming features, like templates; however, my belief was that most of the time goes to optimization passes, which won't be much faster for C. Actual benchmarks should be done, however.


Yes, it'd be interesting to see this comparison made with current AMD CPUs and a full build that has approximately the same price.

I am curious whether there is a real performance difference?

I do lots of computing on high-end workstations. Intel builds used to be extremely expensive if you required ECC. They used that to discriminate prices. Recent AMD offerings helped enormously. I wonder whether these M1 offerings are a significant improvement in terms of performance, making it worthwhile to cope with the hassle of switching architectures?


but do the m1s have ecc?


All of them. The CPU graph is pegged for most of the compilation.


Kernel compilation can be heavily parallelized.


I wouldn’t automatically expect a linear decrease in compile time with growing core count. That would have to be tried.


Seems like for things that are (1) perfectly parallel and (2) not accelerated by some of the other stuff that's on the Apple Silicon SoCs, it will be a toss-up.

Threadripper 3990X get about 25k in Geekbench Multicore [1]

M1 Max gets about 12.5k in Geekbench Multicore, so pretty much exactly half [2]

Obviously different tasks will have _vastly_ different performance profiles. For example it's likely that the M1 Ultra will blow the Threadripper out of the water for video stuff, whereas Threadripper is likely to win certain types of compiling.

There's also the upcoming 5995WX which will be even faster: [3]

[1] https://browser.geekbench.com/processors/amd-ryzen-threadrip...

[2] https://browser.geekbench.com/v5/cpu/search?utf8=%E2%9C%93&q...

[3] https://www.amd.com/en/products/cpu/amd-ryzen-threadripper-p...


Also of note is that half of the Mac Studio's case is dedicated to cooling. Up to this point, all M1 Max benchmarks are within laptops while all Threadripper benchmarks are in desktops. The M1 Max in the Mac Studio will probably perform better than expected.


This is sound logic and will probably be the case, but I wonder if this effect will be smaller than what we have seen in the past because of the reduced TDP of the M1 processors in general.

Maybe the cooling and power delivery difference between laptop form factors and PC form factors will matter less with these new ARM-based chips.


Maybe; every chip hits a point where feeding in more power doesn’t make it go any faster.

If I was to guess, the increased cooling probably helps the Studio sustain similar boost clocks as the laptops, but for longer.

Although it’s possible these are on N4x, which might increase the attainable boost.


Geekbench is extremely sensitive to the OS. Like the same CPU on Windows & Linux score wildly different on Geekbench. For example the 3990X regularly hits 35k multicore geekbench when run on Linux: https://browser.geekbench.com/v5/cpu/11237183


Something is seriously fishy about those geekbench results.

The 24-core scores 20k, the 32-core scores 22.3k, and the 64-core scores 25k. Something isn't scaling there.


Many GB5 (and real world) tasks are memory bandwidth bottlenecked, which greatly favors M1 Max because it has over double a Threadripper's memory bandwidth.


Sort of. The CPU complex of the M1 Max can achieve ~200 GB/s, you can only hit the 400 GB/s mark by getting the GPU involved.

At the same time the Threadrippers also have a gargantuan amount of cache that can be accessed at several hundred gigabytes per second per core. Obviously not as nice as being able to hit DRAM at that speed.


That cache is not uniform time access. It costs over 100ns to cross the IO die to access another die's L3, almost as much as going to main memory. In practice you have to treat it as 8 separate 32 MB L3 caches.

Also, not everything fits into cache.


Yeah, it’s the real-world tasks that Geekbench tries to simulate that don’t tend to scale linearly with processor count. A lot of software does not take good advantage of multiple cores.


That's certainly true. But if that's your workload you shouldn't be buying a 64-core CPU...

I use a few 32 and 64 core machines for build servers and file servers, and while the 64-core EPYCs are not twice as fast as the 32-core ones due to lower overall frequency, they're 70% or so faster in most of the things I throw at them.


Does Geekbench actually attempt to simulate that in their multi-core score? And how?

I was under the impression that all of their multi-core tests were "run N independent copies of the single-threaded test", just like SPECrate does.


> A lot of software does not take good advantage of multiple cores.

It sounds pointless to come up with synthetic benchmarks which emulate software that is not able to handle hardware, and then use said synthetic benchmarks to evaluate the hardware performance.


It has a very specific point: communicating performance to people who don't know hardware.

Most consumers are software aware, not hardware aware. They care what they will use the hardware for, not what they can use it for. To that end, benchmarks that correlate with their experience are more useful than a tuned BLAS implementation.


Probably it's the thermals that don't scale. The more the cores, the lower the peak performance per core.


Having not seen benchmarks, I would imagine that claimed memory bandwidth of ~800 GB/s vs Threadripper's claimed ~166 GB/s would make a significant difference for a number of real-world workloads.


Someone will probably chime in and correct me (such is the way of the internet - Cunningham's Law in action) but I don't think the CPU itself can access all 800 GB/s? I think someone in one of the previous M1 Pro/Max threads mentioned that several of the memory channels on Pro/Max are dedicated for the GPU. So you can't just get a 800 GB/s postgres server here.

You could still write OpenCL kernels of course. Doesn't mean you can't use it, but not sure if it's all just accessible to CPU-side code.

(or maybe it is? it's still a damn fast piece of hardware either way)


On an M1 Max MacBook Pro the CPU cores (8P+2E) peak at a combined ~240GB/s; the rest of the advertised 400GB/s memory bandwidth is only usable by the other bus masters, e.g. GPU, NPU, video encoding/decoding, etc.
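If anyone wants to sanity-check that kind of number on their own machine, here's a crude copy-bandwidth sketch with numpy (it measures achievable single-stream copy bandwidth, not the theoretical aggregate, so expect a figure well below the advertised one):

    import numpy as np
    import time

    N = 512 * 1024 * 1024 // 8       # 512 MB of float64, well past any cache
    a = np.random.rand(N)
    b = np.empty_like(a)

    best = float("inf")
    for _ in range(5):
        start = time.perf_counter()
        np.copyto(b, a)              # one read stream + one write stream
        best = min(best, time.perf_counter() - start)

    bytes_moved = 2 * a.nbytes       # read a, write b
    print(f"~{bytes_moved / best / 1e9:.1f} GB/s copy bandwidth")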


So now the follow-on question I really wanted to ask: if the CPU can't access all the memory channels does that mean it can only address a fraction of the total memory as CPU memory? Or is it a situation where all the channels go into a controller/bus, but the CPU link out of the controller is only wide enough to handle a fraction of the bandwidth?


It's more akin to how on Intel, each core's L2 has some maximum bandwidth to LLC, and can't individually saturate the total bandwidth available on the ring bus. But Intel doesn't have the LLC <-> RAM bandwidth for that to be generally noticeable.


Fascinating!

Linking this[1] because TIL that the memory bandwidth number is more about the SoC as a whole. The discussion in the article is interesting because they are actively trying to saturate the memory bandwidth. Maybe the huge bandwidth is a relevant factor for the real-world uses of a machine called "Studio" that retails for over $3,000, but not as much for people running postgres?

1 - https://www.anandtech.com/show/17024/apple-m1-max-performanc...


I'm obviously going to reserve judgement until people can get their hands on them. Apple makes good stuff, but their keynote slides are typically heavily cherry-picked (e.g. "our video performance numbers compare our dedicated ASIC to software encoding on a different architecture, even though competing ASICs exist" kinds of things).


Next-gen Threadripper Pro was also announced today: https://www.tomshardware.com/news/amd-details-ryzen-threadri...


Bit of a wet fart though, even Charlie D thinks it's too little too late. OEM-only (and only on WRX80 socket), no V-cache, worse product support.

https://semiaccurate.com/2022/03/08/amd-finally-launches-thr...

The niche for high clocks was arguable with the 2nd-gen products but now you are foregoing v-cache which also improves per-thread performance, so Epyc is relatively speaking even more attractive. And if you take Threadripper you have artificial memory limits, half the memory channels, half the PCIe lanes, etc, plus in some cases it's more expensive than the Epyc chips. It is a lot to pay (not just in cash) just for higher clocks that your 64C workloads probably don't even care about.

AMD moved into rent-seeking mode even before Zen3 came out. Zen2 threadripper clearly beats anything Intel can muster in the segment (unless they wanted to do W-3175X seriously and not as a limited-release thing with $2000 motherboards) and thus AMD had no reason to actually update this segment when they could just coast. Even with this release, they are not refreshing the "mainstream" TRX40 platform but only a limited release for the OEM-only WRX80 platform.

It was obvious when they forced a socket change, and then cranked all the Threadripper 3000 prices (some even to higher-levels than single-socket Epyc "P" skus) what direction things were headed. They have to stay competitive in server, so those prices are aggressive, but Intel doesn't have anything to compete with Threadripper so AMD will coast and raise prices.

And while Milan-X isn't cheap - I doubt these WRX80 chips are going to be cheap either, it would be unsurprising if they're back in the position of Threadripper being more expensive for a chip that's locked-down and cut-down. And being OEM-only you can't shop around or build it yourself, it's take it or leave it.


The PRO Threadrippers are not cut down, they have 8 memory channels and 128 PCIE 4.0 lanes. I think the only limitation compared to Epyc is that you can have 1 socket only.


And WRX80 has dedicated chipset lanes (https://www.amd.com/en/chipsets/wrx80), so effectively more PCI-e than EPYC, and bootable NVME RAID support on top.


It's bad for competition that only Apple gets to use TSMC's 5nm process. Though what's really bad is that Intel and Samsung haven't been able to compete with TSMC.


AMD will be on TSMC N5P next year, which will give them node parity with Apple (who will be releasing A15 on N5P this year), and actually a small node lead over the current N5-based A14 products. So we will get to test the "it's all just node lead guys, nothing wrong with x86!!!" theory.

Don't worry though there will still be room to move the goalposts with "uhhh, but, Apple is designing for high IPC and low clocks, it's totally different and x86 could do it if they wanted to but, uhhh, they don't!".

(I'm personally of the somewhat-controversial opinion that x86 can't really be scaled in the same super-wide-core/super-deep-reorder-buffer fashion that ARM opens up and the IPC gap will persist as a result. The gap is very wide, higher than 3x in floating-point benchmarks, it isn't something that's going to be easy to close.)


We've already seen x86 draw even, with Intel 12th gen: https://www.youtube.com/watch?v=X0bsjUMz3EM


So in that one, you've got a 20-thread Intel part (6+8C/20T) at probably 3.5 GHz going against a 10-thread Apple part (8+2C/10T), at probably 3 GHz, and the Apple part still comes out on top by ~5% in Cinebench R23 MT. And that's with Intel having 50% more high-performance threads available.

Work out the IPC there - the Intel has a 2x thread count advantage, a 17% clock advantage, and Apple comes out 5% ahead. So the IPC gap there is about 2.46x.
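Spelling out that arithmetic (per-thread, per-clock throughput, under the admittedly crude assumptions that the workload scales evenly across threads and that the clock and score figures above are right):

    # Assumed figures from the comparison above: 20 vs 10 threads,
    # ~3.5 vs ~3.0 GHz, and Apple finishing ~5% ahead in total score.
    apple_score_rel = 1.05
    intel_score_rel = 1.00
    apple_per_thread_clock = apple_score_rel / (10 * 3.0)
    intel_per_thread_clock = intel_score_rel / (20 * 3.5)
    print(f"~{apple_per_thread_clock / intel_per_thread_clock:.2f}x per-thread, per-clock advantage")

That prints ~2.45x, which is where the "about 2.46x" figure comes from.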

It's not a perfect comparison of course, since we're mixing SMT and big/little cores, but in basically every area Intel should (on paper) have more resources available and Apple is coming out on top anyway by sheer IPC.

That's what I'm saying - you can't really do that approach with x86. It's not power-advantageous or transistor-advantageous to go super wide on the decode or reorder buffer like that on x86. And regardless of the tricks x86 uses to mitigate it, you've still got a 2.5x IPC gap at the end of the day. A 2.5x IPC gap will not be closed up by just a single node shrink.

And that's looking at MT, where your task scales perfectly. See where I'm going with this? Intel is using 2x the number of threads, and 3x the number of efficiency cores to get there. Apple can deliver that punch across a much lower number of threads - meaning ST-bottlenecked tasks will scale much much better on Apple.

With a single-threaded test, the M1 is pulling 7W vs 33W for the Alder Lake intel. Obviously that tells us nothing about efficiency, since we'd need to know the scores, but that's the downside, is for normal, poorly-threaded tasks, like surfing the web or editing code, the 12900HK is going to be boosting high to reach the same performance levels the M1 does at 3 GHz. And that's exactly what you see in the power figures there.

In short: you will likely see x86 able to keep up in one metric or another. You can win on performance if you just go nuclear on power. You can match on power on perfectly-threadable tasks that allow the x86 to deploy twice the threads (sharing instruction cache/etc). You can match on single-threaded battery life if you accept lesser performance. But the overall performance of the M1 derives from the massive IPC it generates, and that's something that x86 can't match nearly as easily.

Going ham on a single metric just to claim victory isn't nearly the same thing as the level of all-round performance and efficiency that Apple has achieved there.

(see also, putting a 128-thread Threadripper 3990WX workstation up against a 10-thread M1 Max laptop just to win at rendering... and people here thought that disproved that Apple was great hardware lol)


I don't think there's anything specifically different about the reorder buffer between x86 and ARM.

The reorder buffer size is just a logical consequence of the frontend width.

And yes, scaling an Aarch64 frontend is dead simple compared to x86 due to the fixed instruction width. The disadvantage of x86 is serious, but I don't know if we can count it out quite yet. This is the first time Intel and AMD got any serious pressure on that front. I'm sure they're taking the challenge seriously, and it'll take some years before we'll see the results.


x86 and ARM have different memory access assumptions, which affects the reorder buffer in material ways. Apple added something to their chips to use the x86 memory model when flagged for Rosetta 2.


How much do the little cores and HT contribute to this workload, though? If I, say, turn them off, could I claim the opposite: losing even 20% while using 40% fewer threads?


There is a third variable: Apple is putting RAM much closer to the CPU than AMD. This has the advantage that you get lower latency (and slightly higher bandwidth), but the downside that you're currently limited to 128GB of RAM, compared to 2TB for Threadripper (4TB for Epyc). AMD's 3D cache that they're launching in a few months will be interesting since it lets the L3 go up a ton.


Latency to RAM is not any better. Bandwidth is much better, but not for this reason.


why so smug? who cares whichever company makes a better processor?


Well, compare that to a $400 CPU like the 5900X; the first M1 is slower than it and costs 2x the price.


Apple's ARM chips can process a metric ton of ops per cycle due to the architecture of the chip: https://news.ycombinator.com/item?id=25257932


But the answer to the question is still "no".


Doesn't have to though. A Threadripper 3990X uses barrels of electricity, generates plenty of heat, comes with no GPU, has worse single-threaded performance, and still costs $4000 by itself without any of the parts needed to make it actually work.


The question is in relation to Apple's claim that it's "the world’s most powerful and capable chip for a personal computer".


It might also be reasonable to say that the threadripper is a workstation chip, not a chip for personal computers.

Edit: even AMD themselves call their threadripper lineup workstation chips, not personal.

https://www.amd.com/en/processors/workstation


Threadripper Pro is the workstation chip. Regular Threadripper (non-Pro) was not aimed at workstations, it was aimed at the "HEDT" market. Strictly speaking it's considered a consumer market (albeit for the enthusiasts of enthusiasts)


I'd call them personal chips. When I think of non-personal chips I think IBM POWER or Ampere Altra.


Why do you think of Altera as non-personal chips?


Not Altera, Ampere Altra. This: https://amperecomputing.com/processors/ampere-altra/

If the purchase page says to "contact sales" and doesn't list a price then it is not for consumers.


Click where to buy tab, then there is a list of distributors. Including ones which sell workstation versions with a configurator and pricing. https://store.avantek.co.uk/ampere-altra-64bit-arm-workstati...


Depends on what you define "capable" as. Remember, they specify that it is the most powerful and capable chip, not necessarily complete system.

There's no other chip that has the power of an RTX 3090 and more power than an i9-12900K in it - after all, Threadripper doesn't have a lick of graphics power at all. This chip can do 18 8K video streams at once, which Threadripper would get demolished at.

I'm content with giving them the chip crown. Full system? Debatable.


It's a fantastic chip but that wasn't the question. I love my M1 Max and I love my Threadripper workstation, each has their own strengths and that's alright.


Though you would need to compare it to the coming Threadripper 5000WX(?) or, better, the upcoming Ryzen 7000 CPUs (which seem to have integrated graphics).

I mean they all are CPUs coming out this year as far as I know.


Only if the only thing you compare is CPU performance - adding a big GPU on die adds a certain amount of ‘power’ by any measure.


It doesn't matter. Speaking as an Apple cult member imo Threadripper is better value if you're not using the machine for personal use.


It doesn’t support your argument when we’re talking about a massive processor like a threadripper vs. a M1 Ultra.

The performance per watt isn’t in the same universe and that matters.


The article claims that the chip is "the world’s most powerful and capable chip for a personal computer". It's reasonable to ask whether it genuinely is faster than another available chip, it's not an implicit argument that it's not powerful.


The M1 Ultra is by a very wide margin the bigger of the two. According to Tom's Hardware [1], top-of-the-line Epycs have 39.54 billion transistors. That is about a third of the 117 billion in the M1 Ultra. Apple builds bigger than anyone else, thanks largely to their access to TSMC's best process.

The M1 Ultra is a workstation part. It goes in machines that start at $4,000. The competition is Xeons, Epycs, and Threadrippers.


That’s not really a fair comparison. Apple's chip spends most of that on the GPU, and the neural engine takes a chunk too. Threadripper is only a CPU.


> The performance per watt isn’t in the same universe and that matters.

I couldn’t give less of a shit about performance-per-watt. The ONLY metric I care about is performance-per-dollar.

A Mac Studio and Threadripper are both boxes that sit on/under my desk. I don’t work from a laptop. I don’t care about energy usage. I even don’t really care about noise. My Threadripper is fine. I would not trade less power for less noise.


This is what some folks miss.

One hour of my time is more expensive than an entire month of a computer's electricity bill.

Some people just want tasks to perform as fast as possible regardless of power consumption or portability.

Life's short and time is finite.

Every second adds up for repetitive tasks.
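To put rough numbers on that (assumed: a box pulling 400W flat out, 8 hours a day, 22 working days a month, at $0.15/kWh; adjust for your own machine and rate):

    watts = 400                 # roughly a high-end desktop under sustained load
    hours = 8 * 22              # 8 h/day, 22 working days
    usd_per_kwh = 0.15          # varies a lot by region
    print(f"~${watts / 1000 * hours * usd_per_kwh:.2f}/month in electricity")  # ~$10.56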


The only reason I've ever cared about watts is that generally speaking 120 watt and 180 watt processors require more complicated cooling solutions. That's less true today than it ever was. Cases are designed for things like liquid cooling, and they tend to be pretty silent. The processors stay cool, and are pretty reliable.

I personally stick to the lower wattage ones because I don't generally need high end stuff, so I think Apple is going the right direction here, but it should be noted that Intel has also started down the path of high performance and efficiency cores already. AMD will find itself there too if it turns out that for home use, we just don't need a ton of cores, but instead a small group of fast cores surrounded by a bunch of specialist cores.


wattage doesn't really tell you how difficult it is to cool a part anymore. 11th-gen Intel is really easy to cool despite readily going to 200W+. Zen3 is hard to cool even at 60W.

Thermal density plays a huge role, the size of the chips is going down faster than the wattage, so thermal density is going up every generation even if you keep the same number of transistors. And everyone is still putting more transistors on their chips as they shrink.

Going forward this is only going to get more complicated - I am very interested to see how the 5800X3D does in terms of thermals with a cache die over the top of the CCD (compute die). But anyway that style of thing seem to be the future - NVIDIA is also rumored to be using a cache die over the top of their Ada/Lovelace architecture. And obviously 60W direct to the IHS is easier to cool than 60W that has to be pulled through a cache die in the middle.


The Zen 3 stock coolers work pretty well. I've never had a problem.

Looking it up though I do see a lot of concerns with the heat they generate. I can only conclude I don't push my chip very hard (which, honestly, I probably don't)

I've been happy with the AMDs I purchased over the past 4 years, we'll see how they hold up and how this next gen comes out. I did see that the recent Intels are quite competitive which is good for everybody.


yup, the correct answer here is people need to stop being worried about the thermals as a metric in themselves, and look at the performance their chip is generating. If your chip is running at 90C, but you're hitting 100 ScoreMarks, and attaching a 5hp chiller to it lets you hit 105 ScoreMarks, that's not really worth it.

Yeah, longevity, blah blah, but laptop chips are designed to sit above 90C under load, it's fine.

Just saying that "how hard it is to cool" doesn't solely depend on power consumption anymore. Heat density is making that harder and harder, even if power consumption stays the same.

What does improve though is how much heat it pumps into your room. Yeah, a Rocket Lake at 200W might be roughly as hard to cool as an AMD at 90W or whatever... but one is still putting 200W into your room and the other is still putting 90W. Temperatures are not the same thing as power dissipation either. I don't like having my gaming PC running in my room during the summer, and I'm actually looking at maybe running cables through the walls to have it in the basement instead. I also have a 5700G and some NUCs that are much lower power that I prefer to use for surfing and shitposting.


> wattage doesn't really tell you how difficult it is to cool a part anymore

Sure it does. Reading the rest of your post I think you're more talking about temperature than cooling requirements, but a 200W CPU needs 200W of heat dissipation, while a 60W CPU only needs 60W of heat dissipation. It's literally a 1:1 relationship since CPUs don't do any mechanical work, so power in == heat out.

Keeping temperatures below some arbitrary number does then include things like density, IHS design, etc... But that only matters for something like Intel's "Thermal Velocity Boost" where it's really important to stay under 70C specifically instead of just avoiding thermal throttling.


Air coolers can handle 300 watts without any complexity. Just a big block of fins on heat pipes.


I mean, yeah, but then you've got case issues and whatnot. I appreciate your point though.


The power is only relevant because it makes the machine quiet in a compact form. If you've got a bit of space, then a water cooled system accomplishes a lot of the same thing. For some people there is an aesthetic element.

Power does make a big difference in data centers though - it's often the case that you run out of power before you run out of rack space.

Where power for a computer might make a difference could be in power-constrained (solar/off grid) scenarios.

I don't know if I've ever heard anyone make an argument based on $$$.


Did you know noise pollution causes dementia?

https://www.theguardian.com/society/2021/sep/09/transport-no...


Ouch. I wonder what Apple thinks about that after selling millions of noisy intel macbooks.

As for desktops, watercooling makes computers dead silent.


Yep, it's bad that those products were like that.

I personally bought a Ryzen 5950(?) instead of a Threadripper because I figured I'd accidentally spill water all over it or however it works. There are not many watercooled OEM products as far as I know.


The vast majority of developers today have a laptop as their main machine. Performance-per-watt is absolutely crucial there.


I'll just never understand this. Chances are you are at the same desk day in and day out. You probably have monitors and an external keyboard/mouse hooked up because hunching over a laptop and using a touchpad is unnecessary torture for a fixed workspace. Given that, why would you hamstring yourself with a thermally constrained and overpriced-because-miniaturization-isn't-free setup?

Until maybe these M1's (and I'm not entirely convinced) I've not in the 20 years I've been computing seen a reasonably configured desktop (eg not just a laptop on a stick ala iMac but an ACTUAL desktop) ever not smoke the pants off of every single laptop you could put up against it. It's hard to beat the one-two punch of lots of power and room to cool it. If you are sitting at at desk why the heck wouldn't you leverage that?


I was a desktop diehard until performance reached the point where laptops are good enough. Being able to take the same computer around with you and use it seamlessly in a different place is a big improvement.

I still have a proper desk-based working environment hooked up to a docking station though. I really wouldn't want to use a laptop that doesn't have a first-party dock as my primary machine.


Meetings, working from home, travel, changing desks every few months. Not having a jungle of cables. Retina screen (still hard to find on external monitors). The vast majority of devs also don’t need an insanely powerful machine. If Docker on Mac wasn’t dog slow, I could easily get by with a $999 MacBook Air.


The only reason I also went up to 32GB RAM(almost bought the 64GB one) was because of Docker.


That’s cool. I am not the vast majority of developers. I am me. I use a desktop for high-end game development. I don’t give a shit about web development or laptop development. I care about compiling large C++ projects as fast as possible.

I agree that most developers are web/mobile developers who use a laptop. That’s great. I am an increasingly niche developer.

The root comment was a comparison against Threadripper. Normal developers should not waste money on a Threadripper. If someone is a niche developer that warrants a Threadripper then pointing out that most developers don’t need a Threadripper is a waste of time.


I care about power only because it means louder fans, and I like quiet.


Same. That's why I use watercooling to keep the room silent.


It is worth noting that, at least according to Apple's graphs, it has slightly more graphics performance than an RTX 3090.

So, even if it doesn't quite beat Threadripper in the CPU department - it will absolutely annihilate Threadripper in anything graphics-related.

For this reason, I don't actually have a problem with Apple calling it the fastest. Yes, Threadripper might be marginally faster in real-world work that uses the CPU, but for other tasks like video editing and graphics, it won't be anywhere near close.


It won't be even close to an RTX 3090. Looking at the M1 Max and assuming the same scaling, at best it can be close to 3070 performance.

We all need to take Apple's claims with a grain of salt, as they are always cherry-picked, so I won't be surprised if it doesn't even reach 3070 performance in real usage.


Looks like all the people saying "just start fusing those M1 CPU's into bigger ones" were right, that's basically what they did here (fused two M1 Max'es together).

And since the presenter mentioned the Mac Pro would come on another day, I wonder if they'll just do 4x M1 Max for that.


> I wonder if they'll just do 4x M1 Max for that.

Unlikely, M1 Ultra is the last chip in the M1 family according to Apple [1].

"M1 Ultra completes the M1 family as the world’s most powerful and capable chip for a personal computer.”"

[1] https://www.apple.com/newsroom/2022/03/apple-unveils-m1-ultr...


I've been saying 4x M1 Max is not a thing and never will be a thing ever since the week I got my M1 Max and saw that the IRQ controller was only instantiated to support 2 dies, but everyone kept parroting that nonsense the Bloomberg reporter said about a 4-die version regardless...

Turns out I was right.

The Mac Pro chip will be a different thing/die.


Plus they are running out of M1 superlatives. They’ll have to go to M2 to avoid launching M1 Plaid.


They could do M1 More Thing.


m1 hyper turbo deluxe

or they could take a page out of microsofts book and just call the next one "m one"


M1 One: The Second One!


M1 Series One


M1 Ultra²


M1 Mark II ala Sony


Bloomberg brought us the Supermicro hit pieces. I personally can't take them seriously anymore. Not after the second article with zero fact checking and a sad attempt at an irrelevant die shot. And their word is certainly irrelevant against that of the people who are working (and succeeding) at running Linux on M1.


Could they do a multi-socket board for the Mac Pro?


I expect this for the CPU side. Multiple SoCs introduce NUMA, but that was already done on the dual-Xeon-based Mac. I wonder how their GPU would work in that configuration.


In a world where latency:storage size tradeoffs need to be made for practical reasons (and will, at some point, be required for fundamental physical reasons), we should just embrace NUMA anyway. Death to the lie of uniform access! NUMA is the future!

Ehrm, anyway.

It actually isn't clear to me whether designing a two socket motherboard is fundamentally an easier task than jamming more of the things into a single package (given that they have already embraced some sort of chiplet paradigm).


I believe designing/manufacturing a 5nm chip only used for the Mac Pro can't be profitable. That's why they designed the M1 Ultra as an MCM of the M1 Max.


They would never do that


The Mac Pro (and the high-end Powermacs that preceded it) were always available in a dual socket incarnation, right up to the trashcan Mac.


The trashcan Mac Pro was announced almost 9 years ago. Apple today is a very different company. They've moved onto chiplet designed SoC's with little to no care about upgradability.


They have done this previously for dual socket Xeons. Historical precedent doesn’t necessarily hold here, but in fact, it’s been done on the “cheese graters” previously


They've moved on to chiplet design. Don't disregard their clear direction to SoC's with non-upgradeable RAM, GPU, CPU.


They also water-cooled them but I’d bet a lot of money we never see that again either.


The Mac Pro chip will be a 4x M2 Max


In a previous Apple press release[1] they said:

> The Mac is now one year into its two-year transition to Apple silicon, and M1 Pro and M1 Max represent another huge step forward. These are the most powerful and capable chips Apple has ever created, and together with M1, they form a family of chips that lead the industry in performance, custom technologies, and power efficiency.

I think it is just as likely that they mean "completes the family [as it stands today]" as they do "completes the family [permanently]."

[1] https://www.apple.com/newsroom/2021/10/introducing-m1-pro-an...

edit: This comment around SoC code names is worth a look too: https://news.ycombinator.com/item?id=30605713


What is M2 really going to be difference wise?


I think M2 is going to improve on single thread performance.

Judging from the geekbench scores[0], M1, M1 Pro, and M1 Max perform identically in single threaded tasks. And the newly leaked Mac Studio benchmark[1] shows essentially identical single thread performance.

[0]: https://browser.geekbench.com/mac-benchmarks [1]: https://browser.geekbench.com/v5/cpu/13330272


~15-20% faster if releases start this year, plus whatever optimizations were learned from the M1 in wide release, such as perhaps tuning the silicon allocation given the various systems. If next year, M2 or M3 (get it) will use Taiwan Semi's so-called 3nm, which should be a significant jump just like 7nm to 5nm several years ago for the phones and iPads.


Hopefully one of the changes of the M2 design will be a better decoupling of RAM and core count.

They’d need that anyway for a Mac Pro replacement (128GB wouldn’t cut it for everyone), but even for smaller config it’s frustrating being limited to 16G on the M1 and 32 on the Pro. Just because I need more RAM doesn’t mean I want the extra size and heat or whatever.


For my purposes, the biggest drawback of using an SoC is being constrained to just the unified memory.

Since I run a lot of memory intensive tasks but few CPU or GPU bound tasks, a regular m1 with way more memory would be ideal.


I doubt there will be much learned after actually shipping M1. Developing silicon takes a long time. I wouldn’t be surprised if the design was more or less fixed by the time the M1 released.


They also said that the Mac Pro is still yet to transition. So they'll have to come up with something for that. My suspicion is that it won't be M branded. Perhaps P1 for pro?


I have been thinking about this, and one theory I came up with was a switch chip between M1 Max chiplets... you have 4 (or more?) M1 Maxes connected to the same switch chip... it might add latency for some tasks but it would be one way to scale without going to a new M2 processor... Then again, M2 could come out in June at the WWDC, doubling everything and adding 3 more ultra connect things to each M2 Max, allowing unlimited (ish) upgradability... but, probably not...


It may be M branded, just not M1. It could be based on an M2 Ultra or M2 Mega or whatever and what they said would still hold true.


That doesn't necessarily rule out more powerful iterations that also launch under the M1 Ultra branding though.

(edit: per a sibling comment, if the internals like IRQ only really scale to 2 chiplets that pretty much would rule it out though.)


Probably not on the same design as the current M1 series, at least not for the Mac Pro. The current x86 pro supports up to 1.5TB of RAM. I don’t think they will be able to match that using a SoC with integrated RAM. There will probably be a different CPU design for the Pro with an external memory bus.


M2 family Mac Pro upcoming…


>Looks like all the people saying "just start fusing those M1 CPU's into bigger ones" were right,

Well, they were only correct because Apple managed to hide a whole section of the die image (which is actually genius). Otherwise it wouldn't have made any sense.

Likely to be using CoWoS from TSMC [1] since the bandwidth numbers fit. But it needs further confirmation.

[1] https://en.wikichip.org/wiki/tsmc/cowos


The interconnect area was already known from third-party die photos of the M1 Max https://twitter.com/vadimyuryev/status/1466526403331952644


They were not known when people kept rambling about the Mark Gurman report on "just start fusing those M1 CPU's into bigger ones".

I wrote about it three months ago.

https://news.ycombinator.com/item?id=29430817


I've been using a Vega-M for some time which I think follows this model. It's really great.



Right. Still riding gains from the node shrink and on package memory.

Could AMD/Intel follow suit and package memory as an additional layer of cache? I worry that we are being dazzled by the performance at the cost of more integration and less freedom.


The next CPU coming from AMD will be the 5800X3D with 96MB cache. They stack 64MB L3 on top. Rumours say it comes out 20th of April.

edit: typo + stacking + rumoured date


They might have to make the unified memory more dense to get to a 1.5TB max of RAM on the machine (also since this would be shared with the GPU). Maybe they could stack the RAM on the SoC or just get the RAM at a lower process node.


The M1 Max/Ultra is already extremely dense design for that approach, it's really almost as dense as you can make it. There's packages stacked on top, and around, etc. I guess you could put more memory on the backside but that's not going to do more than double it, assuming it even has the pinout for that (let's say you could run it in clamshell mode like GDDR, no idea if that's actually possible, but just hypothetically).

The thing is they're at 128GB which is way way far from 1.5TB. You're not going to find a way to get 12x the memory while still doing the embedded memory packages.

Maybe I'll be pleasantly surprised but it seems like they're either going to switch to (R/LR)DIMMs for the Mac Pro or else it's going to be a "down" generation. And to be fair that's fine, they'll be making Intel Mac Pros for a while longer (just like with the other product segments), they don't have to have every single metric be better, they can put out something that only does 256GB or 512GB or whatever and that would be fine for a lot of people.


> You're not going to find a way to get 12x the memory while still doing the embedded memory packages.

https://www.anandtech.com/show/17058/samsung-announces-lpddr...

> It’s also possible to allow for 64GB memory modules of a single package, which would correspond to 32 dies.

It is possible, and I guess that NVIDIA’s Grace server CPU will use those massive capacity LPDDR5X modules too.

The M1 Ultra has 8 memory packages today, and Apple could also use 32-bit wide ones (instead of 64-bit) if they want more chips.


Do people really need 1.5 TB of unified memory? If you had 128 GB of unified memory and another pool of 1.5 TB or whatever of “slower” memory (more like normal speed on the Intel/AMD side) would that work?

You (or the OS or the chip) could page things in and out of the unified memory. Treat unified memory as a MEGA L3 cache.

Depending on how it’s done it may not be transparent if you want the best performance. But would it work?


>I wonder if they'll just do 4x M1 Max for that.

They'll be running out of names for that thing. M1 Ultra II would be lame, so M1 Extreme? M1 Steve?


Just need to increment the 1 instead... Eventually moving into using letters instead of numbers, until we end up with the MK-ULTRA chip.


I suspect they will have a different naming convention after they get to M4.

There might be some hesitance installing an M5. You should stay out of the way if the machine learning core needs more power.

I guess by the time they get to M5, anyone old enough to get the reference will have retired.


That would really be a trip!


> M1 Steve

That would be the funniest thing Apple has done in years. I totally support the idea.


Pro < Max < Ultra < Ne Plus Ultra < Steve


And Steve < Woz, perhaps?


I'm pretty sure all their messaging is preparing us for "M1 Outrageous"


Super M1 Turbo HD Remix


M1 Ludicrous the IV


Maximum Plaid


"M1 More", would show Apple is fun again!


It seems kind of strange to have the "A" line go from smartphones to... iPads, and then have the "M" line go all the way from thin-and-lights to proper workstations. Maybe they need a new letter. Call it the C1 -- "C" for compute, but also for Cupertino.


M1 Max Pro... :P


"iPhone 14 Pro Max, powered by the M1 Max Pro".


M1 Hyper or M1 Ludicrous:)


M1 Houndstooth


M1 God


Just get the guys who came up with the new name for the iPhone SE working on it. Oh, wait.


I like "X1" way more than "M2 Extreme".


Plus Ultra would at least have historical precedent.


I like M1 Steve, as it can honor two people.


M1 Plaid


lol, good one! I hope marketing sees this.


Steve would be awesome, but a deal with Tesla to use "Plaid" would be perfection.


I would think they could just go to Mel Brooks instead of dealing with Tesla.


Never happen.

Elon is somewhat toxic these days...


iM1 Pro.


M1 Magnum XL


Epic M1 fit good


M1 Greta in 2030 (when it is "carbon neutral")


You don't just "fuse" two chips together willy-nilly. That was designed into the architecture from the beginning for future implementation.


My guess would be an M1 Ultra with additional modular expansions for specific purposes. I.e. instead of having a GPU, you could add a tensor processing module if you need to do machine learning, or a graphics processing chip for video production, and so on.


It's a chiplet design. Whenever people ask what we're going to do after 1nm... well, we can combine two chips into one.


The CPU trend nowadays seems to be combining chiplets and having performance and efficiency cores.


Throwing more silicon at it, like this, sounds extremely expensive and price-inefficient.

It's at least two separate chips combined together, which makes more sense and mitigates the problem.


The power leveling in this chip's naming scheme can rival Dragon Ball Z.


How many CUDA cores? It's over ninethousaaaaa .. oh wait nevermind!


My first thought was "does it include an LSD subscription?".


I for one am holding out for the ULTRA GIGA chips.


the M1-O9000


I wish HN was like this more often


I really hope it won't. Let's cherish the high quality comments of HN. Once this comment section becomes a karma-fed race to the bottom driven by who can make the most memeable jokes, it will never recover. Case in point: Reddit.


I appreciate that it's infrequent. Sure, it's fun to blow off some steam and have a laugh, but that's fundamentally not what this place is about. Confining it to Apple release threads makes it more of a purge scenario.


I don't.

There's already Reddit if you want to crack puns and farm karma. Let's try to keep the signal:noise ratio higher here.


Be the comments you want to see in the HN


Would be interesting to get more info on the neural engine. On one hand, I find it fascinating that major manufacturers are now putting neural architectures into mainstream hardware.

On the other hand I wonder what exactly it can do. To what degree are you tied into a specific neural architecture (eg recurrent vs convolutional), what APIs are available for training it, and is it even meant to be used that way (not just by Apple-provided features like FaceID)?


It's a general purpose accelerator. You have coremltools[1] to convert your trained model into a compatible format, or you can make your own using CreateML[2]; a minimal conversion sketch follows the links.

[1] https://coremltools.readme.io/docs

[2] https://developer.apple.com/machine-learning/create-ml/
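
For example, converting a trained PyTorch model takes only a few lines with coremltools. This is a minimal sketch, assuming coremltools >= 5 and torchvision are installed; the model and file names are just illustrative:

    # Minimal conversion sketch: trace a PyTorch model and convert it to Core ML.
    import torch
    import torchvision
    import coremltools as ct

    model = torchvision.models.mobilenet_v2(pretrained=True).eval()  # any traceable model works
    example_input = torch.rand(1, 3, 224, 224)
    traced = torch.jit.trace(model, example_input)

    # Core ML decides at runtime whether ops run on the CPU, GPU, or Neural Engine.
    mlmodel = ct.convert(
        traced,
        inputs=[ct.TensorType(name="input", shape=(1, 3, 224, 224))],
    )
    mlmodel.save("MobileNetV2.mlmodel")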



Typical "neural engines" are intended for real-time network inference, not training. Training is highly parallel and benefits more from GPU-like vector processing.


Apple is pushing for training & fine-tuning on the devices too.

https://developer.apple.com/documentation/coreml/model_custo...


I'm cross posting a question I had from the Mac Studio thread (currently unanswered).

----

Mac Pro scale up?

How is this going to scale up to a Mac Pro, especially related to RAM?

The Ultra caps at 128 GB of RAM (which isn't much for video editing, especially given that the GPU uses the system RAM). Today's Mac Pro goes up to 1.5TB (and has dedicated video RAM above this).

If the Mac Pro is say, 4 Ultra's stacked together - that means the new Mac Pro will be capped at 512GB of RAM. Would Apple stack 12 Ultra's together to get to 1.5TB of RAM? Seems unlikely.


I think some of this can be guessed from the SoC codenames

https://en.wikipedia.org/wiki/List_of_Apple_codenames

M1 Max is Jade C-Die => 64GB

M1 Ultra is Jade 2C-Die => 128GB

There is a still unreleased SoC called Jade 4C-Die =>256GB

So I think that's the most we'll see this generation, unless they somehow add (much slower) slotted RAM

If they were to double the max RAM on M2 Pro/Max (Rhodes Chop / Rhodes 1C), which doesn't seem unreasonable, that would mean 512GB RAM on the 4C-Die version, which would be enough for _most_ Mac Pro users.

Perhaps Apple is thinking that anyone who needs more than half a Terabyte of RAM should just offload the work to some other computer somewhere else for the time being.

I do think it's a shame that in some ways the absolute high-end will be worse than before, but I also wonder how many 1.5TB Mac Pros they actually sold.


How is slotted RAM slower? 6400MHz DIMMs exist. This would match the specs of the RAM on the M1 Max. Even octa-channel has been done before, so the memory bus would have the exact same width, latency, and clock frequency.


The memory bandwidth of the M1 Max is 400 GB/s with 64GB of RAM, whereas the memory bandwidth of Corsair's 6400MHz DDR5 32GB RAM module is 51GB/s per stick, or 102GB/s for two sticks (the M1 Max's 64GB equivalent).


51GB/s * 8 (octa-channel, not dual channel as you are calculating) is 408 GB/s. Basically the same as the M1 Max. It's not fair to use an off the shelf product since even if the RAM is slotted Apple wouldn't use an off the shelf product.

Whether they use slotted RAM or not has nothing to do with performance. It's a design choice. For the mobile processors it makes total sense to save space. But for the Mac pro they might as well use slotted RAM. Unless they go for HBM which does offer superior performance.
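
To make the arithmetic concrete, here's a rough sketch assuming DDR5-6400, 64-bit DIMM channels, and the M1 Max's 512-bit LPDDR5 bus, per the figures above:

    # Back-of-the-envelope bandwidth: transfers/s * bytes per transfer per channel * channels.
    def bandwidth_gb_s(mt_per_s, channel_width_bits, channels):
        return mt_per_s * 1e6 * (channel_width_bits / 8) * channels / 1e9

    print(bandwidth_gb_s(6400, 64, 2))    # dual-channel DDR5-6400  -> ~102 GB/s
    print(bandwidth_gb_s(6400, 64, 8))    # octa-channel DDR5-6400  -> ~410 GB/s
    print(bandwidth_gb_s(6400, 512, 1))   # M1 Max 512-bit LPDDR5   -> ~410 GB/s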


Is 8 channel RAM doable, are there downsides? If no to both, why don't high end x86 processors have it?


High-end x86 do have it. Threadripper 3995WX for example.


Note that those are overclocked out of spec configurations today.

https://ark.intel.com/content/www/us/en/ark/products/134599/...

4800 MT/s is the actual maximum spec, anything beyond that is OC.


Agreed, I think they will use the 4C config to debut M2 and make a splash. They said in the keynote that M1 Ultra completes the M1 family. Timing works out well for a November launch with the 2-year Apple Silicon transition timeline they gave themselves. Not sure what they are going to call it, or if it will be A15 or A16 based.

A16 would give great performance, and I think it’s safe for them to have a two year iteration time on laptop/desktops vs one year for phone/tablet.


Hard to believe it’s already been almost 1.5 years!


I believe using Optane as fast swap could be a great way to increase RAM capacity. Not completely fair, but enough for most usages.

It is strange Apple didn't cooperate with Intel in this area.


This went by so fast I'm not sure I heard it right, but I believe the announcer for the Ultra said it was the last in the M1 lineup.

They just can't ship a Mac Pro without expansion in the normal sense, my guess is that the M2 will combine the unified memory architecture with expansion busses.

Which sounds gnarly, and I don't blame them for punting on that for the first generation of M class processors.


This is what I've been thinking as well: an M2 in a Mac Pro with 128/256GB soldered and up to 2TB of 8-channel DDR5-6400 expandable, with a tiered memory cache.


A few points to make...

- the shared CPU+GPU RAM doesn't necessarily mean the GPU has to eat up system RAM when in use, because it can share addressing. So whereas the current Mac pro would require two copies of data (CPU+GPU) the new Mac studio can have one. Theoretically.

- they do have very significant video decoder blocks. That means that you may use less RAM than without since you can keep frames compressed in flight


Also, the memory model is quite different - with the ultra-fast SSD and ultra-fast on-die RAM. You can get away with significantly less RAM for the same tasks, not just because of de-duplication but because data comes in so quickly from the SSD that paging isn't nearly the hit it is on say an Intel based Mac.

I'd expect it to work more like a game console, streaming in content from the SSD to working memory on the fly, processing it with the CPU and video decode blocks, and insta-sharing it with the GPU via common address space.

All that is to say, where you needed 1.5TB of RAM on a Xeon, the architectural changes on Apple Silicon likely mean you can get away with far less and still wind up performing better.

The "GHz myth" is dead, long live the "GB myth."


> ultra-fast on-die RAM

The RAM is not on die. It’s just soldered on top of the SoC package.

> All that is to say, where you needed 1.5TB of RAM on a Xeon, the architectural changes on Apple Silicon likely mean you can get away with far less and still wind up performing better.

No, it does not. You might save a bit, but most of what you save is the transfers, because moving data from the CPU to the GPU is just sending a pointer over through the graphics API, instead of needing to actually copy the data over to the GPU’s memory. In the latter case, unless you still need it afterwards you can then drop the buffer from the CPU.

You do have some gains as you move buffer ownership back and forth instead of needing a copy in each physical memory, but if you needed 1.5TB physical before… you won’t really need much less after. You’ll probably save a fraction, possibly even a large one, but not “2/3rd” large, that’s just not sensible.


I agree, I got a 8GB mac mini (was really just curious to see the M1 in action and the 16GB model was backordered badly) and it performs extremely well memory wise. I often use 12GB+ and I never notice unless I check activity monitor.


That's what I based my assessment on too. I had an 8GB M1 MacBook Pro (swapped for the 16GB M1 Pro MacBook Pro 14 since) and I had no issues developing iOS and macOS apps with it using Xcode, and Rust development with VSCode. This brought my old 16GB Intel MacBook Pro to its knees.


Another thing to consider is memory compression. If Apple added dedicated hardware for that, it can effectively double the total memory with minimal performance hit.


Memory compression only works in certain scenarios. It requires your memory to actually have low entropy.


This is a myth.


The Mac Pro will have replaceable RAM. It will use the RAM soldered onto the CPU as cache.

You’ll most likely also be able to buy dedicated GPUs/ML booster addon Cards and the likes for it.

It’s the most likely thing to happen or they won’t release another Mac Pro.


Why would they use soldered RAM as cache? It's not like it's faster than replaceable RAM. Unless they go HBM2, but I doubt that.


The bandwidth of the soldered RAM is much higher, which makes it much faster for code that accesses a lot of RAM, like video editors.


I think they will unveil M2 which can probably at least double the 64GB max to 128GB max RAM of M1-series.

Then, on the highest configuration, I think they actually can put 6 M2-top-specced or more into the Mac Pro.


Does the mac studio potentially replace the mac pro concept? It seems targeted at exactly the audience that mac pros targeted (ridiculous amounts of video simul-editing)


No this looks like a modular replacement of the iMac Pro. If it was to replace the Mac Pro they wouldn't have said at the end of the event "the Mac Pro will have to wait until next time".


To me, this seems to have killed the iMac Pro not the Mac Pro.


The presenter very explicitly said they are not done and they will replace the Mac Pro.

But yes, I see a lot of folks replacing current Mac Pros with Studios.


The pro is most likely going to have ram and PCIe slots.


In some ways I wish this processor was available from a CPU chip seller. As a compute engine it gets a lot "right" (in my opinion) and would be fun to hack on.

That said, the idea that USB C/Thunderbolt is the new PCIe bus has some merit. I have yet to find someone who makes a peripheral card cage that is fed by USBC/TB but there are of course standalone GPUs.



Thanks! Of course they are a bit GPU-centric but the idea is there.

Very interesting stuff. I wonder both if the Zynq Ultrascale RFSOC PCIe card would work in that chassis and if I could get register level access out of MacOS.


Yes, you can interface with PCIe devices using DriverKit, Apple's new user-space device driver platform.

No need to run inside the kernel for these things any more.

https://developer.apple.com/documentation/driverkit

https://developer.apple.com/documentation/pcidriverkit


> That said, the idea that USB C/Thunderbolt is the new PCIe bus has some merit. I have yet to find someone who makes a peripheral card cage that is fed by USBC/TB but there are of course standalone GPUs.

I hope we get closer to that long-standing dream over the next few years.

But right now you can see laptop manufacturers so desperate to avoid thunderbolt bottlenecks that they make their own custom PCIe ports.

For the longest time, thunderbolt ports were artificially limited to less than 3 lanes of PCIe 3.0 bandwidth, and even now the max is 4 lanes.


> USB C/Thunderbolt is the new PCIe bus

Oh please hell no.

I have to unplug and plug my USB-C camera at least once a day because it gets de-enumerated very randomly. Using the best cables I can get my hands on.

File transfers to/from USB-C hard drives suddenly stop mid-transfer and corrupt the file system.

Don't ask me why, I'm just reporting my experiences, this is the reality of my life that UX researchers don't see because they haven't sent me an e-mail and surveyed me.

Never had such problems with PCIe.


Friendly reminder that USB-C is a form factor, and thunderbolt is the actual transfer protocol.

Sounds like you're listing the common complaints with usb-3 over usb-c peripherals, which are not a suitable replacement for PCIe. Thunderbolt is something different, more powerful & more reliable.


Thunderbolt is more reliable, but still weirdly unstable on the software support sometimes. I don't get why we still have external monitor detection issues on macs at this day and age for instance.


Thunderbolt 3 is USB4


USB4 is the successor to USB 3.2 and TB3.


My USB-C dongle (AMD processor, so not Thunderbolt) has PD plugged into it permanently and is my "docking station" for the office, and I have to cycle its power (unplug/plug PD) to get the DisplayPort monitor that's connected to it to work, on top of the fact that there are other issues with it, especially with external drives as you also reported.

So, I'm in total agreement.


You have a very exotic configuration if you plugged your webcam and thumb drives into PCIe slots.


Webcams are often positioned as cheapish accessories, so PCIe is exotic, but for internal drives (the parent's comment wasn't about "thumb" drives) it's pretty mainstream: https://www.newegg.com/p/pl?d=pcie+drive

TBH, even a thumb drive would have me pissed if it disconnected at random times. That's what I hated about using the SD slot of MacBooks to host a semi-permanent drive.


An eGPU enclosure is exactly what you're describing - PCIe x16 over TB4. They're quite commonplace now


No, TB4 is PCIe x4, unchanged since TB3.


> This enables M1 Ultra to behave and be recognized by software as one chip, so developers don’t need to rewrite code to take advantage of its performance. There’s never been anything like it.

Since when did the average developer care about how many sockets a mobo has...?

Surely you still have to carefully pin processes and reason about memory access patterns if you want maximum performance.


Not sure what is required of a dev, but as an example, Adobe Premiere pro doesn't take any advantage of >1 CPU, at least on Windows. https://www.pugetsystems.com/labs/articles/Should-you-use-a-...


It’s probably not “average developer” either but some of the big box software still has per-socket licensing, or had until recently anyway.


CPU as in core or socket? These days most CPUs are "many-CPU-cores-in-1-socket", and having X CPU cores over 1 or 2 sockets makes a small difference, but software does not care about sockets.


Plenty of enterprise software is licensed on a per-socket basis.


And if they read this press release they will probably try to switch to per-core licensing.


Article is 5 years old.


I thought that to extract peak performance out of NUMA-based systems, you had to get down-and-dirty with memory access & locality to ensure you don’t cross sockets for data that's stored in RAM attached to other CPUs.

Or am I out of date on NUMA systems?


This is what they were referring to. To get optimum performance out of NUMA systems, you need to be careful about memory allocation and usage to maximize the proportion of your accesses that are local to the NUMA domain where the code is running. Apple's answer here is essentially "we made the link between NUMA domains have such high bandwidth, you don't even have to think about this."
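
For a rough illustration of what "being careful" usually means (a Linux-only sketch, not how macOS exposes this; the core list is hypothetical and explicit memory binding would need numactl/libnuma), pinning a process to one node's cores is the usual first step:

    # Illustrative only: pin this process to the cores of one NUMA node on Linux so that,
    # under the default first-touch policy, its allocations tend to land in that node's local RAM.
    # The core IDs below are assumed for this sketch; read the real topology from
    # /sys/devices/system/node/node0/cpulist.
    import os

    NODE0_CORES = {0, 1, 2, 3, 4, 5, 6, 7}   # hypothetical node-0 cores
    os.sched_setaffinity(0, NODE0_CORES)      # pid 0 = the current process
    print("running on cores:", sorted(os.sched_getaffinity(0)))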


The big dies these days (M1 included) have non-uniform memory access baked in because they distribute the memory caches. If you want maximum performance, you will certainly want to be aware of which "performance core" you're running in.


This line is nonsense and you can safely ignore it. There have been multi-chip-modules that act like a single socket for many years. In particular, pretty much every current AMD CPU works that way. I guarantee you that for the M1 Ultra, just like every CPU before it, the abstraction will be leaky. Programmers will still care about the interconnect when eking out the last few percent of performance.

Remember the Pentium D? Unfortunately, I used to own one.


The existing AMD CPUs aren't quite like that. Technically they are all UMA, not NUMA - the L3 cache is distributed, but they are all behind a single memory controller with consistent latencies to all cores. But the Threadripper 1st gen was absolutely like that. Straight up 2+ CPUs connected via infinity fabric pretending to be a single CPU. So is that 56 core Xeon that Intel was bragging about for a while there until the 64 core Epycs & Threadrippers embarrassed the hell out of it.


>Technically they are all UMA, not NUMA - the L3 cache is distributed, but they are all behind a single memory controller with consistent latencies to all cores.

This stuff rapidly starts to make my head spin. I have not studied interconnects and have never written any NUMA-aware software. I will just post this link (read the "Memory Latency" section):

https://www.anandtech.com/show/16529/amd-epyc-milan-review/4

As I understand it, the I/O die is partitioned into four quadrants. Each quadrant has two memory controllers and is attached to two compute dies. CPUs can access memory attached to the same quadrant with lower latency than going to another quadrant. This is a NUMA system that can be configured to appear as one logical NUMA node.

I believe their smaller parts with two or fewer compute dies will be UMA, but with the same non-uniform latency to L3.

>So is that 56 core Xeon that Intel was bragging about for a while there until the 64 core Epycs & Threadrippers embarrassed the hell out of it.

I believe the 64-core Epycs and Threadrippers came first. The 56-core Xeon was a purpose-built part for HPC, so it wasn't quite a marketing gimmick.


They are referring to the GPU part of the chip. There are two separate GPU complexes on the die but from the software point of view, it is a single large GPU.


Any application people use to justify buying a more-than-one-socket machine needs this.

E.g. simulation software often used in industry (though the one I have on top of my head is Windows-only).

Anyway, the point they make is this: if you claim to double performance, but only a select few programs (as you observed) are optimized to take advantage of that extra performance, then it is mostly useless to the average consumer. So their point is made exactly with your observation in mind: all your software benefits from it.

But actually their statement is obviously wrong for people in the business—this is still NUMA, and your software should be NUMA-aware to really squeeze out the last bit of performance. It just degrades more gracefully for non-optimized code.


I think it's more the case that OSX hasn't really had SMP/NUMA for a long time.

My understanding was that the dustbin was designed with one big processor because SMP/NUMA was a massive pain in the arse for the kernel devs at the time, so it was easier to just drop it and not worry.


The price alone is turning me back into a Linux user after 20 years. I simply cannot justify 6800 Swiss Francs (neighborhood of 7000 Euros or USD) for the max CPU/RAM and a 2TB SSD, and I cannot justify getting any less because it's soldered-down and not in any way upgradeable or repairable. Not to mention, even with AppleCare+ it's only guaranteed to work for 3 years (in Europe, I know you Americans can get long-term AppleCare subscriptions, but we don't have that option here).

This is a tragedy for the future of computing. It might as well be encased in resin. Great performance, but I won't spend car money on something I can't upgrade or repair.


Curious about the naming here: terms like "pro", "pro max", "max", "ultra" (hopefully there's no "pro ultra" and "ultra max" in the future) are very confusing, and it's hard to know which one is more powerful than which, or whether it's a power-level relationship at all. Is this on purpose, or is it just bad naming? Is there an example of good naming for this kind of situation?


It probably is on purpose. They all sound positive, full of goodness and speeds and synergies for today’s hip intelligentsia (which could be you!). You didn't settle for a MacBook with a harvested midrange chip, no, that’s a pro under your fingertips. What does that mean? Doesn’t matter, it makes you feel good.

It works for them because most of their products have only one or maybe two choices. It would never fly for white box sales, but Apple is not in that market.


I think the naming is based entirely around how the announcement sentence lands in the keynote. So they've optimized for "We're adding one last chip to the M1 family, and it's gonna blow your mind... (M1 Ultra appears on screen)".


As long as Asahi Linux is in good working order by the time the M3 Plus Ultra: Max Quantum Pro releases, I'll get one despite the name.


Didn't AMD do something similar with putting 2 CPU chips together with cache in-between? What's the difference here in packaging technology? (maybe there is no shared cache here)


They have been shipping multi die CPUs for quite a while, but the interconnect is closer to a PCIe connection (slower, longer range, less contacts).

Intel's upcoming Sapphire Rapids server CPUs are extremely similar, with wide connections between two close dies. Cross-sectional bandwidth is in the same order of magnitude there.


This doesn’t seem to be two cores connected in standard SMP configuration, or with a shared cache between them. Apple claims there were like 10,000 connection points.

It sounds like this operates as if it was one giant physical chip, not two separate processors that can talk very fast.

I can’t wait to see benchmarks.


Modern SMP systems have NUMA behavior mostly not because of a lack of bandwidth but because of latency. At the speeds modern hardware operates at, the combination of distance, SerDes, and other transmission factors result in high latencies when you cross dies - this can't be ameliorated by massively increasing bandwidth via parallel lanes. For context, some server chips which have all the cores on a single die exhibit NUMA behavior purely because there's too many cores to all be physically close to each other geometrically (IIRC the first time I saw this was on an 18 core Xeon, with cores that themselves were a good bit smaller than these).

It's probably best to think of this chip as an extremely fast double socket SMP where the two sockets have much lower latency than normal. Software written with that in mind or multiple programs operating fully independent of each other will be able to take massive advantage of this, but most parallel code written for single socket systems will experience reduced gains or even potential losses depending on their parallelism model.


AMD is currently shipping high-end CPUs built with up to nine dies. Their ordinary desktop parts have up to three. They are not built with "cache in-between". There is one special I/O die but it does not contain any cache. Each compute die contains its own cache.


I think you're referring to AMD's 3D V-Cache, which is already out in their Epyc "Milan X" lineup and forthcoming Ryzen 5800X3D. https://www.amd.com/en/campaigns/3d-v-cache

Whereas AMD's solution is focused on increasing the cache size (hence the 3D stacking), Apple here seems to be connecting the 2 M1 Max chips more tightly. It's actually more reminiscent of AMD's Infinity Fabric interconnect architecture. https://en.wikichip.org/wiki/amd/infinity_fabric

The interesting part of this M1 Ultra is that Apple opted to connect 2 existing chips, rather than design a new one altogether. Very likely the reason is cost - this M1 Ultra will be a low volume part, as will be future iterations of it. The other approach would've been to design a motherboard that sockets 2 chips, which seems like it would've been cheaper/faster than this - albeit at the expense of performance. But they've designed a new "socket" anyway due to this new chip's much bigger footprint.


Yes. AMD has had integrated CPU+GPU+cache-coherent HBM for a while. You can't buy these parts as a consumer though. And they're probably priced north of 20k$/each at volume, with the usual healthy enterprise-quality margins.


Given how low the power consumption is for the power you get, I wonder if we'll see a new push for Mac servers. In an age where reducing power consumption in the datacenter is an advantage, it seems like it would make a lot of sense.


I don’t think this will be the case until they’re more right to repair friendly


Most people don't want to repair, they want to replace. What people really want is a modular computer, the way things used to be. Right to repair is going to force legislation for schematics, at best. Even then I don't think it's going to happen. Like it or not, Apple builds integrated products now. You are the old man grumbling about the iMac not including a floppy drive.


I'm not complaining; I generally agree with you. But (I believe) for people to invest in Macs for servers they need to be able to fix and replace them faster than dealing with Apple would allow.


Honestly CPUs and memory fail very rarely. If the storage was on standard m.2 sticks and maybe the I/O was on a replaceable module in case Ethernet fails… would that be enough to assuage most server concerns?

I don’t see Apple getting back into that business. But I think they have the ability to make a good option if they want.


Given the hyperscalers' tendency to just leave failed machines in place in the rack because it's not worth the money to even swap it out, for specifically cloud type stuff the price point and density may be more relevant than the repairability.


At what point will Apple put those chips in their servers or sell server chips? It only makes sense for them to take this architecture to cloud deployments.


As more developers move to the ARM architecture by buying MacBooks (I did it last year for the first time in my life), the ARM cloud will grow very fast, and Apple needs growth, so they can't afford not to do it within a few years (they are probably already thinking of it with the M2 architecture). Regarding the exact timeline: I don't know :)


They’d have to go all-in on supporting third party OSs like Linux first. Sure, there are projects to bring linux to the M1, but enterprises that buy commercial server hardware will demand 1st party support


Knowing Apple, their version of "cloud" servers would probably be some sort of SDK that lets developers build applications on top of their hardware / software stack, and charge per usage. Kind of like Firebase, but with Apple's stack.


It will be a hard business decision for them, as at this point it's extremely hard to compete with Amazon, Google and Microsoft. Maybe they will buy up some cloud services provider, we'll see.


The major Linux providers already offer 1st party supported Linux on AWS. Both RHEL and Ubuntu instances offer support contracts from their respective companies, as well as Amazon Linux from AWS themselves. It is already here and a big force there. You can provision ElastiCache and RDS Graviton instances too.


Yes but on architectures that are already supported outside of AWS. It's not about running Linux in AWS, it's about running Linux on the chosen chip.


They likely won't revisit the Xserve, IMO. No reason to. They can't sell them at a premium compared to peers, and it's outside their area of expertise.


> They can't sell them at a premium compared to peers

Intel is believed to have pretty good margins on their server CPUs

> and its outside their area of expertise.

That's what people used to say about Apple doing CPUs in-house.


FWIW, Apple's been helping define CPU specs in-house since the 90s. They were part of an alliance with Motorola and IBM to make PowerPC, and bought a substantial part of ARM and did a joint venture to make Newton's CPU. And they've done a bunch of architecture jumps, from 6502 to 68k to PPC to Intel to A-series.

Folks who said CPUs weren't their core expertise (I assume back in 2010 or before, prior to A4) missed out on just how involved they've historically been, what it takes to get involved, the role of fabs and off the shelf IP to gradually build expertise, and what benefits were possible when building silicon and software toward common purpose.


I don't know, the performance per watt has a big effect on data-centers, both in power budget and HVAC for cooling.


I doubt they'll go after the web server market. But I wonder if they might go after the sort of rendering farms that animation studios like Pixar use. Those guys are willing to pay silly money for hardware, and they're a market Apple has a long history with.


To be fair, that exact criticism has been leveled against them multiple times before and been proven wrong.


It doesn't make much sense to me. The M1 is designed to have memory in the same package as the processor. This leads to reduced latency and increased bandwidth. Moving to off-package memory might totally destroy its performance, and there is an upper limit on how much memory can go in the package.

The M1 Ultra is already a little light on memory for its price and processing power; it would have much too little memory for a cloud host.


They could be secretly working on their own cloud platform, with their data centres having a choice between M1, Pro Max ultra instances. $$$$


For a company betting so heavily on “services,” it would be borderline incompetence if they weren’t working on this. Even just for internal use it would still be a better investment than the stupid car.


Don’t they already offer Xcode build as a service? That presumably is using Mac servers so it wouldn’t be totally out of the blue to have more Mac SaaS.


It's going to be a couple years. The guys who bought those power workstations and servers will be very peeved if it happens too quickly


It makes sense, although what I am concerned about is the cost. Apple isn't exactly known for providing services at or near cost.


I bet someone will make a bracket to hold a bunch of Mac Studios in a rack.


Unfortunately I still don't think the market has much interest in energy-efficient servers. But maybe the energy-sector crunch created by Putin's war will precipitate some change here...


Energy is probably the biggest bill for a data centre.

Lower TDP = lower electric bills and a lower air-conditioning bill. Win-win.


So then, when will common ML frameworks work on Apple? I guess compiled TensorFlow works with some plugins or whatever, where AFAIK performance is still subpar. Apple emphasizes that they have this many tensor cores... but unfortunately, to use them one has to roll one's own framework in, what, Swift or something. I am sure it gets better soon.


Apple have some serious chip design abilities. Imagine if they entered the server market, with this architecture it could be very successful.


They tried that before and flopped.

The server market is different. Companies buy servers from the low bidder. Apple has never really played in that market.


People care about performance per watt now. So they could compete. The real question is if they would support Linux. In our containerized world I can’t see their servers getting super big running macOS


The reason they dominate at PPW is because they are on TSMC's 5nm process. No one else has made a CPU on this process yet. AMD is scheduled for later this year (they are currently using 7nm).

It will be interesting to see the difference in performance and performance per watt, when both companies are on the same node.


No, it's heterogeneity, system integration, pipeline width, and instruction set as well.


Well, that’s a relief then, all any data center needs to run these days is Apple’s wide-ranging suite of in house services, such as applache, dovewin, pOSXgres, and SAPple.


In the context of the server market of course you're right. But the idea that it's a given that AMD, Intel have similar PPW chips on the way still using the x86 architecture, is not correct. There's going to be more to it than node size.


Apple's older 7nm chips still handily beat out Zen on the same 7nm process in performance per watt (and it's not even close).


Arm I believe helps a bit as well.


That was before people cared about performance per watt.

Besides for some use cases, these Mac Studios will be racked and in data centers as is.


This is pretty true. While people who buy racks consider vertical improvements, they tend to think laterally about how easy it is to expand (aka how cheap the +1 server is).


Companies also want to know that the server OS is going to be a long-term play. Apple would have a long way to go if they wanted to try that again.


My guess is they will launch their own cloud platform powered by Apple Silicon.

Click & deploy from within Xcode (I hate Xcode though.)


Are the neural engines actually used for anything, to anyone's knowledge?

Edit: Apparently in iPhones, they are used for FaceID.


The Neural Engine may be used by CoreML models. I don't know if it can be used with Apple's BNNS library [1]. You can use it with TensorFlow Lite via the CoreML delegate as well [2]. And some have tried to reverse engineer it and use it for model training [3]. A quick sketch of loading a CoreML model follows the links.

[1] https://developer.apple.com/documentation/accelerate/bnns

[2] https://www.tensorflow.org/lite/performance/coreml_delegate

[3] https://github.com/geohot/tinygrad#ane-support-broken
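
For example, on macOS you can load an already-converted model and allow Core ML to use all compute units, including the Neural Engine. A sketch assuming coremltools >= 5; the file name and input key are illustrative, and which ops actually hit the Neural Engine is decided by the framework, not the caller:

    # Run a converted Core ML model, letting Core ML pick CPU, GPU, or Neural Engine.
    import numpy as np
    import coremltools as ct

    model = ct.models.MLModel("MobileNetV2.mlmodel", compute_units=ct.ComputeUnit.ALL)
    out = model.predict({"input": np.random.rand(1, 3, 224, 224).astype(np.float32)})
    print(list(out.keys()))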


Adobe Photoshop, Premiere etc make use of it for scene detection, content aware fill, "neural filters" and so on



I think the most important use of the neural engines so far is for the internal camera postprocessing. Better camera postprocessing is the reason why people buy new iPhones.


Throw in translation, on device image labeling, stuff like on body walking/biking detection, voice recognition.


Adobe uses them in Lightroom / Photoshop for some functions.

https://www.digitalcameraworld.com/news/apple-m1-chip-makes-...


Probably object tracking in videos will be the best use of them.

Or, there will be some new form of video generation (like the ones generating video from Deep Dream etc, but something aimed at studio production) using ML that wasn't practically usable before.

It opens many doors, but it will take at least many months, if not years, to see some new "kind" of software to emerge that efficiently makes use of them.


CoreML models may run on them at macOS's discretion. If you manage to get your neural network into CoreML's format, you may use it.


Would something like huggingface transformers ever be able to support this? Or is it best fit to just use the GPU.


It's to build a giant distributed Aleph in which a preserved digitized Steve Jobs can live once again.


I thought it's also used to activate Siri by voice without any CPU usage


Or do any popular ML libraries support it?


I’m wondering what this all means for the upcoming Mac Pro.

Apple mentioned that the M1 Ultra is the last member of the M1 family. So how is the Mac Pro going to scale?

Will Apple enable “traditional” scaling by allowing multiple M1 Ultra chips to be combined in one system? Or what?

Further, how will an Apple Silicon based Mac Pro be made expandable; something that has been a corner stone feature of Mac Pros in the past?


Pretty God damn impressive. I have a MacBook Air w/ the M1 and 16GB and it's more computer than I need for my work flows. Work gave me a MacBook Pro 16" M1 Max w/ a 10-Core CPU, 32-Core GPU, 64GB Unified Memory... it is a monster.

Intel's got a lot of work to do to catch up. I think the only way Intel will catch up is to completely embrace RISC-V


The very same dual-chiplet design marcan predicted - nice!


I read “Apple unveils MK ULTRA”


Low key what if this was planned to change Google results to "Did you mean M1 Ultra?" when searching for the experiment? The CIA is using all that money for something consumers can use now!

/takes off foil hat


They say this is the M1 Ultra Benchmark https://browser.geekbench.com/v5/cpu/13330272 Wow.


Single core does not appear any better than m1 mini.


That’s not too surprising, it’s the same base chip. The core designs are identical, there are just more of them and they may run at a slightly higher clock.

I wouldn’t expect single-threaded improvement until the M2.


What are people doing with these CPUs on a desktop? I'm just watching videos, surfing, and doing some programming -- on a 5 year old Ryzen with 32GB and it seems perfectly fine. For my productivity needs, performance is mostly about I/O speed and so switching to SSD and then NVME was the biggest boost for me (Linux at work).

When I get home, it's all about the GPU on my gaming PC (Windows). It's just that the CPU doesn't seem to be a huge bottleneck for me on the desktop anymore. Are Macs different somehow, where they need more CPU?


The Mac Studio seems targeted at pro users doing things like video editing and CAD. The regular M1 is good but it's insufficient in a lot of ways. It can only power one video output, it's capped at 16GB memory, and if you try to do anything sufficiently demanding like gaming, it really shows its limits.

If you are doing CAD, things like fluid/particle physics simulations can really slam the CPU. The M1 Ultra isn't marketed to the normal user just doing some web browsing. It's the top-tier chip for people who find the M1 insufficient.


For compiling c++ code, this really matters. At home, I have a 16 core 5950x and at the office an 8 core 2700x, and it is a night and day difference. Even if it's just a few minutes of compile time, it creates a different mindset. At home: Yeah, let's just recompile and see the changes. At the office: Oh no I have to recompile, let's go to the kitchen and grab a coffee or so, I have the time.


> M1 Ultra features an extraordinarily powerful 20-core CPU with 16 high-performance cores and four high-efficiency cores. It delivers 90 percent higher multi-threaded performance than the fastest available 16-core PC desktop chip in the same power envelope.

Maybe not a huge caveat, as 16-core chips in the same power envelope probably covers most of what an average PC user is going to have, but there are 64-core Threadrippers out there available for a PC (putting aside that it's entirely possible to put a server motherboard and thus a server chip in a desktop PC case).


Is that Threadripper in anything like the "same power envelope"?


If I'm reading the graph in the press release right, M1 Ultra will have a TDP of 60W, right? A 3990X has a TDP of 280W. I know TDP != power draw, and that everyone calculates TDP differently, but looking purely at orders of magnitude, no, it's not even close.


That line is blatantly dishonest, but not for the reasons you pointed out. While the i9-12900K is a 16-core processor, it uses Intel's version of big.LITTLE. Eight of its 16 cores are relatively low performance 'E' cores. This means it has only half the performance cores of the M1 Ultra, yet it achieves 3/4 of the performance by Apple's own graphic.

Alder Lake has been repeatedly shown to outperform M1 core-per-core. The M1 Ultra is just way bigger. (And way more power efficient, which is a tremendous achievement for laptops but irrelevant for desktops.)


"in the same power envelope" is a pretty big caveat. Desktop chips aren't very optimized for power consumption.

I'd like to see the actual performance comparison.


I really wish popular companies would focus more on software optimization than hardware renovations. It's really sad to see how fast products die due to increasingly bloated and overly-complex software.


How does Apple manage to blow every other computing OEM out of the water? What's in the secret sauce of their company?

Is it great leadership? Top tier engineering talent? Lots of money? I simply don't understand.


If I had to guess, their secret sauce is that 1) they're paying lots of money to be on a chip fabrication node ahead of both AMD and Intel, 2) since their chip design is in-house, they don't have to pay the fat profit margin Intel and AMD want for their high-end processors and can therefore include what is effectively a more expensive processor in their systems for the same price, and 3) their engineering team is as good as AMD/Intel. Note that the first two have more to do with economics rather than engineering.


One thing that other people aren’t mentioning here is that they play the long game. They took their investment in the iPhone and ported it over to the Macs. They learned/improved a lot in that time. The M1 has over a decade of history behind it, in a way.

They have talent, they have execution, they have data about what runs on Macs that they can use to optimize really well. But they have the profits and the cash reserves to make big bets and wait them out.

I think the M1 was expected 1 or 2 years before it was released. But they waited. Maybe it wasn’t good enough. Maybe the software support wasn’t there. But they didn’t have to push it out anyway and hope for the best. They could afford to wait.

Maybe that makes them willing to take bigger risks. Maybe through history they just knew Intel slowing down would happen (it bit them with 68k, then PPC, then G3/4/5) and were prepared in a way only a company with its own chips could be.


I think it's mostly engineering and the cash to make things happen. You heard it today in the presentation that since they launched M1, sales have skyrocketed for Apple computers.

Hopefully leadership is really looking hard at this trend and adjusting future offerings accordingly. Consumers WANT machines with high performance and great I/O and they're willing to pay for them.

With Apple, Intel, and AMD really stepping up the last couple of years, I think the next decade of personal computing is going to be really exciting!


Put simply, it is vertical integration paired with management that is adept at playing the long game.


marketing and a forgiving audience.

You have to remember that since the 2014 retina, Apple's offerings have been a bit crap.

This is a return to form (and a good one at that), but it's not worthy of hero worship. They've done a good job turning things around, which is very hard.


Apple isn't an OEM? They don't sell products that are marketed by another company.


D) all of the above?


While I'm not on the Apple train, I love how they are pushing AMD, Intel and NVidia out of complacency. No more of these little tiny incremental improvements to milk the industry. Bring your BEST to the table or get left behind!


None of those three were anywhere near complacent before Apple released the M1. A few years ago, before Zen, absolutely, but now it's actually very competitive. But more competition doesn't hurt


Makes you wonder how many they can glue together. Looking at you Mac Pro


Probably at least four together with LPDDR5X to get up to 1.5 TB and forty cores.


This can also backfire on consumers if the competitors decide to keep up with their user-hostile practices as well: locked-down walled gardens, zero customisability/upgradability for hardware, low repairability, low interoperability with set standards to "distinguish" their products, planned obsolescence, etc.


Let's hope they don't also push their lock-in business models onto others.


Milk?


milk verb

  2 : to draw something from as if by milking: such as

    b : to draw or coerce profit or advantage from illicitly or to an extreme degree : exploit

      milk the joke for all it's worth
https://www.merriam-webster.com/dictionary/milk


I find the slow drip of M1 extensions to be kind of ehn - like the tick part of the old Intel cycle, only in this case it's literally just gluing more of the same cores together (obviously work is involved, but not at the level of an architecture rev)

(edit: calm down people, I recognize it's impressive, but it's just not as fun an announcement as an architecture rev, which I was hoping for after a year :D )


If 'gluing cores together' were this simple, every random desktop CPU would have 20 cores. That's not the case.


You're forgetting price. If simplicity was the main concern, every random desktop CPU would have 16+ cores right now.


It's literally not.


what? the unified memory growth IS an architecture rev


I wonder if the max 128GB graphics memory opens up some applications that would not have been viable before?


Not really. Thanks to the NVLink GPU interconnect we already have systems with 320GB. https://www.deltacomputer.com/nvidia-dgx-a100-320gb-3ys-edu....

There are even some with 640GB. This is at a different price point though.


640GB should be enough for anyone.


Bill said it 40 years ago and here we are, two metric prefixes more of memory later. I wonder if in the next 40 years we'll get to 640 petabytes.


Finally, enough RAM for Microsoft Teams!


Perhaps until VR becomes more mainstream and gets higher frame rates, resolutions etc.

Rendering in VR takes a lot of memory at higher resolutions.


Not on a single GPU though and it’s 40x the cost!


Rewriting a CUDA application to use NVLink is a lot easier than rewriting it for Apple's GPU.


Yep, finally Electron apps will run smoothly on consumer hardware.


Wishful thinking. Giving Electron apps more RAM is like giving a toddler with crayons more walls to scribble on.


Pro, Max, Ultra.

The board has been set. M1 Endgame is nearly ready.


With UltraFusion... UltraFabric next, to stack them all vertically.


Superman could've kicked the crap out of Ultraman, just FYI.


I wonder how long it will be until a CPU as capable as this one will be the required baseline for ordinary apps. Is there any hope that the upgrade treadmill will stop any time soon?


>Is there any hope that the upgrade treadmill will stop any time soon?

The bottleneck for most slow/old PCs is not actually the CPU. Dried thermal paste, mismatched low speed RAM sticks, operating systems installed to mechanical disks and weak iGPUs are often to blame. You can fix most of these easily at home for very little money and keep using old systems.

Most people are sadly too tech-illiterate to really understand what an SSD is, so I still sometimes do SSD upgrades on almost brand new laptops and pre-built desktops for friends and family. OEMs like to add insult to injury by bloating up the default Windows install with janky driver control panels and adware. A new SSD, a clean OS install with generic drivers, new thermal paste and some compressed air in the fans will make those PCs work better than they did on day one.

I have a few of those kind of machines around the house...


There will always be apps that use all the power you throw at them. Raytracing scales pretty much linearly with the core count. Compression does too.

But your everyday apps like your browser have been fast for at least a decade.


Yes, but your current browser is probably not very fast on a Nehalem. That is what they are lamenting. Will a M1 Ultra one day be too slow to run Calculator?

Answer: Just wait for some genius to figure out how to run Electron inside Electron, and port Calculator to it.


One day we’ll be up to our eyeballs in computronium, and it will never be enough.

Thinking is a superpower even better than being the first species to develop sight.

See also “The Last Question” by Asimov.


Haha, bloody hell, what a monster of a chip. I find my M1 Max already remarkably fast. The change is so huge. It's like in the old days when you'd get a new computer and it felt like it could do things before you could think of doing them.

But surely the GPU things can't be real? The GPU in the M1 Ultra beats the top-of-the-line Nvidia? That's nuts.


Based on Anandtech benchmarks the M1 Max GPU is basically on par with a mobile 3080, which a quick search tells me is about 60% as fast as a desktop 3080. Not unreasonable to believe two of them combined will outperform a 3090, with nearly 128GB of VRAM to boot.

Even more incredible, Anandtech reports the M1 Max GPU block maxing out at 43W in their testing. So a 90W GPU in the M1 Ultra is trading blows with a 350+ watt 3090.
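The back-of-envelope math behind that looks roughly like this (the 0.6 and 1.1 ratios are ballpark assumptions, not measured numbers):

    # Ballpark only: every ratio below is an assumption, not a benchmark.
    m1_max = 1.0                          # normalize: M1 Max ~= mobile 3080
    desktop_3080 = m1_max / 0.6           # mobile 3080 ~60% of desktop 3080
    desktop_3090 = desktop_3080 * 1.1     # 3090 roughly ~10% ahead of the 3080
    m1_ultra = 2 * m1_max                 # assume perfect 2x scaling
    print(m1_ultra / desktop_3090)        # ~1.09, i.e. roughly 3090-class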

1) https://www.anandtech.com/show/17024/apple-m1-max-performanc...


What on earth are you talking about. That link shows it’s not even half as fast as the 3060, let alone the 3080.

In borderlands it got 24 FPS while the 3080 got 52 FPS. How is that on par?


Gaming benchmarks are completely irrelevant when discussing the actual raw power of the GPU. As the other commenter said- look at the actual GPU benchmark in the first graph.

Legacy games written for x86 CPUs obviously are going to perform poorly. I recommend you actually read the review and don't just scroll to the worst gaming benchmark you can find.


> Gaming benchmarks are completely irrelevant when discussing the actual raw power of the GPU.

Maybe, but the "raw power" is useless if it can't be exploited.

> Legacy games written for x86 CPUs obviously are going to perform poorly.

Not if they're GPU-bound. Even native performance isn't that impressive


If the power is substantial enough it will get exploited eventually. Hopefully even if Metal ports don't occur the eventual Asahi-adjacent open source drivers will open the gaming doors.


There are only two real gaming benchmarks and they are both really bad for the M1. In Tomb Raider it fares even worse at 4K than it does in Borderlands.

It’s a great chip but it doesn’t trade blows with anything Nvidia puts out especially at comparable price points.

Maybe you buy things to run benchmarks. I buy them to run the software I own. For games they come up short on fps and high on price. That is the inverse of what I’m looking for.


If your interest is purely in playing unoptimized games coded for different architectures then absolutely there's better options.

However, if your workloads are in a more professional domain, as mine are, then it's entirely fair to say this chip is trading blows with Nvidia's best at lower prices. Don't forget this is an entire SoC and not just a GPU; power savings aren't irrelevant either if you actually work your hardware consistently, as I do.


If you buy a Mac for gaming, you're going to have a bad time. Look at the GFXBench 5.0 benchmark. The first graph on the page.


GPU scaling is absolutely not linear in that way. nvidia gave up on that in recent generations as without software support to match, you had situations where double 1080s were 95% as fast as one 1080 with worse frame times.

Might be nice for e.g. ML where you can effectively treat them as entirely independent GPUs but for games I would be surprised if this matches a high end GPU.


macOS will see it as one gpu.


https://www.theverge.com/22981815/apple-mac-studio-m1-ultra-...

The first benchmarks are in, and it's mostly a 40-50% improvement over the M1 Max, not 100%.


That thing has four times as many transistors as a 3090.


Although true, transistor count is only tangentially related to performance


It's certainly not the sole determinant of performance but given two reasonably solid designs, one with a vastly larger transistor budget and major node advantage to boot, I know which one I'd pick as likely winner.


Cache size is very related to performance.


That counts memory, right?


No, but it does include cache; the M1 Ultra should have 96MB of cache (>6B transistors) while Nvidia GPUs have relatively little cache. 128GB of DRAM has 1 trillion transistors.
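Rough check of those figures, assuming 6 transistors per SRAM bit and one per DRAM bit (standard cell counts, but the exact cache overheads are a guess):

    sram_bits = 96 * 1024**2 * 8          # 96 MB of cache
    print(sram_bits * 6 / 1e9)            # ~4.8B, heading past 6B once tags/peripherals are counted
    dram_bits = 128 * 1024**3 * 8         # 128 GB of DRAM
    print(dram_bits / 1e12)               # ~1.1 trillion transistors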


> The GPU in the M1 Ultra beats the top-of-the-line Nvidia? That's nuts.

We don't know yet. Apple is benchmarking against Workstation graphics cards

"production 2.5GHz 28-core Intel Xeon W-based Mac Pro systems with 384GB of RAM and AMD Radeon Pro W6900X graphics with 32GB of GDDR6"


> Highest-end discrete GPU performance data tested from Core i9-12900K with DDR5 memory and GeForce RTX 3090.

From the linked article. Apple is comparing against RTX 3090.


Given that it's basically double the performance of the Max, with massive memory bandwidth, seems reasonable to me. But Apple always fudges things a bit. Like, which Nvidia exactly is this being compared to, and under what workload exactly?


Insane claims require insane evidence. We don't have that here.

For some workloads i would not be surprised at all. But for all workloads, ...


> But surely the GPU things can't be real? The GPU in the M1 Ultra beats the top-of-the-line Nvidia?

Dubious. https://www.pcgamer.com/apple-m1-max-nvidia-rtx-3080-perform...


From the article:

> Apple even says its new GPU is a match for Nvidia's RTX 3080 mobile chip, though you'll have to take Apple's word for it on that one. We've also reached out to Nvidia to see what it might have to say on the matter.

> RTX 3080 mobile chip

> mobile chip

There's a 50%[1] (!) difference between mobile and non-mobile versions of the chip. So that's hardly a deal breaker.

[1] https://www.videocardbenchmark.net/high_end_gpus.html


The "mobile" scam in GPUs is terrible. Nvidia flat out lies about Mobile performance by giving misleading product names (same as the desktop names).


It's beyond that. The same chip might have several TDPs and drastic performance differences between models, such that a high-TDP 3070 mobile is faster than a low-TDP 3080. You end up having to get benchmarks for each particular laptop configuration.


Just for clarity, that article is about the M1 Pro and the M1 Max chips from October.


the problem on mac is the super tiny game selection


Nvidia 3090, I wonder what Relative Performance equates to.

Can't wait for the (real world) reviews to be published


Just to add: in any case Apple is solving a big problem related to limited GPU memory, which is quite cool.

Hopefully AMD, Nvidia, others can follow the trend


> But surely the GPU things can't be real? The GPU in the M1 Ultra beats the top-of-the-line Nvidia? That's nuts.

People who game on a Mac know it's a lie; the GPU for gaming on a Mac is vastly slower than recent graphics cards.


Interesting how so many of their benchmarks are based on the number of 8K streams they can play back or edit at once, but macOS doesn't support outputting at 8K resolution. Most likely crippled until Apple comes up with its own 8K monitors.


The new Studio Display appears to be 5K, so it wouldn't surprise me if the second generation of that will be 8K.


The word ULTRA appears 82 times on that page.


It’s funny how during all the years that PC elitists turned their pince-nez’ed noses up at Macs and pooh-poohed Apple, the biggest factor holding Macs down turned out to be the commodity PC chips. :)


Intel 12th gen i9 is 11% better at single core and 42% slower at multicore. https://www.cpu-monkey.com/en/compare_cpu-intel_core_i9_1290...

For most non-parallel tasks, my guess is the Intel 12900K will beat it on performance.

Intel's next generation will have 50% more cores and beat this chip at multithreading.


It sounds like the chip is fast, but I wonder if, like other M1 products, the computers built with it will be fairly restricted, like a console, in terms of the hardware they're able to use (not being able to boot off external HDDs, problems with Thunderbolt 3 compatibility in peripherals, having to use abstraction layers to run most of the software world indirectly or rely on porting projects dedicated to M1, etc.).


Maybe this is the same marketing speak as we've seen with the 1600 nits peak, and 1000 nits sustained brightness claim for the new mini led displays. Which later became 500 nits for SDR content, when ambient temperature allows. [0]

I want to see proper benchmarks before getting too excited.

[0] https://www.notebookcheck.net/The-new-MacBook-Pro-14-only-ma...


> having to use abstraction layers to run most of the software world indirectly

Nearly everything I use daily is built for M1 now.

https://isapplesiliconready.com/

And honestly, if it's not, it's a good indication that it's time to move away from that product, as they don't care about a huge segment of their users.


Dropbox and Signal are the only two I ever use, and yeah, the lack of interest in porting to M1 from both of those companies is increasing my lack of interest in their apps.


That page says that both of those apps are supported.


I believe Dropbox was in the last few months. Which was still over a year after public release of the M1.


I'm sure most extremely popular corporate software has been ported. But that's just a tiny fraction of the available software. I know most Apple users don't try anything outside their walled garden so it doesn't make a difference to them, but the vast majority of possible software has not and will not ever be ported to a form that can run on the M1 architecture. Whereas on x86 Apple computers you can compile and run pretty much everything.


Or it's actually a tiny segment of their users.


I wonder what clock rate the studio runs these chips at with the extra cooling. Frustrating that the marketing materials don't mention that.


This is a very little discussed question but one of the most interesting unknowns I think.

The pro and max come only in laptops, so the cooling difference should be quite significant, but also there is more chip and an interconnect to cool. Really looking forward to the in depth analysis of this.


"Apple’s innovative packaging architecture that interconnects the die of two M1 Max chips to create a system on a chip (SoC)"

Did they get their terminology confused? Later it says "By connecting two M1 Max die with our UltraFusion packaging architecture [...]" which also sounds like it's an MCM and not a SoC.


Ahhh, reminiscent of the G4 Desktop Supercomputer ..

https://www.deseret.com/1999/9/1/19463524/apple-unveils-g4-d...

I kinda believe em this time, but time will tell.


Credit to Apple for making a big APU, literally the biggest possible (AFAICT) on the most advanced node with the M1 Max, and now M1 Ultra. Maybe UltraFusion can be scaled up further, TBD, I think. Hopefully Intel/AMD/Nvidia can step up their game here as well.


Where be the Linux distro that can run on an M1 (ultra or otherwise)?

Without having to be a kernel hacker, that is.


This is the one that has that as a core goal:

https://asahilinux.org/


So little memory?

The now outdated Mac Pro goes up to 1.5TB; only 128GB is available here.


Their design is basically gated by the number of memory controllers: 1MC tops out at 16GB (M1), 2MCs for 32 (Pro), 4 MCs for 64 (Max), and I guess 8 MCs for 128 (Ultra which is apparently two Maxes stapled together).

Hopefully the next gen will provide more capable and flexible memory controllers, both so they can scale the top end for a full Pro-scale offering, and so there is more memory flexibility at the lower end e.g. the ability to get an M1 with 32+GB RAM, or a Pro with 64+.


They mentioned the Mac Pro replacement is still to come.


I used to be one of the biggest Apple fanboys/apologists, but I've since put Linux on my MacBook Pro from 2013 and built a Linux workstation, and I rarely use my 2020 MacBook Pro anymore. I say this because I yawned at the over-the-top Apple marketing. The products are interesting, sure, but I wasn't blown away. It's mostly the prices. The hardware is just far above what I can afford these days, even though my MacBook Pro from 2013 is so well made it still works now, and I'm sure a two- or three-thousand-dollar MacBook bought today would last just as long. It's just too much. Though I am saving for one for multimedia work, probably a used M1 MacBook Air.


Ugh, really wish that a non-Apple vendor could make an ARM chip of this calibre. Jealous, but I can't bring myself to use a proprietary OS and get locked into Apple.


There is always Asahi Linux


Last I heard not all of the chipset's features are working in Linux yet...


OK Microsoft, it's time to get started with Visual Studio ARM support.

We need to be able to run it properly in a Windows ARM VM on the M1 chips!


I was surprised to learn that the CPUs and GPUs on the M1-series chips are essentially a single unit, and for the M1 Ultra they basically slapped two M1 Maxes together.

In traditional PC building, the CPU is quite distinct from the GPU. Can anyone ELI5 what the benefits are to having the CPU closely integrated with the GPU like the M1 does? Seems a bit unwieldy, but I don't know anything about computer architecture.


Having everything in one unit sharing the same pool of memory is quite common. PCs (and older) Macs didn’t do it because the CPU and GPU evolved as physically separate parts and sharing memory was too slow.

I believe some Intel/AMD low power chips (non-performance laptops) have used this unified memory model as well.

But it became extremely common on phones where there was no historical baggage and it was thought out from day one. I believe all the consoles use a unified memory layout now, but I’m not 100% sure.

The usual limitation is you’re stuck with the on-die GPU, which can pale in comparison to a top of the line AMD or nVidia board.


I remember how AMD 3DNow! and Intel MMX were meant to render GPUs obsolete.


Every single consumer Intel processor has had a GPU on it for a decade.


Well, more reasonable than a mac pro, price wise. Might have to consider this when the time comes to replace my Ryzen9 rig.


I agree that the M1 is great, but the entire press release is full of adjectives. I've never read a press release like it.


I'm reading the UltraFusion description, and besides all the talk, what I can think of is: NUMA?


I'm surprised Apple found time outside of focusing on growing the Chinese economy to work on this.

https://www.theguardian.com/technology/2021/dec/07/apple-chi...


The Apple M1 Megazord, supporting 16 external monitors and 5 power rangers.


My concern is cooling. Any reason to believe that this new chip will run cool and not be drowned in thermal paste when installed? My Intel i7 Macbook Pro loves to idle at 65 C, and some light browsing will kick that up to 80 C, which worries me.


Can we trust the performance measurements that are listed?


Yes, but assume that they're cherry picked. Don't get me wrong, these numbers are impressive, but it claims that its GPU is faster than the highest end discrete GPU (RTX 3090) and it's unclear what benchmark it used. It's important to keep in mind that their GPUs are not architected with gaming in mind, whereas the 3090 definitely is. So it's not unreasonable to find some metrics where their GPU performs better.


And overnight, Intel's Ice Lake is again way behind.


Ice Lake shipped in 2019. The current generation is Alder Lake, which is slightly ahead of the M1 in single-threaded performance according to most benchmarks.


And massively behind in terms of power consumption


Yes, certainly. I don't think that's relevant in this case, though. Why would anyone care if their workstation CPU draws 60W or 200W? It's easy to cool in either case and the power consumption is trivial.

M1 is clearly the best design on the market for mobile devices and is merely very good for desktops. Let's keep the enthusiasm realistic.


You say it's easy to cool but that's actually not the case, for anybody that cares about noise. Any music studio is going to happily take the 60W over 200W because they record and monitor music and need the quietest machine possible in the room.

Unsurprisingly, it's called Mac Studio, as in music studio, or art studio, or what have you studio, where these things matter.

This is a machine aimed at content creators.


Higher power means more cooling, which usually means more noise. A lot of people find value in quieter machines.


>Why would anyone care if their workstation CPU draws 60W or 200W?

I care. I work from home and my main power sink is my desktop. Considering the soaring energy prices these days I really do care about what my usage is.
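Rough numbers, assuming 8 hours a day under load, 250 days a year and $0.30/kWh (all placeholder values, pick your own rates):

    hours = 8 * 250
    for watts in (60, 200):
        kwh = watts / 1000 * hours
        print(watts, "W:", round(kwh * 0.30, 2), "USD/year")   # ~$36 vs ~$120

Not life-changing, but it's not nothing if the box is actually loaded that much.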


It's behind M1, but it's worth pointing out that Alder Lake is not a power hog on "normal" workloads (i.e. gaming) when compared to its other x86 competitors. It only starts cooking itself on extremely heavy workloads.


My bad. I meant Alder Lake.


80% of the desktop market and 100% of cloud deployments won't care.


Would 80% of "the desktop market"—whatever that means—care about Ice Lake to begin with, or any high end chip at all?


What they definitely won't care about is Apple hardware at Apple prices, especially outside first-world countries.


Ya but they never have, and it seems a bit irrelevant in the context of this particular product. I'd see that being relevant if it was supposed to be a product with extremely broad appeal, but it isn't that even for first world countries.


It is relevant in the context that people keep celebrating the M1 as if Apple were going to take over the whole computing world.


I dunno, I think cloud is starting to think more and more about power consumption of the chips used, where Apple Silicon blows the competition out of the water.


Let us know when Apple starts a cloud business.


Desktop will care about noise of the fans.

Data centers are also pretty conscious of power consumption; more power means more cooling infrastructure and a higher energy bill. While it is not the top priority, it certainly is a significant factor in decision making.


Buying M1 chips from Apple isn't happening.


Not chips, servers. Apple has sold those before.


You mean the commercial failure of A/UX and X Server?

Good luck with those against Linux/FreeBSD workloads.


The point is that they have experience selling servers, not that they were good at it; selling chips they have no experience with at all.

There are plenty of applications the M1 is really good at; if Apple wanted to get into this space they could target, for example, render farms.


If you're going to be smug why not use a recent Intel chip?


PC users don't pay $4K for a computer; on PC you can get 2x the speed for half the price.


Link me to the $2K computer that's twice as fast as the M1 Ultra. Take all the time you need.


> Take all the time you need

Check your replies in 10 years and I'll be able to list a dozen ;P

But sarcasm aside yeah this chip looks insane.


It took less than 6 months for AMD / Intel to ship a faster CPU than the M1 back then. Apple's charts show performance per watt, which for a desktop PC is kind of irrelevant. In pure speed AMD / Intel are faster or will be very soon.

For graphics cards I won't even try to argue, because FPS in games on a Mac is far below an average modern card. It's not even in the same league.


> For graphic card I don't try to argue because fps on Mac are very inferior in games than a average modern card. It's not even on the same league.

The M1 is as fast as the 1650. I'm getting great frame rates at 1440P High on X-Plane

> It took less than 6month to have a faster amd / intel CPU than the m1 back then, Apple charts are showing performance / watt which for a desktop PC is kind of irrelevant. In pure speed amd / intel are faster or will be very soon.

Perf/Watt is very relevant. Electricity costs money and you also want a cool room.


> PC user don't pay 4k for a computer

I'm almost certain that's not true, especially for machines that would compete with the Studio

> on PC you can get 2x the speed for 2x less the price.

Citation needed. This hasn't been true for a long time as far as I can tell.


May I ask the $2000 desktop configuration with 2x the speed?

Of course Apple chips won’t work well for gaming, but what other benchmarks will this $2000 desktop win?


This hasn't been true since M1's release


A CPU like the 5900X is better than the M1 and costs $400.


That's one part of the equation though, not to mention it's a desktop chip. I can get a laptop with an M1 Max with hours of battery life running full tilt.

Your $400 CPU needs at least another $1000 in parts just to boot (and those aren't even the parts you likely want to pair with it).

Your cost comparison is silly. Nobody compares singular CPUs to entire machines.


People seem to think that the M1 is the uber thing in terms of performance, which it is not. It's great in many respects, especially thermals / perf per watt etc., but for many it's a PC sitting below your desk, where that's less relevant. For laptops it's a different story.


I own one and it's worse because I had to go and read random forum guides to learn about undervolting so Windows would stop turning the fan on and off constantly.


Sweet chip! The copy, my eyes just glazed over.


I wonder if the project codename was MK Ultra?


I'm waiting for the MK Ultra


"M1 Ultra Pro Max", wen?

Naming scheme aside, this is great!


Does it mine bitcoin well?


Not Bitcoin but Ethereum

https://9to5mac.com/2021/11/10/m1-pro-macbook-pro-cryptocurr...

M1 Pro -> 5.8 MH/s, with a 17W draw, means $12.82 a month profit. I don't imagine the M1 Ultra is too much better, maybe 20 MH/s at absolute most, but we'll see. It definitely won't be as economical as 3070 or 3080 FE cards at current profitability levels.
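The arithmetic is easy enough to redo with whatever the network looks like on a given day; everything below is a placeholder assumption, not live data:

    # Rough Ethereum mining estimate -- every input is an assumed value
    hashrate_mh    = 5.8       # M1 Pro figure from the linked article
    power_w        = 17
    usd_per_mh_day = 0.08      # network- and price-dependent, changes constantly
    usd_per_kwh    = 0.10

    revenue    = hashrate_mh * usd_per_mh_day * 30
    power_cost = power_w / 1000 * 24 * 30 * usd_per_kwh
    print(round(revenue - power_cost, 2))   # ~12.7, in line with the ~$12.82/month figure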


And that's at $0.10 per kWh; many residences are higher, but professional operations are closer to $0.03 per kWh, or sometimes even zero or negative.

Also note that the mining calculator they used assumes 2 Ether per block paid to miners.

In Ethereum it can be much much higher because people pay to use that blockchain. Mining can be insanely profitable and I’m not aware of any calculator that shows it. Everyone is operating on bad data. A cursory look right now shows latest blocks having 2.52 Ether in them, which is 26% greater yield.

Block 14348267 a few minutes ago had 4.83 Ether, 140% greater yield

There have been prolonged periods of time, weeks and months, where block rewards were 6-9 Ether.

Miners were raking it all in while the calculators said “2 Ether”

All this to say it could probably make $20-30 a month.


I'm sure an ASIC would best the M1.


MKUltra was a better chip


RIP Intel


Not yet. Intel's Alder Lake has mostly positive reviews.


These CPU names are terrible. When did Apple get bad at naming things?


M1 has the most powerful chip ever yet it still can't handle two monitors.


This is not relevant to the Apple M1 Ultra.

From the Mac Studio technical specifications

> Simultaneously supports up to five displays:

> Support for up to four Pro Display XDRs (6K resolution at 60Hz and over a billion colors) over USB-C and one 4K display (4K resolution at 60Hz and over a billion colors) over HDMI


> M1 Ultra can be configured with up to 128GB of high-bandwidth, low-latency unified memory

Nice! Good enough to run a Solana node!

I was slightly annoyed that the M1 Max's 64GB of RAM puts it just under the system requirements, at that premium price.

But I don't have any other theoretical use case for that much resources


Desktop marketing seems to be getting desperate. Few people use the apps they show in the Mac Studio benchmarks. Fewer still care that their chips use less power... if they did, they would stay on their phones.


Large businesses do. I bet in some places you could get grant/loan incentives to replace a PC fleet with these things.

Back in the Pentium-4 days, iirc I was able to get almost $250k in grants and $1.5M in subsidized loans to do accelerated refresh of a PC fleet and small datacenter, all through a utility's peak load reduction program.


I don't deny such things happen but it's illogical. If you have 20 people on desktops a small fraction of them will use more energy for microwaving lunch, making coffee or on air conditioning than they will save in aggregate on this nominal reduction in power draw.


At 20, yes, it's a waste of time. At 2,000, reducing power consumption by 30% may yield $100k annually.
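That only works out if the machines run around the clock; a quick sanity check with placeholder rates (150W per seat, $0.12/kWh are assumptions):

    seats, watts, cut, price = 2000, 150, 0.30, 0.12
    kw_saved = seats * watts * cut / 1000       # 90 kW across the fleet
    print(round(kw_saved * 24 * 365 * price))   # ~94,608/year if machines run 24/7
    print(round(kw_saved * 8 * 250 * price))    # ~21,600/year at 8 h/day, 250 days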


How is it nominal? The current Mac Pro (which most of the target market would already be using) has a power supply that is >1kW. It's insane that this thing is faster than the existing Mac Pro with a tiny power draw in comparison.


Microwaving lunch: 2 minutes at 800W; desktop with monitor: 8 hours at 150W, that is 45 times higher. Similar for coffee, no simple math for AC. If you can reduce 150W to 80W, it is both significant and achievable - this is what my desktop usually draws.


You forget these things sit ~idle (or at browser load) 99% of the time. The magnitude of the difference only shows up at peak consumption. Furthermore, what about the lost time/hassle of the changeover, plus the embodied energy in throwing out the last system? It's insanity.


The average corporate PC device gets replaced every 39 months.

Acceleration of that cycle with a Mac replacement, which usually has a 50-60 month lifecycle, is a pretty significant savings.


If those are real figures that's a good point :) I think we're well into pedantry now. Call a somnambulance :)


The "few people" who use those apps are the people Apple is selling these systems to.



