Not only faster but more efficient at 2.2x greater performance per watt. 11/2/2022 will be interesting to see if AMD RDNA3 can answer. Guessing things will remain the same with Nvidia #1, AMD #2 and Apple last behind $329 Intel Arc A770.
4090
12287.86 points / 450W = 27.3 points/W
64GPU M1 Ultra
1371.46 points /
~110W = 12.5 points/W
View attachment 2094381
There is a context missing there though. Nvidia 3rd gen specialized hardware RT (and 2nd-3rd gen software library). AMD/Intel partial RT hardware . Apple little fixed function hardware at all.
Card | blender | Watts | pts/W |
| | | |
Nvidia 4090 | 12265 | 450 | 27 |
| 5055 | 320 | 16 |
Intel 770 | 1630 | 225 | 7 |
Intel 750 | 1607 | 225 | 7 |
Intel 750M | 1379 | 150 | 9 |
| | | |
M1 Ultra 64 | 1371 | 110 | 12 |
M1 Ultra 48 | 1242 | 83 | 15 |
| | | |
M1 Max 32 | 818 | 55 | 15 |
M1 Max 24 | 712 | 41 | 17 |
| | | |
AMD 6800 | 1531 | 250 | 6 |
AMD 6700 | 1308 | 175 | 7 |
AMD 6650 | 1021 | 175 | 6 |
AMD 6600 | 1008 | 132 | 8 |
AMD 6700M | 1082 | 135 | 8 |
In the pts/W, Apple is clearly out in front of the Intel/AMD mobile offerings. With no specialized hardware RT they are approximately matching 2nd Gen 3080 metrics. So spending no extra die space on this and doing better than Intel and AMD who did.
I highly doubt Apple is using the desktop 3090 or 4090 as a 'goal metric' for this next iteration. If Apple could cover the desktop AMD 6800 with a Max sized die with 32-40 core GPU with no TDP Watt increase, then I suspect they would declare that a 'win'. [NOTE: the 3080Ti laptop is 3802 ] If there is any 4000 series target then it would be the 4080M/4070M. Apple is stalking the largest mobile dGPUs run inside of larger laptops. Whatever they scale up from there is what they get to cover the upper desktop range. Where the Intel 750M is in mobile GPU deployment ( roughly equal power to the M1 Ultra ) the result isn't all that different. I don't think they are going to 100% catch the mobile 4070M, but covering a W6800X duo with an "two die SoC" would likely get trumpeted as a win.
If Apple adds some specific RT hardware in next iteration or two then the pts/W isn't going to be very far off at all.
that '27' is largely fixed-function hardware difference where have taken the bulk of the work off of general computation cores. Nvidia actually has less room to move "more" to fixed function at this point. ( Nvidia tossed NVLink from the die to toss in more specialed hardware. For the Studio/Mac Pro destine dies it is unlikely Apple is going to toss UltraFusion for more specialized cores. So likely mostly waiting on higher transistor allocation budgets (e.g., TSMC N3 and after ) for major moves.