Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.

smalm

macrumors newbie
What you are labeling in blue I think are E-cores.
P-cluster AMX unit.

And I don't think it is a cluster of 6 Cores. Looks more like a mix of 4 P and 2 P clusters on the Pro and Max. Same with the E-cores 4 + 2 for the Pro and the Max has more than 4 ( and just harvesting the 4 core cluster that works 'better'. Perhaps a small yield management thing.
So you can't tell an AMX unit from an E-cluster....

Also I don't think any die is a chopped version of another die. Though clearly they all share a lot of common sections.
Not chopped down die, chopped down die design.

BTW, only the M3 Max GPU has this extra cache. Each quarter of the GPU has one. They seem to be related to the 4 SLC blocks.
 

Attachments

  • M3 Mac GPU Quadrant.png
    M3 Mac GPU Quadrant.png
    165.2 KB · Views: 56
Last edited:
  • Like
Reactions: altaic

Confused-User

macrumors 6502a
Oct 14, 2014
852
986
So is the general consensus (absent proper benchmarks) that:

- M3 Max looks to be a good upgrade over M2 Max for both CPU (50% more P Cores than M2 Max) and GPU (40 ray-tracing cores vs 38 non-ray tracing cores), but only if you pony up for the top model

- M3 Pro regresses on the CPU cores and memory bandwidth and is likely to have little to offer over M2 Pro unless you can take advantage of the new GPU features. Query whether in some situations it will be slower than M2 Pro and how edge case these are?

- M3 is an incremental upgrade over M2, keeping the same CPU and GPU cores, so leaning heavily on the better GPU and a small boost to CPU performance
For the Max, the big question is if various uncore aspects (NoC, etc.) are a significant win over M2. If so it'll be a bigger win that you're thinking, probably. The 30-GPU-core version may or may not be a significant improvement over the M2 Max (aside from RT which is either huge or nothing, depending on use case); the "missing" bandwidth may not have much of an impact.

The M3 Pro is a bit of an open question. Yes, the P/E distribution and memory bus are potential negatives, but I think we need to see it in action before we decide. There are likely corner cases where the bandwidth is an issue, but very few - 150GBps is still quite a lot. (That's assuming they haven't done anything super-interesting like full memory compression, which would change the story completely.) 6P+6E will always be less good than 8P+4E of the same core types, but these are not the same as the M2, so I don't think we'll have a good sense of how they compare until benchmarks come out. My bet is that there will be no meaningful set of cases where the M2 is better, but facts > bets.

For the M3, there may be small gains from the uncore along with the advantages each core brings. But yes, that was always going to be an incremental upgrade on the CPU side. RT is pretty huge, where it's used, as are a few other things like AV1 decode, and meaningless otherwise.

But for all of these, even after we have benchmarks, we *still* won't know something important. How much clock headroom do these chips have? I suspect we will see these very same M3s run faster in desktops (minis and studios). And I think the 16" will run a little faster/hotter than the 14", though probably not by all that much.
 

leman

macrumors Core
Oct 14, 2008
19,521
19,674
@smalm Can you identify the NPU? I am curious whether it’s the same used in A17 or not (as they advertise different capabilities)
 

senttoschool

macrumors 68030
Nov 2, 2017
2,626
5,482
Here is my perspective on your posts: you want to toss out uninformed, inaccurate nonsense and pretend that it deserves respect? Be prepared for pushback instead, and don't get all bent out of shape when it happens.
Go ahead, throw your jabs in.

I posted my reasons why I thought it might not be A17 Pro here: https://forums.macrumors.com/threads/are-we-sure-m3-is-based-on-a17-pro.2409293/

It's just my speculation. No one can say for sure right now that it's based on A17 Pro. Perhaps it's modified. Perhaps parts of it was ported over. Perhaps Apple cut some things out of the M3 that was in A17 Pro.

Feel free to argue against my points instead of throwing these cheap shots in. @leman already explained why he did it and I accepted it. You though? Who are you? What exactly do you contribute to this forum?
 

playtech1

macrumors 6502a
Oct 10, 2014
695
889
(That's assuming they haven't done anything super-interesting like full memory compression, which would change the story completely.) 6P+6E will always be less good than 8P+4E of the same core types, but these are not the same as the M2, so I don't think we'll have a good sense of how they compare until benchmarks come out.
I like your optimism but I think something like full memory compression would have received a call out in the presentation and the lack of performance data on M3 Pro is making me sceptical.

I have ordered a 14 inch M3 Max because M3 Pro looked a bit gimped and I think that's probably what Apple wants!
 

Confused-User

macrumors 6502a
Oct 14, 2014
852
986
I like your optimism but I think something like full memory compression would have received a call out in the presentation and the lack of performance data on M3 Pro is making me sceptical.
I agree, I think it's low probability. But there are other things they could be doing to improve effective bandwidth or reduce latency that could make a meaningful difference. We'll know when the benchmarks hit.
 

Confused-User

macrumors 6502a
Oct 14, 2014
852
986
One more thought about these chips in desktops: It's possible that they were designed to support LPDDR5X as well as LPDDR5. If so we might see that in those Macs. Like full memory compression I don't think it's all that likely, but it's far from impossible.

Edit: One more thing I forgot to mention - there was no talk about PCIe implementations (4 or 5?), nor about SSD performance. This is ripe for improvement - Macs have had (mostly) strong performance on their SSDs for a while now, but they're starting to get slaughtered by NVMe drives on PCIe 5. It would be nice if Apple did something about that too. We won't see it in the laptops or they'd have said something about it, and that's no surprise - those things suck power! But in the Mini or Studio, it could easily be done.
 
Last edited:

altaic

macrumors 6502a
Jan 26, 2004
711
484
@smalm Can you identify the NPU? I am curious whether it’s the same used in A17 or not (as they advertise different capabilities)
It’s different. I wish I had a decent die shot of the A16 to compare, but the only one I’ve seen floating around is crap quality. I suspect the M3* NPUs are a new design.
 

Confused-User

macrumors 6502a
Oct 14, 2014
852
986
It’s different. I wish I had a decent die shot of the A16 to compare, but the only one I’ve seen floating around is crap quality. I suspect the M3* NPUs are a new design.
How sure are you? The quoted performance numbers are probably different than the A17 because they're quoting different sizes (INT8 on A17 IIRC). That's not what they used for the M2 announcement, so they may have opted for consistency with that.
 

leman

macrumors Core
Oct 14, 2008
19,521
19,674
Actually, these new chips come with a new 6-core cluster design. Even the 4-core M3 follows a similar pattern. You can see big square-like structures in all three SoCs, with one corner without CPU cores. Instead, there is an AMX coprocessor in that corner. The AMX coprocessor is smaller than 2 P-cores, so the square still appears to be missing one corner. The 4-core M3 seems to have 2 more cores removed from the other 2 corners, as annotated in the graph. The red square represents the approximate cluster, the blue square represents the AMX coprocessor, and the yellow square represents the 'missing cores' in M3

That's a super interesting observation! So the ratio is now one AMX block per six cores? And it looks significantly larger in relation to the P-core than on M2, right? I am curious what this means for the cache hierarchy... having six cores sharing L2 cache could be nice for complex multithreaded workloads. Not to mention that this likely means more L2 cache for single-core operation?
 

leman

macrumors Core
Oct 14, 2008
19,521
19,674
No, the NPU in the M3/M3Pro/M3Max is different that the one in the A17Pro.

Ah, sorry, I misunderstood what you were saying. So weird... I don't really understand why they wouldn't reuse the block they already had?
 

altaic

macrumors 6502a
Jan 26, 2004
711
484
Ah, sorry, I misunderstood what you were saying. So weird... I don't really understand why they wouldn't reuse the block they already had?
No idea. FWIW, the M3 NPU looks a bit larger than the A17Pro NPU, indicating that the design difference wasn’t to save die space.
 

Aggedor

macrumors 6502a
Dec 10, 2020
799
939
The description of the CPU in the Mac keynote matches the iPhone one. The performance improvements are an exact match. Finally, the die shot images of the P fire look identical to A17 Pro. Previous cores had a different physical layout.
Just speculating, but do we know those die shot images are actually real? Or at they just CG representations of the architecture?
 

Confused-User

macrumors 6502a
Oct 14, 2014
852
986
No idea. FWIW, the M3 NPU looks a bit larger than the A17Pro NPU, indicating that the design difference wasn’t to save die space.
Fascinating and unexpected. My guess would be that the A17P was optimizing for INT8 (and INT4?) while they did something in the M3 to optimize handling 16-bit values.
 

leman

macrumors Core
Oct 14, 2008
19,521
19,674
Just speculating, but do we know those die shot images are actually real? Or at they just CG representations of the architecture?

From what I understand Apple posts real enough images. This might be CG but the layout is accurate.
 
  • Like
Reactions: Chuckeee

altaic

macrumors 6502a
Jan 26, 2004
711
484
Just speculating, but do we know those die shot images are actually real? Or at they just CG representations of the architecture?
Apple has tampered with their die shot in the past to hide the existence of the M1 Ultra, so while it’s possible that they’re not exact, I’d be very surprised if they were outright fabricated.
 

Gnattu

macrumors 65816
Sep 18, 2020
1,106
1,668
What you are labeling in blue I think are E-cores. At least on the M3 Pro and M3. Go back to post 1437 for P cores.
[Although I would bracket those vertical as opposed to horiztonal in that picture. The bottom horizontal I think is grouping two different cluster groups. ]


And I don't think it is a cluster of 6 Cores. Looks more like a mix of 4 P and 2 P clusters on the Pro and Max. Same with the E-cores 4 + 2 for the Pro and the Max has more than 4 ( and just harvesting the 4 core cluster that works 'better'. Perhaps a small yield management thing. More bulk on the Max die so can provision some 'overkill'. The space to too small for any of the other major subsystems so just use it. I think Apple is at fringe of aggregate bandwidth so not doing both active. )
No they are not E cores. I mark additionally with the E clusters in green and their AMX in white. The M3 Pro one is clearly larger.
Apple-M3-chip-series-architecture-231030_big.jpg.large_2x.jpg
 

Gnattu

macrumors 65816
Sep 18, 2020
1,106
1,668
@smalm Can you identify the NPU? I am curious whether it’s the same used in A17 or not (as they advertise different capabilities)
It is quite interesting that the NPU has different floor plan in three socs and some of them are not even in square now:
92E337B38BBB4CAC1F4EE280361A1A82.jpg
 
  • Like
Reactions: altaic

deconstruct60

macrumors G5
Mar 10, 2009
12,493
4,053
No they are not E cores. I mark additionally with the E clusters in green and their AMX in white. The M3 Pro one is clearly larger.
View attachment 2304952


Thanks, sorry about that. They shift in position relative to the P cluster on each die so I had trouble picking those out. And the Max is I/O on top and the other two are I/O on bottom (and GPUs vice versa).
 

name99

macrumors 68020
Jun 21, 2004
2,410
2,317
Well, at least let me have some fun while I am at it 😂

At any rate, I’m not trying to be abrasive. It’s just that I have very little time per post and the fine aspects of communication tend to get lost when one is in a rush.
I, for one, am on your side.
There's way too much noise being generated by people based on god-knows what, but certainly not any technical competence.
 
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.