No. Does anyone?So you consider addition of SME "not indicative of a new [CPU] architecture"?
OK then.
No. Does anyone?So you consider addition of SME "not indicative of a new [CPU] architecture"?
OK then.
M4 is not a full fledged new generation product. While Apple is free to use whatever naming scheme they want, the reality is that M4 does not use a new architecture. It uses a slightly modified (to boost yield and reduce cost) tech process. All semi companies but Apple simply refused to use N3B because it was a dud. Apple was in a pickle because they had to stick with the annual iPhone release schedule. But they knew the process was bad, so, while still working on the N3B based chips they probably started re-spinning the layout for N3E. The small time gap between M3 and M4 is not an indication of some sort of acceleration of the development/release cycle.
The sentence you highlighted in your link ("M4 builds on the GPU architecture of M3...") concerns the GPU, not the CPU. Just above it is a sentence where they promise a big CPU performance boost.Clock speed depends on many factors, not just architecture. Apple explains what's new in M4 here and new architecture is not listed. Unless Apple is so modest as to not mention the new CPU architecture (while boasting about the new display engine), it's not a new architecture.
I did not highlight this sentence, Google search did. I just copied the link. But I was talking about the entire article and, in it, Apple did not claim new CPU architecture. It's funny, that Apple did not claim it but Apple fans do. CPU architectures are not developed in 6 months. uarch update may improve the performance but it does not constitute new architecture.The sentence you highlighted in your link ("M4 builds on the GPU architecture of M3...") concerns the GPU, not the CPU. Just above it is a sentence where they promise a big CPU performance boost.
Also, Apple marketing is not exactly where I go for reliable technical information about Apple's CPU microarchitecture. It's not exactly the kind of thing they put a lot of effort into. We'll know a lot more once Apple updates their CPU Optimization Guide - a developer oriented doc - with all the details on M4.
Sign In - Apple
developer.apple.com
I think it's funny that you claim M4 isn't a full new generation product, yet at the same time you acknowledge clock speed depends on many factors. One of those factors is... microarchitecture! Clock speed isn't exclusively a product of process node.
Also funny is that we do have reasonably good evidence that CPU microarchitecture did change. If M4's CPU was just M3 ported to N3E plus the end result of AMX morphing into SME, you would expect all non-ML single core test results to land at about 4.40 GHz / 4.05 GHz = 1.086 = 108.6% in this comparison:
What we actually see: some are clustered around 108%, others very much aren't, and some of the ones which aren't are definitely not tests you'd expect SME to be used in. (For example, Navigation and HTML5 Browser.) There's also some which fail to reach the 108% bar. This is exactly the kind of spread you expect to see with a CPU uarch update - some things fared better than others.
In general when I see your posts I know exactly what to expect: all Apple's grapes are sour, nothing they do is any good. You've got one note to play, and you play it a lot!
I think what the internet is missing is that, ultimately, perf/ghz does not matter. It's always perf/watt that is important and Apple just increased perf/watt in a massive way (assuming no drastic increase in power).I think it's funny that you claim M4 isn't a full new generation product, yet at the same time you acknowledge clock speed depends on many factors. One of those factors is... microarchitecture! Clock speed isn't exclusively a product of process node.
I don't know how to respond to this. This indicates a level of cluelessness that's truly scary.No. Does anyone?
SME is an extension to architecture. It is fairly common for CPUs from the same architecture generation to support or omit some extensions. For example, Intel Xeons will have AVX-512 and their desktop siblings won't, but they will still share the same core architecture generation.I don't know how to respond to this. This indicates a level of cluelessness that's truly scary.
Do you have any idea what SME is? Do you know what it entails in terms of changing the CPU?
Would you also not consider the addition of, say, AVX to be a change in architecture?
Let's put it differently, what WOULD satisfy you as being "a change in architecture"?
SME is an extension to architecture. It is fairly common for CPUs from the same architecture generation to support or omit some extensions. For example, Intel Xeons will have AVX-512 and their desktop siblings won't, but they will still share the same core architecture generation.
Many consider a new microarchitecture to be a redesign, not an upgrade. Thus, many consider Zen 3 an upgrade of Zen 4, but Zen 5 a new microarchitecture.If you don’t consider it a a new architecture, then you also should be prepared to argue that Zen4 or Alder Lake are not new architectures.
Has Apple changed the decoder to adopt SME?Do you know what it entails in terms of changing the CPU?
And then comes the time where you have to shoot the engineer to get the project out of the door...I see a similar concept. They have building blocks of different parts of a SoC. In several states of functionality. When they have to freeze the design, it could be a block more advanced would be available just a little time latter. Bad luck. But you have to freeze once. You can’t forever.
May I know how you came to the conclusion that the AMD slide shows a new design, while the Apple slide does not?Many consider a new microarchitecture to be a redesign, not an upgrade. Thus, many consider Zen 3 an upgrade of Zen 4, but Zen 5 a new microarchitecture.
Has Apple changed the decoder to adopt SME?
Honestly, I don't know if M4 can be considered a new microarchitecture. What I have tried to convey, perhaps poorly, is that AMD considers Zen 5 a new microarchitecture, while Zen 4, does not. I am under the impression that Zen 1, Zen 5 and maybe Zen 3 can be considered new microarchitectures, while Zen 2, Zen 3+, Zen 4 are upgrades of the previous version.May I know how you came to the conclusion that the AMD slide shows a new design, while the Apple slide does not?
Well, if you ask me, it doesn't really matter if it is a new architecture or not. At the end of the day, it is how much performance the CPU architect and designer can squeeze out of their designs that counts.Honestly, I don't know if M4 can be considered a new microarchitecture. What I have tried to convey, perhaps poorly, is that AMD considers Zen 5 a new microarchitecture, while Zen 4, does not. I am under the impression that Zen 1, Zen 5 and maybe Zen 3 can be considered new microarchitectures, while Zen 2, Zen 3, Zen 3+ are upgrades of the previous version.
Honestly, I don't know if M4 can be considered a new microarchitecture. What I have tried to convey, perhaps poorly, is that AMD considers Zen 5 a new microarchitecture, while Zen 4, does not. I am under the impression that Zen 1, Zen 5 and maybe Zen 3 can be considered new microarchitectures, while Zen 2, Zen 3+, Zen 4 are upgrades of the previous version.
So it’s agreed that no one knows what “new” means, and no more bike shedding 👏Well, if you ask me, it doesn't really matter if it is a new architecture or not. At the end of the day, it is how much performance the CPU architect and designer can squeeze out of their designs that counts.
The half-power comparison was to M2, not M3. And let’s be real, that claim is likely specific to certain workloads taking advantage of new arch features.So it’s agreed that no one knows what “new” means, and no more bike shedding 👏
Anyway, back to the topic, if Apple’s claims about the M4 operating at 1/2 the power of M3 are true, and if the geekbench scores are legit (where the M4 is much better than the M3 Pro), that’s ****ing amazing. That means that the fabled “double Ultra” is feasible in the Studio. Plus, with LPDDR5X, they could go up to 10700 at similar power, which would be a marked increase in memory throughput. Better start bitching about how you’re being obsoleted before it’s cool!
Half power compared to M2 is more impressive. Not sure what you’re getting at 🙃The half-power comparison was to M2, not M3. And let’s be real, that claim is likely specific to certain workloads taking advantage of new arch features.
Honestly, I think we need to wait for this to reach the Mac to be sure when it comes to Battery Life. I do think the chip is amazing and more capable in the M4 Iteration.
I really loved the fact that they decided to push forward with more E Cores and keep improving them, that's where I wanted them to take the base and Pro chips.
Main QUESTION is whether NEON is now SVE, and if so whether it's 128b SVE or 256b SVE.
I'm not so sure.
If you look at SME (especially the latest SME2.1 stuff, eg
https://reviews.llvm.org/D137571 )
so much of it, in hindsight, seems motivated by AMX functionality. For example LUTI2 and LUTI4 seem to match AMX lookup table stuff [since AMX1, apparently to support quantized weights] along with the strided 2 and 4 vector loads that were added to M3 AMX.
Is this going to be a Ship of Theseus argument?I don't know how to respond to this. This indicates a level of cluelessness that's truly scary.
Do you have any idea what SME is? Do you know what it entails in terms of changing the CPU?
Would you also not consider the addition of, say, AVX to be a change in architecture?
Let's put it differently, what WOULD satisfy you as being "a change in architecture"?
Is this going to be a Ship of Theseus argument?
I don’t know the criteria for a “new” architecture.
I believe Apple's claim was half power at the same performance as M2. This is likely a comparison between a M2 core running at its highest performance state, and a M4 core running at a reduced clock speed chosen to approximate the performance of the full-speed M2.Half power compared to M2 is more impressive. Not sure what you’re getting at 🙃
I think you’re right about that. Still an impressive improvement, though.I believe Apple's claim was half power at the same performance as M2. This is likely a comparison between a M2 core running at its highest performance state, and a M4 core running at a reduced clock speed chosen to approximate the performance of the full-speed M2.
I expect M4 P cores in their highest performance state to still use about the same amount of power as Apple's P cores always do - somewhere around 5 to 6 watts. It seems to be the target they aim for.
Most likely everything M Pro below will get it because it's portable and on the go devices.They did move the Pro to 6E already so it’s good to see the base chip get this too. What I’m curious about is if the A18 gets this upgrade as well.
Apple are engaged in on-going work (that moves a little more each year) to split the OS up into more and more pieces that can run independently on separate cores. Obviously this is a goal that every OS vendor strives for in the age of multi-core; Apple's nothing special in this respect, just the techniques they will use will be optimal for the structure of Darwin.Most likely everything M Pro below will get it because it's portable and on the go devices.
I'm curious the approach they will take with the M4 Pro but I do hope they go this route, but based on this chip alone I think it's safe to say (same goes for M3 Pro)