If the N3-based M3 family brings a big performance jump, how would Apple design the M3 so that the MBA M3 doesn't get too close to the MBP M2 Pro/Max? Would Apple keep new accelerators for the M3 Pro/Max?
I think that has some suspect assumptions built in. In turn ...
1. That Apple is going to use 100% of N3 options just to raise performance to the exclusion of grabbing some power savings.
a. According to some reports the reason the the M2 Pro/Max are squatting on multiple years old N5P is that some new features 'blew' the power and/or space (die area) budget. If going to add new additional power consuming accelerators then pretty good chance going to use some of the N3 tradeoffs to claw back some power usage. Not 100% of the trade-offs spread on chasing single thread , 'top fuel dragster' , single thread performance. They don't have to spend it all on power savings but some far more balanced approach.
b. Caches are going to shrink at all from the N5P they are using now. L2 / System(L3) stuff will be just as big. If try to slap hot rod engines that crank of the number of cache misses ... not really going to be able to counter much with Apple's classic move over the last 6+ years of "throw more cache at it.". If cores get smaller than can add more cache. However, throwing the kitchen sink into the cores so they don't get any smaller (even if the N3 shrink) will likely increase the cache pressure.
( There will be some handwaving that LPDDR5X+++ will help bail out. I wouldn't be on Apple leaning of some super bleeding edge LPDDR5 process for reliable , high volume supply chain in 2023-2024. )
c. The A16 , M2 , M1 Pro/Max have all bloated out to larger die sizes than their predecessors. N3 wafers cost even more. There is very little chance Apple is going to be trying to get to the same M2/A16 die sizes with N3. And again 'smaller die' it going to put pressure on "about the same stuff smaller". I wouldn't look for some large increases in core counts anywhere. ( maybe in some 'new' accelerator logic that doesn't increase the cache pressure much. Or filling in gaps. AV1 en/decode , DisplayPort 2.1 , the HW Ray Trace that some are so desperate to have included . That isn't going to get revolutionary better Geekbench scores though. )
FlexFin could be extremely useful, but it is also new ( in tool support and volume production). It would make sense risk management wise to take the 'shrink wins' and call it a day on the first iteration on N3 family. There will be some performance uplift with the N3 move. Especially, if jumping from older N5P. Some modest clock bumps (on cores and caches) , internal network improvements , taking time to 'fix' things that didn't work as smoothly before but now do with closer distances , etc.
2. The power gains that Apple is tap dancing on with M2 Pro/Max are largely on doing almost nothing on the CPU/GPU cores. The video decode get more power efficient. That is nice by how long can keep that misdirection going? AMD is getting better at "performance on battery". They haven't gotten to parity but they are closing the gap. Same thing with dGPUs from Nvidia in laptop space. That humongous vast gulf between 3070 level performance and Apple's stuff is substantively smaller now. Still somewhat large, but it has a bit less 'wow' factor to it.
End of the day this M3 family CPU core , GPU core , etc are all going into a phone SoC. Taking the N3 trade-offs on power/performance to 100% leaning on the performance side isn't going to help that phone SoC much.
N3 wafers cost more money. So the notion of 'completely fork' the m-series off from A-series is actually likely harder because margins are going to get squeeze more. Apple isn't going to the 'poor house' , but probably not likely to arbitrary take the attitude of "throw money at all with zero concern about profit" on how much 'shared across product lines" the SoCs leverage.
A smaller chip that consumes less power is probably on the priority list. Not to the point there is zero performance uplift, but high enough priority that "more performance" is not the only design parameter on the table.
M3 might threatened the M1 Pro but that should be relatively OK. The M2 Pro has twice the GPU cores of the M2. N3 isn't going to bring some 'doubling' of GPU core counts and the individual cores won't jump that high either. Probably zero increase in CPU/GPU core count (smaller die with same size cache). There is zero rational reason to skimp on 'hiding'/'removing' the accelerators from M3 .. Especially if there was a 'huge miss' that forced them onto N5P. If there was no space ... now there is on the node progression. N3 allows them to put in a proportionally sized accelerator
without the die bloat. That why you pay more for the new fab process. To pay more and skip it is somewhat looney toons. There is enough shrink with N3 that can pull back the die size
and also put in another feature or two as long as pick something this is highly aligned with the "shrinks better with N3" properties. (e.g. can't throw N3 at bigger cache size and get smaller. More compute with relatively little additional cache is a far better 'fit' to pay more. )