So no efficiency gains from A16 to A17, performance gains came directly from frequency increase, node process gains went to GPU and other modules?
That's not quite true. The graph is noisy enough that one can't really say that.
(a) If what you were saying were true, we should be able to see A14 as clearly different from A17. But A14 and A17 fall (for the purposes of the noisiness of this graph) on the same line. Or are you going to suggest that from A14 to A17 there has been zero improvement in IPC?
The best you can really do is consider upper and lower lines. Better IPC means more of the points congregate closer to the "lower" (ie lower and to the right) line, and even by eye it looks like A14 points are more towards the upper line; A17 points more towards the lower line.
(b) Simply increasing frequency without dropping IPC requires constant new smarts. If Apple doesn't give us more IPC over the next few designs, I'll be sad! But I don't think it's true that there is no more IPC to be gained, OR that Apple's current team don't know how to get there.
I think it's more the case that every design is a compromise. Right now Apple's MOST IMPORTANT task is to deal with scalability, to satisfy the highest end customers who might consider ditching them for a 64 core AMD design with kick-ass nVidia GPU.
To that end, I expect every aspect of "this year's" (ie the A17 and M3 designs) was considered with that in mind. Get right the things that matter at the high end (and are essentially invisible on a phone!) like the new coherency protocol, GPU work distribution, larger (or more) CPU clusters as I described, VM support, probably TLB support for very large RAM sizes (eg large page support). Add in to the CPU what was "easy" or ready to go (wider, higher GHz) but nothing more than that. Apple aren't dumb (in spite of the internet crowd who feel the SoC team is now a collection of dribbling morons); they are surely well aware of the points I keep stressing, like how simply adding more resources (eg wider) without changing algorithms, has limited value.
But THIS YEAR is not about optimizing the algorithms (with the time and risk that takes), it's about scaling up to match a large AMD+nVidia system; and phones clearly will benefit the absolute least from that work...
I mock Intel occasionally (and the crazier Intel supporters frequently!) but honestly I'm sympathetic to the choices they have made with MTL, for the same reasons.
MTL is (IMHO) a poor underlying design direction, but given that IS the direction you have chosen, the decision that the cores are essentially identical to the Raptor Lake cores is the same sort of risk management. Focus on the part that is tricky, and MATTERS MOST for the overall strategic direction (in Intel's case, getting all the chiplet to chiplet communication, clocking, and power balancing correct) and leave the difficult core improvements to next year.