We already have an Apple Silicon GPU on the M1, but I assume you meant a discrete GPU of some description. There is nothing that specifically ties dGPUs to x86 chips - ARM chips can (and do) support PCIe and would be able to access third-party GPUs given appropriate drivers.Could we see an Apple Silicon GPU based on the M1 technology? Or that works different and we need x86?
How do they increase 20% yearly? I thought to increase speed you need to add more transistors, cores, or make them closer by nanometers? What I am asking is what do they do yearly that makes it 20% faster that they couldn't do last year?
Apple has various options for providing more powerful GPUs with the Mx chips:
1) Increase the number/capability of the GPU cores on the primary silicon die. This of course makes the die larger increasing the possibility of manufacturing failures, but the M1 is relatively small (c. 120mm^2) compared to larger CPUs and especially GPUs (a GTX 3080 has a die area of 628mm^2)
2) Have a separate GPU die on the same Mx package, connected via a proprietary interconnect. This potentially allows the GPU to be larger, and to have custom SoCs with the same CPU cores but differing levels of GPU capability, but still maintain the unified memory architecture and high performance.
3) Have a completely separate GPU package (similar to current MacBooks and iMacs) that is connected via either PCIe, or some new interconnect. The challenge here is how to make it work with unified memory. I think AMD has some technology that does this, allowing the GPU to directly access system memory, in addition to using its own VRAM.
As for CPU year-on-year performance increases, it depends on several things:
1) Run at higher clock frequency, which is possible with smaller transistors (less power required, so can increase the clock frequency to stay at the same TDP)
2) Better / faster cache and memory management
3) Architectural improvements the CPU execution mechanism (e.g. predictive branching) that increase the possible Instructions-per-clock.
4) Use of Application Specific blocks on the SoC - to speed up particular operations like file compression, encoding, lexical handling. This is a strong area for Apple.
I sure there are many other possibilities. It's a case of getting a few percent marginal gains for each change, that leads to a significant overall improvement.