One difficult concern about GBx is that it measures peak performance in a relatively brief test. This is fine, we do like to know how fast it can go. But it means that an Intel or AMD processor is running at "Turbo" for the test (somewhere between 4~6GHz).
In the real world, "Turbo" is simply not sustainable for very long. Even if you pour liquid nitrogen on the CPU to keep it from overheating, you still face the limitations of the memory system, which will eventually lead to processor starvation.
The M2 runs at around 3.5~3.7GHz (P-cores) and may drop to about 3.0, but it is getting scores in the same general neighborhood as x86 devices at some 60% of clock, and most importantly, it can maintain that pace pretty much indefinitely.
x86 machines have the high-clock capability, and good for them. ARM processors do not have a burst mode (at least, not on the same order of magnitude) but rely on getting the work done at a steady, practical pace – and, interestingly, they seem to be able to do just about as much work at a much lower clock.