Sources say that Zen might have 19 stage pipeline, for conditional branches and 22 for mispredict branches. Which means the core clocks can be at least at the level of Bulldozer, which have had 15-16 stage pipeline.Actually I do not know really much about CPU architectures and what affects what, however...
https://browser.primatelabs.com/v4/cpu/88332
Isn't all this because memory performance need's to be double checked in very wide CPU core counts, and it takes time, and affects the overall performance of the CPU even on the Single Thread front?
So lets take a simple math based on this score for wide Intel CPU: 3000 points from Geekbench/3600 MHz of core clock x 1440 MHz of Zen core = 1200 Geekbench points.
Naples scored 1141 pts in the same test.
1141/1200 = 0.95
1200/1141 = 1.05
On paper, Zen architecture is very similar in single thread design(Floating Point, Integer, registers, cache uops).
http://www.anandtech.com/show/10591...t-2-extracting-instructionlevel-parallelism/7
Lets compare Zen to Broadwell on this front.
Integer Registers: Zen 168, Broadwell 168
Floating Point Registers: Zen 160, Broadwell 168
Retire Queue: Zen 192, Broadwell 192
Dispatch/cycle: Zen 6 uops/cycle, Broadwell 4 uops/cycle
Retire rate: Zen 8/cycle, Broadwell 4/cycle
Load Queue: Zen 72, Broadwell 72
Store Queue: Zen 44, Broadwell 42
Only thing that can let-down Zen at this point is the quality of silicon(not likely, as there will be two designs: 14 nm HP GloFo for server and most likely HEDT CPU parts, but that last bit remains to be confirmed, and 16 nm FF+, for... Zen APUs), and the clocks to how high levels it can clock the HEDT CPUs, and the bandwidth of cache itself. And of course, Zen is not able to work with AVX512. And that the CPU will emulate AVX256, from double the 128 bit feature set.
Those are just my thoughts on this. Lets focus on the architecture itself for this discussion.
So the only thing that is unknown are the core clocks. Interesting in this context is that Bulldozer chips have had 125W TDP, and 8 cores.
Before I thought that what we will actually see will be 8 core, 125W, and 3.5 GHz base core clock with 4 GHz Turbo CPU.
Right now Im not sure this will be the case...
Before you will say that 19-22 stage pipeline is too long. Look at FP and Integer Registers, Retire and Dispatch queue. All of this mitigates the latency created by that long pipeline, alongside the Core Clocks. At least this is how I interpret how CPU's work. Im not very fond of how they work(never been interested really, because there was always one brand for me: Intel).