You probably misread my post. I didn't say ST performance doesn't matter I said hardcore/extreme ST performance doesn't matter in everyday workflows except video gaming. Most of single thread apps are already efficient and they don't require extreme performance, even some single thread CAD apps will never peak M2 single thread performance when you use them.
What single thread workflow you got that you need M3 Max single thread performance compared to M1 single thread performance? I would really like to know. If there is such thing it must be very niche.
I think I understand what you're trying to say and it's much more reasonable than what I initially thought you were saying about ST/MT performance (though I still disagree, I would say ST performance is still more important for reasons I'll get to, but as a preview of my arguments efficiency here matters - race to sleep for ST matters). However, both this post and your subsequent longer piece is still an indirect contradiction or at least negation of your original statement that "Intel/AMD will catch Qualcomm before the other way around etc..." which is what I was responding to originally and what prompted this conversation. Let me explain: for both MT and ST, that's already happened except in ST performance per watt. Qualcomm's Elite multithreaded performance is already caught up to AMD/Intel and AMD's H300 multithreaded efficiency has caught up to the Elite as well, potentially so has LNL's (some of that is a consequence of the compromises in the Elite SOC design which I'll get to). And the ST performance of any of them is hardly extreme. All of these are mobile chips, not overclocked enthusiast builds running on liquid nitrogen here, and all have similar enough ST performance - often trading blows (though of course Intel in its marketing materials claims to have the best, as of yet unconfirmed).
Further, even if what you were saying wrt ST/MT performance was 100% accurate and it isn't ST performance is still the most important, ST efficiency would still be massively important to the user's experience, especially given that they are mobile chips. And for the case that you assert predominates users' workflows, hybrid, low thread count MT performance/efficiency, is also heavily impacted by ST performance/efficiency as you might imagine. As such, Qualcomm should have a substantial advantage in such workloads. And I don't disagree with that light MT is a crucial aspect to test. In fact, it's why GB changed its approach to testing MT in fact and their MT overarching benchmark includes, is even dominated by, such "non-scalable" MT workloads. They still test pure, scalable MT because those workloads can still matter, but the resulting MT "score" is an average of these two kinds of MT workloads as much as "FP" and "Int". As an aside, this is one reason why I am in favor of the idea that, when one is really getting into the weeds on performance comparing multi-test benchmarks, one ignores the top line figure entirely and just compare the subtests. But that's getting into a different topic. Qualcomm's (and Apple/ARM's) ST perf/watt is pretty unassailable, for now, and that does have knock on effects for designing SOCs and even getting the most out of scalable and non-scalable MT as well.
That said, you can probably tell from my previous posts that I think the Qualcomm Elite chip has serious issues. Where Qualcomm erred in its SOC design from my perspective as a tech enthusiast, not an expert, and even if I'm right they may not have had much of a choice as I'm sure some of these decisions were made from necessity rather than by preference, is as follows:
1A. no E cores for low power operations and light MT
1B. too much prioritizing of MT throughput not allowing a true fan-less design even in cutdown models (to be fair Strix Point seems to be similar - LNL may have gotten this balance right)
1C. Not even achieving the theoretical max of MT throughput, 12 P-cores should really be doing better and something about their SOC design, maybe heat (again no E-cores) or lack of cache/fabric bandwidth seems to be hampering MT performance, by about 20% by my estimation from another site
2. underperforming iGPU and, so far, no dGPU design wins, making it unattractive for gamers even if the driver/software issues weren't there and gaming occupies an outsized portion of the mindshare in the PC space
3. LATE - M2 core equivalents on N4 in 2024 simply don't cut it especially if you're having to fight software compatibility issues that your main rivals aren't
This isn't to say that the Elite project is a failure, for any V1 chip it would be almost remarkable not to have any of these issues*, but I think mostly everyone disagreeing with you on the ST/MT stuff agrees that Qualcomm's V2 needs to be a huge step up and not just in the SOC(s)/cores but everything around that if it is to successfully fight for meaningful market share in the (non-Mac) PC space.
*while the M1 had issues and delays itself - especially delays it took a year to release the M1 Pro/Max and well over for the M1 Ultra - it is overall remarkable how well the Apple transition went. More than performance, this is where total control of the product stack helps enormously.