The Register may have shed some light on why the X925 doesn't include SME instructions.
Not really.
Arm really wants chip designers to migrate to using Armv9, which brings in more neural network acceleration on the CPU side. And that's kinda why Arm has this beef with Qualcomm, which is sticking to Armv8 (with NEON) and custom NPUs for its latest Nuvia-derived Snapdragon system-on-chips. You've got the likes of Apple on one side using Armv9 and SME2 in its latest M4 chips, and Qualcomm and others on the other side persisting with NPUs. Arm would be happier without this fragmentation going forward.
As it offers 3nm shake-and-bake Cortex-X925, A725 processor designs for phones, PCs
www.theregister.com
This part is a bit too "loose" with terms.
" ... Specifically, we're talking about Armv9's
Scalable Matrix Extension (SME2) as well as its
Scalable Vector Extension (SVE2) instructions. ..."
The SVE2 was introduced in 2017
Arm has released early technical details of two new technologies for its A-Profile architecture. These are Scalable Vector Extension v2 and Transactional Memory Extension.
community.arm.com
SME came with Armv9-A in 2021:
" ... SME builds on the Scalable Vector Extensions (SVE and SVE2), adding new capabilities to efficiently process matrices. Key features include: ...
...
A new operating mode is added, Streaming SVE Mode. When in Streaming SVE Mode, the new SME storage and instructions are available, as well as significant subset of the existing SVE2 instructions. When not in Streaming SVE mode, behavior is unchanged from SVE2. Applications can switch between operating modes depending on what is needed. ...
"
In this blog, read the details for Scalable Matrix Extension (SME). This is a new extension from the latest Arm Vision day announcement for Armv9-A.
community.arm.com
SVE2 predates SME it doesn't "belong to it". The "Streaming SVE" stuff 'belongs' with SME. Arm "General Matrix Multiply" is part of Armv8.6 also.
The Arm Application-profile (A-profile) architecture targets high-performance markets, such as PC, mobile, gaming, and enterprise.
developer.arm.com
Similarly SVE came in with Arm v8 alongside Neon. When 'new' lots of these features are 'optional'. Can be "v9" and not implement SME (implementors can 'cherry-pick' features until they go 'required' ) 8.1 will have a different set of required features than 8.8. Likewise in the 9.x series. SVE2 being 'older' means it is closer to get to the 'required' status than SME is. That is part of the issue for this generation.
If the X925 had SME then the A520 they they put into the same compute tile die would also need SME. That would make the A520 get bigger. ( or you would have to turn it off in the X925 and it would pragmatically 'dead' silicon , e.g., AVX-512 in Intel's gen 12-13. ). If Arm is going to have three layers of highly coupled CPU cores then they need to roll SVE2 out to all of them before moving on to SME. That is just going to take time.
The notion the article presents is that Arm is in SME right now or bust mode. SME not being in the X925 is more demonstrative that isn't true. Besides Apple has a NPU even though they have AMX/SME.
( Converstational capture/generation might want an NPU to run all the time where the CPU may or may not need to run . Depends upon how big and coupled the model is to the rest of the computations running for an interactive app. )
" ... Now Arm has taken that shake-and-bake approach to personal or client devices, and will offer complete physical implementations of the above new Cortex CPU and Immortalis GPU core designs under the banner of
Compute Subsystems for Client. These designs were made with the help of TSMC and Samsung, ..."
(also from the linked article.)
I think Intel is in that fab mix also. ( as the article points out the implementation issues get coupled into the system issues. If Arm is packaging mulitple types of cores then they overall system has an impact on which features roll out at which times. )
P.S. Recently rumors that Arm wants to get into the "AI chips" market also.
" ...
SoftBank-owned Arm is reportedly preparing to add AI chips to its product portfolio starting in 2025.
The chip designer plans to set up an AI division with prototypes available in spring 2025 and mass production handled by contract manufacturers starting in fall,
reports Nikkei. ..."
Plus Foxconn defends Wisconsin efforts, Korea to spend on chips, and more
www.theregister.com
they can't be tooooooo far in the anti-matrix-from-CPU-cluster camp.