....
AMD's main problem is its poor support for desktop GPUs.
View attachment 2353120
System requirements (Linux) — ROCm installation (Linux)
System requirements for AMD ROCmrocm.docs.amd.com
Last generation AMD had a desktop MI210 that was 'cut down' to be a better workstation CNDA options. This MI300 generation they went extra big. So there is no PCI-e card at all. I have a suspicion that wasn't always the 'plan'. The problem for AMD is that they are trying to get this all "off the ground and running while making money" . It likely isn't "poor support" as much as they don't want to lower the margin and give away lots of 'almost free' cards.
The more divergent base level silicon architectures that RoCm has to target , the slower the path to a more optimized product. [ It is most "mythical man month" to just claim that can just hire more bodies and throw more people at at and the software project will mature faster. That typically doesn't work for several reason. Intel hired a whole horde of folks to do their dGPUs drivers. That didn't really work well? Really not surprising viewed from outside the wealthy groupthink bubble inside Intel. Intel has blown a giant money pit hole in the ground here. AMD cannot afford the size of the hole that Intel dug; even if they wanted to try. ]
AMD has somewhat painted themselves into a corner though of not having a 'normal' PCI-e card option for the MI300 series. Nvidia has also bought themselves lots of market share by handing out "free" cards to researchers. AMD is constraining it to the top end of the mainstream GPUs for both least amount of fratricide ( on the much higher priced options) and keeping the architecture expansion the same. ( XTX, XT, and GRE are all the exact same chiplets in different combos and configuration settings).
AMD has to grow bigger in order to do broader array of stuff. Too broad , too soon is a major part of what kept AMD in second place versus Intel for a long time. RoCm working extremely well on MI300 matters way more than running on the consumer stuff.
The 7900 GRE card is going to sell for around $549. Relative to the MI200/MI300 prices, that is a far, far more accessible. Not accessible to home hobbyist on a tight budget, but that isn't really who AMD needs to get to in the short-intermediate term. It is also low enough that AMD probably should consider handing a fair number of those out for free to open source and AI/ML library maintainers. ( give away one $8,000 MI card or 14 7900 GRE , which one is going to have more ecosystem building impact? It is already a higher number. Going even cheaper to grow the freebie count gets into diminishing returns zone pretty quick. ). RoCm's library synergy needs to get broader so the MI300 -- MI400 stuff has more traction.
Also isn't going to match the "go big on VRAM" approach of MI300 to sink lower than 16GB of VRAM capacity threshold.
The other major factor though is that Microsoft has their own generic ML libaries ML APIs that AMD also needs to keep up with. Getting the mainstream GPUs covered by that should be a much higher priority than RoCm. The overwhelming vast majority of those cards are going to get deployed with Windows. Missing what may be perceived as an essential Windows API is a huge potential blow to traction. ( if AMD lets Qualcomm and/or Nvidia do a better job here, that will be losses in more than just GPU space. )
Last edited: