Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.

rondocap

macrumors 6502a
Original poster
Jun 18, 2011
542
341
So on various forums, some people had the question a while back - Is the bandwidth limitation of having a Vega ii duo GPU on a single 16x any indication in the real world vs having two solo Vega ii, each with 16x pcie bandwidth on the 2019 Mac Pro?

It seemed like no one really tested this, understandable due to the cost - but I was curious. In theory, the Apple white paper says that two Vega ii solo are better for video, but there were no real world tests done.

Obviously, it is preferable to have a single Duo GPU if you can - instead of the two solo Vega ii, but only if performance was close.

I tried to test the heaviest stuff I could for video. 3D rendering seems to not have the same bandwidth limitations, so it is mostly on the video side.

Now, this is rendering and export - perhaps noise correction or real time color grading can reveal more, and I may try it.

I also took special care to make sure it was on NVME drives, as R3d Raw and Pro res 4444 XQ get pretty fast, so I did not want to bottleneck it. Also done on a Mac Pro with 28 cores.

Here are some tests, I added in dual W5700x too. They are surprisingly close most of the time for less cost.

Let's start with 6k R3d Raw 20 min clip, in Resolve, going to 4444 XQ:

1626394010171.png


Basically within the margin of error for the Vega configs, so this is not enough to show any difference.

How about R3d raw 8k, to 4444 XQ? Once again very close, margin of error.

1626394122812.png


Maybe we need more - here is a 10 min, BRAW 12k, to 4444 XQ. But once again, they are basically the same.

1626394166168.png


In the Resolve Candle test, they are the same too - and in Octane X, the same.

Here is Final Cut Pro, 6k to 422 HQ, basically the same, and other Final Cut Pro tests were similar to the Resolve as well - but generally Resolve is faster.

1626394246824.png



So it seems that at least for playing back, exporting/rendering, both configs are more or less equal.

Does anyone know what test may reveal this bandwidth difference? There must be something that does it - at least a very niche test I'd imagine. Maybe real time speed on color grading or noise reduction? Everything seems to be within the margin of error.

Or maybe there isn't anything that can really saturate the bandwidth past 8x for each GPU - or is possibly bottlenecked by something else. Like I said, I used NVME drives and kept an eye on those speeds, and 28 cores.


I have some 6900XT numbers too, but you don't want to see those - they beat everything else, lol.
 

Attachments

  • 1626394077737.png
    1626394077737.png
    5.3 KB · Views: 93

deconstruct60

macrumors G5
Mar 10, 2009
12,493
4,053
……

Now, this is rendering and export - perhaps noise correction or real time color grading can reveal more, and I may try it.

I also took special care to make sure it was on NVME drives, as R3d Raw and Pro res 4444 XQ get pretty fast, so I did not want to bottleneck it. Also done on a Mac Pro with 28 cores.
…..

Does anyone know what test may reveal this bandwidth difference? There must be something that does it - at least a very niche test I'd imagine. Maybe real time speed on color grading or noise reduction? Everything seems to be within the margin of error.

…..


Somewhat of an oxymoron to avoid disk bottleneck and need enough data to saturate 1.5-2 x16 PCIe v3 data streams . Going Multiple cameras . For example is pointed two Afterbuner cards at one Duo versus two ‘Solos’ consuming one each . ( afterburner peaks out on concurrent 8k decoder around where saturating it’s bus connection to the GPU.

Slot 1 and 3 are on their own solidarity connection to the CPU’s core backbone. What need is the other two x16 bundles that are attached to the Plex switch to ’blast‘ more than one x16 woth of data at slot 1 ( or 3 ). Can’t do that by tiptoeing around getting data off a disk ( or from some other source ).
 

rondocap

macrumors 6502a
Original poster
Jun 18, 2011
542
341
Somewhat of an oxymoron to avoid disk bottleneck and need enough data to saturate 1.5-2 x16 PCIe v3 data streams . Going Multiple cameras . For example is pointed two Afterbuner cards at one Duo versus two ‘Solos’ consuming one each . ( afterburner peaks out on concurrent 8k decoder around where saturating it’s bus connection to the GPU.

Slot 1 and 3 are on their own solidarity connection to the CPU’s core backbone. What need is the other two x16 bundles that are attached to the Plex switch to ’blast‘ more than one x16 woth of data at slot 1 ( or 3 ). Can’t do that by tiptoeing around getting data off a disk ( or from some other source ).
I guess it’s very difficult to saturate it then, unless you really target it With multi cam or other intense workloads in 8k. So in that case, for real world use 99% of the time the duo is equal to the solo Vega ii in terms of bandwidth, right?
 
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.