Are there any real world examples of applications taking advantage of 128GB+ of VRAM?

quarkysg · Sep 26, 2022

theorist9 said:
However, the GPUs in the DGX are connected by NVLink, so I thought that was answered (in the negative) by your quote from the NVIDIA developer, who said "NVLink provides a fast interconnect between GPUs, but does not aggregate those GPUs into a single logical device."

I read this as each dGPU having its own kernel processes running, with each of those processes potentially able to read data from the other dGPUs' VRAM. If that is the case, the GPU kernels probably have to be coded with this in mind, which makes the GPU code more complicated.

Apple's UMA with UltraFusion makes this a lot more elegant IMHO.

singhs.apps · Sep 27, 2022

theorist9 said:
It would be cool if they could, but what about this?:

Plus Maxon (the maker of Redshift) says:

"Redshift does not combine the VRAM when using multiple GPUs. This is a limitation of current GPU technology and not related to Redshift in particular." [emphasis mine.]

https://support.maxon.net/hc/en-us/articles/1500006456701-When-Redshift-uses-multiple-GPUs-is-their-memory-combined-

https://help.otoy.com/hc/en-us/articles/360054367272-Hardware-Guide-for-OctaneRender

Read up on NVLink (section 3) for Octane. Basically, Octane treats the second NVLinked GPU's VRAM as fast out-of-core memory (NVLink on the 2xxx series has about 100 GB/s of bandwidth, much faster than PCIe and system RAM), but you will need around 3x-4x system RAM to fully saturate the combined VRAM of the NVLinked cards and keep Octane chugging along.
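As a rough illustration of that rule of thumb, here is a minimal sketch. It assumes the "3x-4x" figure is meant relative to the combined VRAM of the linked cards, and the 11 GB card size is a hypothetical example, not a number from the Octane guide.

```python
def system_ram_range_gb(vram_per_card_gb, cards=2, low=3, high=4):
    """Return (min, max) system RAM in GB under the 3x-4x rule of thumb.

    Assumption: the multiplier applies to the combined VRAM of all cards.
    """
    combined_vram = vram_per_card_gb * cards
    return combined_vram * low, combined_vram * high


# Two hypothetical 11 GB cards linked over NVLink: 22 GB combined VRAM.
low_gb, high_gb = system_ram_range_gb(11)
print(f"Suggested system RAM: {low_gb}-{high_gb} GB")
```

Under that reading, the system RAM requirement grows quickly with card count, which is part of why large scenes get expensive on multi-GPU PCs.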

singhs.apps · Sep 27, 2022

senttoschool said:
Why would this ratio be linear? I assume that macOS takes a static amount of the memory but macOS does not take more memory if you have more.

I tried searching for the 128 GB ultra redshift to see if the ratio holds but didn’t find much info.
It would be a relief actually if the ratio doesn’t hold at higher ram capacities.

senttoschool · Sep 27, 2022

singhs.apps said:
I tried searching for the 128 GB ultra redshift to see if the ratio holds but didn’t find much info.
It would be a relief actually if the ratio doesn’t hold at higher ram capacities.

Everything I know about software engineering suggests that apps do not take an amount of RAM that scales linearly with what is installed. If they do, there's some really bad programming going on.

If a Mac has 1TB of RAM, does macOS automatically consume 200GB of RAM? No.

singhs.apps · Sep 27, 2022

senttoschool said:
Everything I know about software engineering suggests that apps do not take an amount of RAM that scales linearly with what is installed. If they do, there's some really bad programming going on.

If a Mac has 1TB of RAM, does macOS automatically consume 200GB of RAM? No.

I could be wrong, but I got the sense that Redshift couldn't use more RAM because the system didn't allow the extra headroom.
Needs more investigation and less opaque responses from Apple and developers alike.

l0stl0rd · Sep 27, 2022

singhs.apps said:
I tried searching for the 128 GB ultra redshift to see if the ratio holds but didn’t find much info.
It would be a relief actually if the ratio doesn’t hold at higher ram capacities.

Here you go

singhs.apps · Sep 27, 2022

l0stl0rd said:
Here you go

View attachment 2081581

So the ratio holds (kind of).
I wonder if it’s an OS level hard limit or Redshift is the limiting factor.
Also, how is the scaling versus the M1 Max 64 GB variety?

senttoschool · Sep 27, 2022

singhs.apps said:
So the ratio holds (kind of).
I wonder if it’s an OS level hard limit or Redshift is the limiting factor.
Also, how is the scaling versus the M1 Max 64 GB variety?

It doesn't hold.

64GB - 42GB = 22GB
128GB - 96GB = 32GB

Either way, I'm not sure what's going on here. macOS does not/should not take up more RAM if you have more RAM. It's possible that these tests are not standardized and one person ran it with more apps open than the other.
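The two readings of those numbers (a fixed reserve versus a fixed fraction) can be checked side by side. This is a minimal sketch using only the figures quoted in this thread; the labels are just for display.

```python
# (total unified memory, memory reported as available to Redshift), in GB,
# as quoted in this thread.
reports = {
    "M1 Max 64GB": (64, 42),
    "M1 Ultra 128GB": (128, 96),
}


def reserve_and_fraction(total_gb, available_gb):
    """Return (GB held back, fraction of total that is available)."""
    return total_gb - available_gb, available_gb / total_gb


summary = {name: reserve_and_fraction(*v) for name, v in reports.items()}
for name, (reserved_gb, fraction) in summary.items():
    print(f"{name}: {reserved_gb} GB held back, {fraction:.1%} available")
```

With these inputs neither the held-back amount nor the available fraction is the same on both machines, so the two data points alone do not settle which model applies.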

mi7chy · Sep 27, 2022

Mr Screech said:
At 1024x1024 I get noise, but it works with 960x960.
Using about 38 GB of RAM on the Ultra 128GB.

This was the output when using 'apple tree'
View attachment 2081406

There's a VRAM optimized version of stable diffusion that can output up to ~1152x1088 with 6GB VRAM with processing chunking. <3 minutes with laptop 3060 6GB to output 1152x1088 with 30 sampling steps.

https://github.com/basujindal/stable-diffusion

singhs.apps · Sep 27, 2022

senttoschool said:
It doesn't hold.

64GB - 42GB = 22GB
128GB - 96GB = 32GB

Either way, I'm not sure what's going on here. macOS does not/should not take up more RAM if you have more RAM. It's possible that these tests are not standardized and one person ran it with more apps open than the other.

https://redshift.maxon.net/topic/41339/m1-ultra-performance/335

It appears the ratio holds. A user is reporting 48 GB available to Redshift (6 GB up from 42 GB, if I remember correctly) on his M1 Max. I have an M1 Max MBP. Will test myself later this week.

sunny5 · Sep 27, 2022

DaVinci Resolve might use tons of VRAM.

quarkysg · Sep 27, 2022

singhs.apps said:
It appears the ratio holds. A user is reporting 48 GB available to Redshift (6 GB up from 42 GB, if I remember correctly) on his M1 Max. I have an M1 Max MBP. Will test myself later this week.

Really depends on the applications being used. Some applications allocate more buffer memory if they see the system has more to offer, thus reducing the "VRAM" left for the GPU.

If we just look at memory consumed when macOS boots up and plot it against total memory installed, we will likely see it plateau. The more memory installed, the more memory the OS needs to keep track of all the memory pages, but installed memory grows far faster than that bookkeeping, even before any apps ask for memory.

Most OSes will also reserve some memory for essential services like networking, file I/O, etc., so the more memory that's installed, the more will be reserved, but I would think that also plateaus after a certain size. And if memory is sitting unused when a file is loaded, most OSes will use it to cache the file, but those cached regions are freed when applications ask for memory and no free regions remain.
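To put a rough scale on the page-tracking point, here is a back-of-the-envelope sketch. The 16 KiB page size matches Apple silicon, but the 64 bytes of bookkeeping per page is a purely hypothetical figure chosen for illustration, not a measured macOS value.

```python
PAGE_SIZE_BYTES = 16 * 1024   # 16 KiB pages, as used on Apple silicon
METADATA_PER_PAGE_BYTES = 64  # hypothetical per-page bookkeeping cost
GIB = 1024 ** 3


def tracking_overhead_gib(total_gib):
    """Estimate memory spent tracking pages for a machine with total_gib of RAM."""
    pages = total_gib * GIB // PAGE_SIZE_BYTES
    return pages * METADATA_PER_PAGE_BYTES / GIB


for total in (64, 128, 1024):
    overhead = tracking_overhead_gib(total)
    print(f"{total} GiB installed: ~{overhead:.2f} GiB tracking overhead "
          f"({overhead / total:.2%} of total)")
```

Under these assumptions the bookkeeping does grow with installed memory, but it stays a fraction of a percent, far too small to explain tens of gigabytes being held back.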

In short, I don't think there's any magic ratio.

singhs.apps · Sep 28, 2022

quarkysg said:
In short, I don't think there's any magic ratio.

No one’s talking about any magic ratio.
If the same task is run on two systems, one of which has more RAM, all else being equal, it's fair to expect more RAM to be made available for that task on the system with more, if needed.

But in both cases only 75% of system RAM is allocated. It doesn't matter whether system RAM is 64 GB or 128 GB.
64 GB - 48 GB allocated
128 GB - 96 GB allocated
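One way to tell a proportional cap from a fixed reserve is to fit a line through the two reports and look at the intercept. This is only a sketch using the figures quoted in this thread; two points always fit a line exactly, so it cannot show that the relationship is really linear.

```python
def fit_line(p1, p2):
    """Fit available = slope * total + intercept through two (total, available) points."""
    (x1, y1), (x2, y2) = p1, p2
    slope = (y2 - y1) / (x2 - x1)
    intercept = y1 - slope * x1
    return slope, intercept


# 48 GB of 64 GB and 96 GB of 128 GB, as reported in this post.
print(fit_line((64, 48), (128, 96)))
# 42 GB of 64 GB (the earlier report) and 96 GB of 128 GB.
print(fit_line((64, 42), (128, 96)))
```

A zero intercept with a slope below 1 is what a pure percentage cap looks like, while a fixed reserve would show a slope of 1 and a negative intercept. The 48 GB figure gives the former; the earlier 42 GB figure gives neither cleanly, which is why the two sides of this thread read the same reports differently.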

Xiao_Xi · Oct 30, 2022

singhs.apps said:
I wonder if it’s an OS level hard limit or Redshift is the limiting factor.

Apple publishes a recommended maximum amount of GPU memory for an application to use.

For 32GB and 64GB, those recommendations are:

I couldn't find it for 128GB.

Metal Compute on MacBook Pro - Tech Talks - Videos - Apple Developer

Discover how you can take advantage of Metal compute on the latest MacBook Pro. Learn the fundamental principles of high-performance...

developer.apple.com

Xiao_Xi · Oct 30, 2022

Xiao_Xi said:
3D rendering

This presentation from the Blender Conference explains how the PC world can render large scenes on GPUs.

Some examples of large scenes.

A potential Mac Pro would simplify it.
