TLDR: It looks like, when they say the 400 GB/s bandwidth of the M1 Max's unified memory is unusually high for a laptop, they're comparing it to typical CPU RAM bandwidth, not typical GPU RAM bandwidth. I.e., more specifically, the Max's unified memory is giving the CPU an unusually high RAM bandwidth, and giving the GPU a typical RAM bandwidth (compared to other laptops in its class).
****
I've read the tradeoff between DDR and GDDR RAM is that the former provides low latency (needed for CPUs) and the latter provides high bandwidth (needed for GPUs). So it seems Apple decided they couldn't accept the high latency of GDDR RAM for unified memory, and instead used DDR RAM and increased the bandwidth to the point it was comparable to what is available from mobile-workstation-class GDDR.
For instance, a comparably-equipped (64 GB RAM, 4 TB SSD, 120 Hz screen) 17" Dell 7760 workstation laptop, with an 8-core Xeon W11955M and A4000 mobile, costs about the same (within 10%) as a 16" M1 Max ($4900 for the MBP and $5440 for the Dell, based on current pricing on Dell's website*). And the A4000 mobile offers 384 GB/s GPU RAM bandwidth (see link below).**
So it sounds like the Max's 400 GB/s isn't unprecedented when it comes to GPU bandwidth (compared to other laptops in its class), but might instead be unprecedented compared to their CPU bandwidth (?).
*At the time I configured it, a couple of days ago, they were offering an automatic discount of 35% off; they always offer heavy discounts, but I don't know if this particular discount is typical or not. Interestingly, I tried configuring it, left, and came back, and the 2nd time it offered me a higher discount--so maybe its tracks you and offers a lower discount if you don't bite the first time
.
**You don't even need to go to a workstation-class laptop to get that GPU RAM bandwidth. For instance, an RTX3080 mobile also offers 384 GB/s memory bandwidth. Also, for comparison, the A4000 and RTX3080 desktop GPUs offer twice that: 768 GB/s