In the absence of any actual AS graphics programmer picking up the gauntlet, I'm going to say that there are a couple of things to keep in mind. Even though the M1PM is well endowed in terms of bandwidth, writing to and from main RAM is still going to be slower than keeping data in on chip tile caches. And there are steps you can just avoid.
The other aspect that springs to mind is perhaps trickier to evaluate the effect of, it's that programmers code to a target. This affects all cross platform benchmarking. Outside of mobile, IMRs have been the only game in town, and code and hardware has co-evolved. While you can run that code on a TBDR, it was originally coded taking the strengths and weaknesses of the target (industry standard) hardware in mind. Not only doesn't it take advantage of the benefits of TBDR, it may choose techniques that you simply wouldn't have gone for at all, if you knew that the code would run on a TBDR. You would have done things differently to get similar results with better performance. Indeed the game itself might have looked different!*
(*Ancient trivial example - the hidden surface removal made overdraw really cheap on TBDRs, so having villages and towns with lots of structures was no problem, whereas foliage (at the time handled via transparency) was a kick in the TBDR nuts. The TBDRs of the time was benchmarked using game environments that would simply have looked different, had it been know that they would be produced by that kind of architecture. But they were created based on the strengths and weaknesses of the dominant approach.)
Apples 3D developer videos are relatively easy to follow, and this sort of thing crops up in them - not only the more common case of "if you have been doing things like this, you'd gain a lot by modifying your code like that instead", but also "since this is really cheap, it's a good way to...". But it would take an active 3D-programmer to say just how much of a difference using an Apple TBDR as an original target for your code would make. I can only say that yes, it would make a difference, but don't have the real world experience to express that difference in numbers.