Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.
I ran it on my 2006 Mac Pro, 3.0 2x dual core (quad core), with 8gb of memory, and in 32 bit mode I got a 9600. From what I've read, if I upgrade to 64bit i could be looking in the 10,500-11,000 range. Unfortunately, my HD (the boot drive) is dying, so I'm only going to boot it up once i get the replacement drive in so I can transfer my files to the new one.

I'm calculating that I'll have about 1 hour to get the computer booted, new drive formatted, carbon copy cloner downloaded, and roughly 60gb of data transferred. Does that sound possible to you guys?
 
I ran it on my 2006 Mac Pro, 3.0 2x dual core (quad core), with 8gb of memory, and in 32 bit mode I got a 9600. From what I've read, if I upgrade to 64bit i could be looking in the 10,500-11,000 range. Unfortunately, my HD (the boot drive) is dying, so I'm only going to boot it up once i get the replacement drive in so I can transfer my files to the new one.

I'm calculating that I'll have about 1 hour to get the computer booted, new drive formatted, carbon copy cloner downloaded, and roughly 60gb of data transferred. Does that sound possible to you guys?

Impossible. You must be mistaken. No way a 2006 Quad can come up with those figures. That's 8-core territory. Link it or it didn't happen.

1 hour is not enough :)
 
Yodaisms are back.

Mine:

http://browse.geekbench.ca/geekbench2/567298

...

Not too bad a score either way. :)

jas

Far, far from bad - definitely excellent Geekbench 2 (GB 2) score you have. Trade your dual X5690s for my dual X5680s to underclock (N.B. clock speed) my WolfPack1 (aka FrankHacWinTosh) some more and turbo bias it an additional step/bin and get my top GB 2 and CB 11.5 scores higher [ http://browse.geekbench.ca/geekbench2/500630 and see pic and sig below], I would like. Better yet - swap your X5690s with my dual 5675's in my WolfPack2 and underclock it some more and turbo bias it up greatly to get it's scores [ http://browse.geekbench.ca/geekbench2/566675 and Cinebench 11.5 score is 20.56 at current underclocking and turbo biasing levels] higher than those of WolfPack1, I would like even more. Trade CPU chips, would you like?
 

Attachments

  • file.php.png
    file.php.png
    144 KB · Views: 195
Last edited:
The Yodaisms continue.


If faster CPUs for that box you ever get, aware you should be that in an EVGA SR2, a pair of 2.26 GHz E5520's at a CPU ratio of 13, a Bclock of 200 and with turbo biasing enabled and set to the max, a Geekbench 2 score of in excess of 21,000 it can yield. Scores in the neighborhood of those scores for the 12-core 2.66/2.93 GHz 2010 Mac Pro did mine yield before it was gifted. Thus, in the right environment, value/performance leaders are they. Food for thought for the future of your box and the security of you wallet, this is.
 
Last edited:
If faster CPUs for that box you ever get, aware you should be that in an EVGA SR2, a pair of 2.26 GHz E5520's at a CPU ratio of 13, a Bclock of 200 and with turbo biasing enabled and set to the max, a Geekbench 2 score of in excess of 21,000 it can yield. Scores in the neighborhood of those scores for the 12-core 2.66/2.93 GHz 2010 Mac Pro did mine yield before it was gifted. Thus, in the right environment, value/performance leaders are they. Food for thought for the future of your box and the security of you wallet, this is.

I guess you don't have anything for the 1,1? :p
 
I guess you don't have anything for the 1,1? :p

You'd be guessing wrong. Three of those 1,1's, upgraded by Apple to 1,2's*, do I keep as pets, having overclocked them to 3.6 - 3.7 GHz with help from ZDnet. To calm the bus onslaught, 800 mhz memory you must get. Used mainly as C4d render slaves, they come in handy - for sure, with an 11,324** Geekbench 2 score - when they're pure. An even higher Geekbench 2 score has, to me, a certain allure***, but as for that time clock running faster when accelerated, have I yet no cure.

No gadget is safe from my tinkering. A child of the 1950's - I was and am curious about everything. So curious was I that my mom would hide her watch and my dad nailed wooden slats on the back of the TV after they nabbed me in my explorations. I also have a host of fully functional O'ced and U'ced Windows, Atari and Amiga computers in my studio. If I don't gift it, I keep it working and improve on it and find a use for it.

*Dual 3.0 GHz X5365 Clovertowns
** http://browse.geekbench.ca/geekbench2/151150
*** 3.7/3.0 = 1.23; 1.23 x 11,324 = 13,966.26+, given that the whole bus system is now running 23% faster.
Also keep in mind that 800 mhz/667 mhz = 1.199; the cushion to allow more overclocking with http://www.zdnet.de/magazin/3919221...king-tool-enhances-performance-of-mac-pro.htm
 
Last edited:
9391 in 32-bit mode.

Absolute first generation 1,1 (bought Sep or Oct 2006) but yesterday I dropped in a pair of 5355s, making it an octocore.

12 gigs RAM.

Before the upgrade it was at about 5500, and I was weary of waiting for FCP X and Compressor to get stuff done for me.

I'll get another year out of it, for sure.
 
You'd be guessing wrong. Three of those 1,1's, upgraded by Apple to 1,2's*, do I keep as pets, having overclocked them to 3.6 - 3.7 GHz with help from ZDnet. To calm the bus onslaught, 800 mhz memory you must get. Used mainly as C4d render slaves, they come in handy - for sure, with an 11,324** Geekbench 2 score - when they're pure. An even higher Geekbench 2 score has, to me, a certain allure***, but as for that time clock running faster when accelerated, have I yet no cure.

No gadget is safe from my tinkering. A child of the 1950's - I was and am curious about everything. So curious was I that my mom would hide her watch and my dad nailed wooden slats on the back of the TV after they nabbed me in my explorations. I also have a host of fully functional O'ced and U'ced Windows, Atari and Amiga computers in my studio. If I don't gift it, I keep it working and improve on it and find a use for it.

*Dual 3.0 GHz X5365 Clovertowns
** http://browse.geekbench.ca/geekbench2/151150
*** 3.7/3.0 = 1.23; 1.23 x 11,324 = 13,966.26+, given that the whole bus system is now running 23% faster.
Also keep in mind that 800 mhz/667 mhz = 1.199; the cushion to allow more overclocking with http://www.zdnet.de/magazin/3919221...king-tool-enhances-performance-of-mac-pro.htm

I am amazed by your tricks, o master Jedi. Thanks heaps!
 
If faster CPUs for that box you ever get, aware you should be that in an EVGA SR2, a pair of 2.26 GHz E5520's at a CPU ratio of 13, a Bclock of 200 and with turbo biasing enabled and set to the max, a Geekbench 2 score of in excess of 21,000 it can yield. Scores in the neighborhood of those scores for the 12-core 2.66/2.93 GHz 2010 Mac Pro did mine yield before it was gifted. Thus, in the right environment, value/performance leaders are they. Food for thought for the future of your box and the security of you wallet, this is.
Ok, I don't even know enough to be dangerous here, but what I understand is this:

My W3680 has a CPU freq. of 133MHz, and a multiplier of 25. 133x25=3.325GHz
When turbo kicks in, it adds a multiplier or two, depending on number of cores, which yields up to:
133x27=3.591GHz (3.6 on Intel's spec sheet)

What I'm missing is how you're underclocking for more performance. Are you changing the CPU frequency from say, 133MHz to 200MHz and running with the turbo bias multiplier of 27, for example? Or is it the other way around, and you lower the freq to 100MHz and use more multipliers somehow, like 100MHz x40 multiplier?

Am I close, or way off? I don't get what you're doing to underclock and boost performance while maintaining cool temps and low voltages, and I'm not even an overclocker, so I really don't know jack. I probably won't do any of this for a while, since I'd prefer to play on a machine I don't rely on for income, but the first step is always knowledge. If I get it, I might be able to move on to experimenting.
 
Ok, I don't even know enough to be dangerous here, but what I understand is this:

My W3680 has a CPU freq. of 133MHz, and a multiplier of 25. 133x25=3.325GHz
When turbo kicks in, it adds a multiplier or two, depending on number of cores, which yields up to:
133x27=3.591GHz (3.6 on Intel's spec sheet)

What I'm missing is how you're underclocking for more performance. Are you changing the CPU frequency from say, 133MHz to 200MHz and running with the turbo bias multiplier of 27, for example?
Yep, in part.
wonderspark said:
Or is it the other way around, and you lower the freq to 100MHz and use more multipliers somehow, like 100MHz x40 multiplier?
Nope, but you've but your finger on one way it might be done with SandyBees, but that is highly dependent on what the motherboard makers expose through their bios implementations.


wonderspark said:
Am I close, or way off?
You're getting warm.
wonderspark said:
I don't get what you're doing to underclock and boost performance while maintaining cool temps and low voltages, and I'm not even an overclocker, so I really don't know jack. I probably won't do any of this for a while, since I'd prefer to play on a machine I don't rely on for income, but the first step is always knowledge. If I get it, I might be able to move on to experimenting.

I'll approach explaining what I do in two ways.

1) One way involves your buying one of those new Sandy Bridge E5 2012 Mac Pros. Done! Explanantion over - well sort of. Intel is now doing for the Sandy Bridges what I did for my Westmeres, the only differences being that I exaggerate it by setting the base from which turbo kicks in lower than the factory setting, and forcing a high turbo span. If you've been following the Sandy Bridge threads, you've seen that some complain about how these new chips run at slower clock speeds. Well, so do mine. And you may have noticed that these new chips have more steps/bins for turbo boosting than did Westmeres and earlier CPU generations. Well, so do mine - just that mine are higher, in fact almost doubled that coming standard in the SandyBees, because my chips and motherboards allow it.

2) Second way is a little more complicated, but here is how it goes:

Step one - you set the CPU ratio to as low a number as your computer will allow while giving you access to all of the variables necessary to have your system perform just as it would if you wanted it to run cool forever. My system with dual 5680s (rated at 3.33 GHz) has a non-turbo ratio range of 12 to 25 for them. That 12 setting stinks as it turns off some of those variables I'll mention below - so I can only go down to 13. Factory turbo for four of those six cores is 26 (I call this stage 1 turbo) and factory turbo for two of those six cores is 27 (I call this stage 2 turbo). It's not a coincidence that if you multiply a CPU's Base Clock (or Bclock or BCLK) by its highest, non-turbo CPU ratio at factory settings, you get its non-turbo speed as we're used to referring to it, namely, 133 (BCLK) x 25 = 3,325 MHz or 3.33 GHz (isn't rounding great) for all six cores when turbo isn't activated. Try to think of BCLK as, in a sense, a measure of the number of footsteps steps within a bin - those CPU ratios are each a bin, i.e., the 12, 13, 25, 26 and 27 {also just like the one's you mentioned}. View the number of footsteps as a proxy for speed with more footsteps representing more speed. For Westmeres, the normal span of footsteps is 133 MHz. For Sandy Bridges, the span is 100 MHz, but it'll be made up by the fact that they're wearing Nikes that can push more important stuff through faster. So at stage 1 turbo, four of each one of my 5680's six cores get kicked up to 133 x 26 = 3,458 MHz or 3.46 GHz (that's rounding again) and at stage 2 turbo, two of six cores get kicked up to 133 x 27 = 3,591 MHz or 3.59 Ghz {all of this also describes your system}. But in my case at this stage because I have chosen a lower CPU ratio, namely that 13 figure, we would get the following equations: 133 x 13 = 1,729 MHz or 1.73 GHz for all six cores at idle, or even lower with Speedstep. But because of a little instuction I give my systems through something called a DSDT.aml file [like the velociraptor in Jurassic Park - "It remembers"; here, what its true potential really is], at stage 1 turbo, four of six cores will still get kicked up to 133 x 26 = 3,458 MHz or 3.46 GHz and at stage 2 turbo, two of six cores will still get kicked up to 133 x 27 = 3,591 MHz or 3.59 Ghz. This turboboost activity is occurring continuously under load, assuming the right environment exists, faster than you could ever imagine. The more cores you have, the more fresh ones you have to participate in this round robin, if you stay within spec VID. See CPU-World site to get this Intel limit for your CPU - it's the amount of voltage you want to stay under to eat your cake and then have it to and keep your system stable, cool and not gulping on humongous electrical currents. This is the environment in which turbo thrives. For you true traditional overclockers, many of the variables you've been told to turn off, are fully enabled on all of my post-2008 systems: HPET 64, turbo boost, hyperthreading, and every native power management feature, including the PCIE/video ones, Speedstep, C1E, C/P/T states - they're all enabled. True overclockers also usually shoot for running all of their CPU's cores at one top stable speed, forgetting that overclocking and overvolting don't yield linear performance, but can yield heat exponentially. I SHALL NOT OVERVOLT ANY MORE, and surely a whole lot less, unless I really,really,triple really have to and then just a little.

Step two - you increase the BCLK. Just imagine again that the 133 (Westmeres and older) and 100 (SandyBees) represent foot steps. Increasing the Bclock from 133 to, e.g., 170 makes for many more footsteps in a given span of time. By the way, note the ratio: 170/133 = 1.28. It not only applies to the CPU's speed, albeit and most importantly to only a subset of cores at a time, but it also get applied to the memory and the connection between the processors, speeding them up as well. Unfortunately with the SandyBees it gets applied to even more things [like the PCIE bus] than I care to mention, but that's another tome that I defer writing until Intel's next announcement. Now apply the equations in step one, using 170, instead of 133. It works out as follow: 170 x 13 = 2,210 MHz or 2.21 GHz for all six cores at idle. But because it remembers what its true potential really is, at stage 1 turbo, four of six cores will now get kicked up to 170 x 26 = 4,420 MHz or 4.42 GHz and at stage 2 turbo, two of six cores will now get kicked up to 170 x 27 = 4,590 MHz or 4.59 Ghz. And that's the story that gets told twice with dual cores, i.e., at stage 1 turbo, eight of twelve cores now get kicked up to 4.42 GHz and at stage 2 turbo, four of twelve cores get kicked up to 4.59 GHz. And recall that this is occurring faster than you could ever imagine. When turbo queries: "Are all environmental variables extant for me began the hunt" The answer is invariably "hell yes." The total 2.21 GHz system is cool, sub-VID, not power gulping because that's it strolling with the tortoises. Even while running benches while simultaneously rendering video in Final Cut X and animations in Cinema 4d or Maya, my core temps stay relatively very cool, albeit fluctuating greatly while in the hunt. But if you were to ask, "Why is it better to be running fewer than all cores at max?" I would respond, "That's why I call it a 'paradox' and me who loves a good paradox a 'contrarian.'" But to give you a real world reference - what is the attribute that distinguishes the fastest factory clocked Sandybee from the rest of them. If you said it was, is and will be a four core, you'd be right - fewer cores can operated faster, safely in an extremely compact CPU environment than can the many. That's another reason why my system runs cooler than many and doesn't require much voltage and does not consume lots of current - yet another reason for me to rejoice because turbo is firmly always at hand. Moreover, keep in mind that I'm not discussing a static grouping where the same cores fall in the same four or two core turbo group. Quite to the contrary, it's truly a round robin and they all get to mingle with one another in different groups, again, faster than you care to and can imagine. Then, when the load is completed and turbo isn't needed any longer, all of those cores tuck their head into the CPU's shell, returning to creeping along with the 2.21 GHz tortoises, or even slower with Speedstep, until more prey is spotted; then, they morph and the WolfPack is released again. This is how my turbo biased, underclocked, undervolted 2 CPU system (40,100) can feed with the 4+CPU/48+core predators in territory [ http://browse.geekbench.ca/geekbench2/top ] where overheated and/or overvolted, traditionally overclocked 2 CPU systems have yet to be spotted nearby. But far off on the horizon, I can see Echrei (33,344) - the famed overclocker and my mentor d00d (33,066) who taught me that when tuning a system less can be more, licking their chops, eager to join me in the hunt. I look forward to their arrival, but they may have to share future prey with my upcoming WolfPack3, my forray into the new territory soon to be dominated by the SandyBee.
 
Even while running benches while simultaneously rendering video in Final Cut X and animations in Cinema 4d or Maya, my core temps stay relatively very cool, albeit fluctuating greatly while in the hunt.

So what happens when all 12 (24 virtual) cores in your system get pegged due to a massively multi-threaded application? That should prevent either stage turbo from kicking in, correct? And you'll therefore be running slightly slower than the base 5680s?

jas
 
Nice to see that the 2008 8 cores score quite well compared to much more recent systems !

Mine does 11231 64bits. 16 gigs of ram really were a breath of fresh air with photoshop. Still have not jumped the SSD (speed)boat though.
 
So, I think I get it. It cycles (rapidly) between three pairs of cores running under clock and over clock, as in two cores running at 2.21GHz, two running at 4.42GHz, and two running at 4.59GHz... and they keep rotating those speeds between the three pairs, keeping the core temps and voltages from ever peaking. Is that right?

I could see how that could work, similar to applying the brakes harder for a shorter period as opposed to normal braking for longer in an auto racing keeps the brakes cooler by giving them less time to build up heat, and more time without friction.
 
So what happens when all 12 (24 virtual) cores in your system get pegged due to a massively multi-threaded application? That should prevent either stage turbo from kicking in, correct? And you'll therefore be running slightly slower than the base 5680s?

jas

Under load, no matter how massively multi-threaded the application (such as when running Cinema 4d and/or Final Cut Pro X, together or singularly), turboboost has never failed to load continuously. In fact, my system's environmental variables are primed for that to occur. The actual performance displayed is amazing to watch, as HD frames with shadows, caustics, motion blur and many other effects get rendered in a couple of seconds per frame. If WolfPack3 is much faster than WolfPack1 (I already know that it'll be much faster than WolfPack2 with its dual x5675s, it'll definitely change how I setup my renderfarm job allocations. The only applications that don't trigger turboboost are lightly loaded apps like mail, Safari, Word, etc.
 
So, I think I get it. It cycles (rapidly) between three pairs of cores running under clock and over clock, as in two cores running at 2.21GHz, two running at 4.42GHz, and two running at 4.59GHz... and they keep rotating those speeds between the three pairs, keeping the core temps and voltages from ever peaking. Is that right?

I could see how that could work, similar to applying the brakes harder for a shorter period as opposed to normal braking for longer in an auto racing keeps the brakes cooler by giving them less time to build up heat, and more time without friction.

What diagnostic software applications appear to be indicating is occuring under load is that for each CPU each core is rapidly moving into and out of 6 (non-turbo), 4 (stage 1 turbo), and 2 (stage 2 turbo) core groupings where for only tiny periods of time they remain in that group where they take on the speed of that group. Also when rendering, its evident from watching four quickly variable core's temps bouncing between about 30-34 degrees (their usual idle range) to the high 40's and a quickly variable couple's temps bouncing from 30-34 up to the mid-50's. Ever seen the whack a mole cartoon?
 
Under load, no matter how massively multi-threaded the application (such as when running Cinema 4d and/or Final Cut Pro X, together or singularly), turboboost has never failed to load continuously.

Now, I've missed your previous discussions regarding building/clocking/de-clocking your machines. Is this something you're actually doing on your Mac Pro? Since I've already nuked (from orbit) my warranty by replacing the Westmeres with the X5690s, I'm wondering if I can make it even faster...?

jas
 
... Is this something you're actually doing on your Mac Pro?
Nothing yet perfected on 2009 or later Mac Pro that is worthy of release - still a work in progress.
jasonvp said:
Since I've already nuked (from orbit) my warranty by replacing the Westmeres with the X5690s, I'm wondering if I can make it even faster...?
jas
In the meantime, if you don't already have it get smcfan control and swap in some Sunon fans for those at the front and back of the cage surounding the processor/memory board that you'll have to remove. I've removed mine a couple of times - it takes less than 10 minutes to install new fans. Enhancing system coolness improves performance and longevity and it's cheap.
 
http://browse.geekbench.ca/geekbench2/281305

standard EVGA SR-2 X5650 OC's from 2.66 to 3.84GHz

Could probably get close to 30K with some fine tuning.

Nope - you could robably get much higher than 30k with some fine tuning. You have the exact same setup as my mentor d00d who got 33,066. That may be less than what you can get if you follow exactly the advice at http://www.insanelymac.com/forum/index.php?showtopic=233891&st=0 . Just remember that less is more. Compare the overclocks in the pic with the Geekbench2 performances. Your system probably has much more headroom than you give it credit for. By the way, it's my understanding that d00d sold that system for about $2,500. The purchaser made out like a bandit.

Also, note well that for that Geekbench 2 score d00d was running Vcore set to "Auto!" There was no overvolting!
 

Attachments

  • d00d copy.png
    d00d copy.png
    76 KB · Views: 223
Last edited:
Nope - you could probably get much higher than 30k with some fine tuning. You have the exact same setup as my mentor d00d who got 33,066. That may be less than what you can get if you follow exactly the advice at http://www.insanelymac.com/forum/index.php?showtopic=233891&st=0 . Just remember that less is more. Compare the overclocks in the pic with the Geekbench2 performances. Your system probably has much more headroom than you give it credit for. By the way, it's my understanding that d00d sold that system for about $2,500. The purchaser made out like a bandit.

I am waiting for my Mushkin Enhanced Chronos 240GB to arrive (http://www.neweggbusiness.com/Product/Product.aspx?Item=N82E16820226237). It had a $50 coupon code which brought the total to about $220 for a 240GB SandForce 2281 drive. It should be a small upgrade over my current Mushkin Callisto Deluxe 240GB.
I am considering upgrading to 48GB (12x4GB) 1600MHz memory (which costs between $180-220).

What is the current recommendation for a booting SATA 3 solution?
I am considering http://www.apricorn.com/products/desktop-ssd-hdd-upgrade-kits/velocity-solo-mac.html.

With the new drive I will try to upgrade to 10.7.3 and try to get better Geekbench scores.
 
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.