My opinion:
The one on the right scores a little higher.
But the difference between the two runs is too small to base a purchase decision on. That kind of testing needs several runs (launch, quit, launch again, repeat) to show anything meaningful.
Why do you ask?