Makes total sense to me - I think its a great benchmark test, very well thought out to be simple and repeatable, yet give good data.
I cant do anything about the inevitable variation between different people's tests, but I can make sure its minimised within my tests to make them as repeatable...