As a simple example, a higher performance system might handle a number of transactions in a fixed time test. If during the baseline test, you generate a total of 1,000 transactions, but during a subsequent test, after code enhancements, you generate a total of 2,000 transactions, only the first 50 percent of the values were reproduced during the second test. Although the same random number seed was used for both runs, the workloads the two systems were tested against were different. Depending on the random numbers generated in the second half of the test, you may over or underestimate the impact of the performance improvement over the baseline performance.