From time to time, at the
Hotsos conferences on Oracle performance, I've heard the phrase,
"battle against any guess" (BAAG) used in presentations. It captures a good idea: eliminate guesswork from your decision making process. Although that's certainly a laudable goal, life is sometimes not so simple; particularly when it comes to performance analysis. Sometimes, you really can't seem to determine unequivocally what is going on. Inevitably, you are left with nothing but making a guess—preferably an
educated guess, not a
random guess (the type BAAG wants to eliminate). As I say in one of my
Guerrilla mantras: even
wrong expectations (or a guess) are better than
no expectations. In more scientific terms, such an educated guess is called a
hypothesis and it's a major way of making scientific progress.
Of course, it doesn't stop there. The most important part of making an educated guess is testing its validity. That's called hypothesis testing, in scientific circles. To paraphrase the well-known Russian proverb, in contradistinction to BAAG: Guess, but justify*. Because all hypothesis testing is a difficult process, it can easily get subverted into reaching the wrong conclusion. Therefore, it is extremely important not to set booby traps inadvertently along the way. One of the most common visual booby trap arises from
the inappropriate use of logarithmically-scaled axes (hereafter, log axes) when plotting data.
- Linear scale:
- Each major interval has a common difference $(d)$, e.g., $200, 400, 600, 800, 1000$ if $d=200$:
- Log scale:
- Each major interval has a common multiple or base $(b)$, e.g., $0.1, 1, 10, 100, 1000$ if $b=10$:
The general property of a log axis is to
stretch out the low end of the axis and
compress the high end. Notice the
unequal minor interval spacings. Hence, using a log scaled axis (either $x$ or $y$) is equivalent to applying a
nonlinear transformation to the data. In other words, you should be aware that introducing a log axis will
distort the visual representation of the data
†, which can lead to entirely wrong conclusions.