Comments on The Pith of Performance: PostgreSQL Scalability Analysis Deconstructed

Matteo, Indeed, it is possible to talk about res...

2012-04-30T09:43:03.865-07:00

Matteo,

Indeed, it is possible to talk about response-time (R) scalability and, from the USL standpoint, we can derive R from the throughput data (X). I explain how to do all this in my upcoming Guerrilla classes.

If we take the Postgres data (above) as the example, then what we can calculate from the USL model is the mean R: as in, statistical mean (average) or 1st moment. The median is p50 and is not a moment of the underlying statistical distribution, which we generally don't know. If I were to plot mean-R for the Postgres data, I already know that it should have the classic "hockey stick" shape. To the degree that it doesn't, we have to explain why not.
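As a minimal sketch of that derivation (not the method taught in the classes), the mean R can be obtained from a USL throughput curve via Little's law, R(N) = N/X(N) - Z, where Z is the think time. The function names, the parameter values (X1, ALPHA, BETA) and the default Z = 0 are assumptions chosen for illustration; they are not fitted to the Postgres data.

```python
def usl_throughput(n, x1, alpha, beta):
    """USL throughput: X(N) = N*X(1) / (1 + alpha*(N-1) + beta*N*(N-1))."""
    return n * x1 / (1.0 + alpha * (n - 1) + beta * n * (n - 1))


def mean_response_time(n, x1, alpha, beta, think_time=0.0):
    """Mean R from Little's law: R(N) = N / X(N) - Z (Z = think time)."""
    return n / usl_throughput(n, x1, alpha, beta) - think_time


# Hypothetical parameters, NOT fitted to the Postgres measurements.
X1, ALPHA, BETA = 1000.0, 0.02, 0.0005

loads = [1, 2, 4, 8, 16, 32, 64]
curve = [(n, mean_response_time(n, X1, ALPHA, BETA)) for n in loads]

for n, r in curve:
    print(f"N={n:3d}  mean R = {r * 1000:.3f} ms")
```

Only the mean comes out of this calculation; nothing here says anything about the median or other percentiles of R.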

With regard to your point about identifying queueing models, most queueing models compute measures that are characterized by the statistical mean. We may also be able to calculate higher moments from the assumed distribution in certain cases.

So, on the one hand, we would prefer to have sample moments (average, variance, etc.) from the data to compare with any queueing models. Percentiles (e.g., p50, p90, p95), on the other hand, are merely a way of producing a ranked categorization of the sampled data.

Hi Dr. Gunther, I have a question that is maybe mo...

2012-04-30T05:26:17.342-07:00

Hi Dr. Gunther,
I have a question that is maybe more linked to response time analysis than scaling (if you allow such a distinction). Many performance tools and collectors report service time metrics in summary form, such as: avg time, median, 90th, 95th, 99th percentiles. Is it possible from these numbers to understand which distribution they belong to (exponential, power law, normal, etc.)? I'm wondering if this would help as an indication of the correct queueing model to be used (I'm re-reading Section 2.11, "Generalized servers", of your great "Analyzing Computer System Performance with Perl::PDQ" book).
Thanks
MatteoP
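One rough way to approach this question, sketched under assumptions: for an exponential distribution with mean m, the p-th quantile is -m ln(1 - p), so the median should sit near 0.693 m and the 90th percentile near 2.303 m. The snippet below compares a reported summary against those values. The summary numbers are hypothetical, and agreement on a few quantiles is only consistent with an exponential distribution; it does not establish it.

```python
import math


def exponential_quantile(mean, p):
    """Quantile of an exponential distribution with the given mean."""
    return -mean * math.log(1.0 - p)


def relative_deviation_from_exponential(mean, reported):
    """Relative gap between each reported percentile and the exponential one."""
    deviations = {}
    for p, value in reported.items():
        expected = exponential_quantile(mean, p)
        deviations[p] = (value - expected) / expected
    return deviations


# Hypothetical tool output (milliseconds), invented for illustration.
summary_mean = 10.0
summary_percentiles = {0.50: 6.9, 0.90: 23.0, 0.95: 30.0, 0.99: 46.0}

deviations = relative_deviation_from_exponential(summary_mean, summary_percentiles)
for p, d in sorted(deviations.items()):
    print(f"p{int(p * 100)}: {d:+.2%} relative to exponential")
```

Large deviations in the upper percentiles would hint at a heavier or lighter tail than exponential, which is where a formal goodness-of-fit test on the raw samples would be more informative than summary numbers.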

I believe a list of basic reading resources could ...

2012-04-19T03:03:22.588-07:00

I believe a list of basic reading resources could help any technical lead who measures performance. Apart from your blog, I think these well-known links helped me, but I had to search hard for them. These are the basics, though.

http://www.itl.nist.gov/div898/handbook/index.htm

The desk reference of statistical quality methods

What techniques do you recommend for fitting data to distributions?

Is something like "Goodness-of-fit techniques" by Ralph B. D'Agostino and Michael A. Stephens recommended?

Also http://cran.r-project.org/doc/contrib/Ricci-distributions-en.pdf

Thanks for the enlightenment.

The reason I ignored the points where N>32 is b...

2012-04-13T10:51:13.178-07:00

The reason I ignored the points where N>32 is that the data were collected on a system with 32 cores. So there are multiple factors limiting scalability here. Where clients <= cores, we have one set of bottlenecks, principally due to lock contention within PostgreSQL but perhaps also partly due to operating system or hardware effects. However, once the number of clients exceeds the number of cores, we're bound to hit a wall: if all the available CPU resources are already in use, adding more clients can't really continue to improve throughput. What I want to measure is whether it's possible to add throughput by adding more hardware resources (cores), NOT whether or not throughput will flatten out when we run out of cores. The answer to the latter question seems pretty self-evident: if we're efficiently using every core, then the best we can hope for when we run out of cores is that throughput will remain stable. In reality, of course, it will drop off slightly, because task-switching is not perfectly efficient.
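To illustrate the effect of that cutoff, here is a minimal sketch that fits the USL coefficients with and without the points where N exceeds the core count. It uses the linearized form N/C(N) - 1 = alpha(N-1) + beta N(N-1) and ordinary least squares through the origin, which is one common way to fit the USL and not necessarily what was done for the post. The data are synthetic, generated from assumed coefficients with an artificial plateau above 32 clients; they are not the PostgreSQL measurements.

```python
CORES = 32  # from the comment above: the test system had 32 cores


def usl_capacity(n, alpha, beta):
    """Normalized USL capacity C(N) = N / (1 + alpha*(N-1) + beta*N*(N-1))."""
    return n / (1.0 + alpha * (n - 1) + beta * n * (n - 1))


def fit_usl(points):
    """Least-squares fit of (alpha, beta) from (N, C(N)) pairs.

    Uses N/C(N) - 1 = alpha*(N-1) + beta*N*(N-1), a regression with
    no intercept, solved through its 2x2 normal equations.
    """
    suu = suv = svv = suy = svy = 0.0
    for n, c in points:
        u, v, y = n - 1, n * (n - 1), n / c - 1.0
        suu += u * u
        suv += u * v
        svv += v * v
        suy += u * y
        svy += v * y
    det = suu * svv - suv * suv
    alpha = (suy * svv - suv * svy) / det
    beta = (suu * svy - suv * suy) / det
    return alpha, beta


# Synthetic data: assumed coefficients, plus a slightly sagging plateau
# once clients exceed cores (a stand-in for running out of CPUs).
TRUE_ALPHA, TRUE_BETA = 0.03, 0.0002
clients = [1, 2, 4, 8, 16, 24, 32, 48, 64]
plateau = 0.97 * usl_capacity(CORES, TRUE_ALPHA, TRUE_BETA)
measured = [
    (n, usl_capacity(n, TRUE_ALPHA, TRUE_BETA) if n <= CORES else plateau)
    for n in clients
]

alpha_fit, beta_fit = fit_usl([(n, c) for n, c in measured if n <= CORES])
alpha_all, beta_all = fit_usl(measured)

print(f"N <= {CORES}: alpha={alpha_fit:.4f}  beta={beta_fit:.6f}")
print(f"all points: alpha={alpha_all:.4f}  beta={beta_all:.6f}")
```

In this synthetic case the restricted fit recovers the coefficients that generated the data, while including the plateau points pulls the estimates away from them, because those points reflect the core limit and not contention inside the software.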

Laks, You are quite correct but someone pointed o...

2012-04-11T20:24:59.472-07:00

Laks,

You are quite correct, but someone pointed out the same typo via email earlier today. It is now corrected. Thank you for pointing it out.

Should the first Normalized Capacity/Throughput eq...

2012-04-11T20:12:43.589-07:00

Should the first Normalized Capacity/Throughput equation be the other way around, i.e.,
normalized capacity at N: C(N) = X(N) / X(1)?

Please ignore if you have spotted it already.

Cheers
Laks
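For reference, the normalization in question, C(N) = X(N)/X(1), amounts to dividing each measured throughput by the single-client throughput. A small sketch, with throughput values invented purely for illustration:

```python
def normalized_capacity(throughput_by_n):
    """Return C(N) = X(N) / X(1) for each measured load N."""
    x1 = throughput_by_n[1]
    return {n: x / x1 for n, x in throughput_by_n.items()}


# Hypothetical throughput measurements X(N), keyed by client count N.
throughput = {1: 1000.0, 2: 1900.0, 4: 3500.0}

capacity = normalized_capacity(throughput)
for n in sorted(capacity):
    print(f"N={n}  C(N)={capacity[n]:.2f}")
```

By construction C(1) = 1, so any value of C(N) below N shows how far the system falls short of linear scaling.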

More about MJ on FF: * Browser Compatibility * MJ ...

2012-04-11T12:59:09.059-07:00

More about MJ on FF:
* Browser Compatibility
* MJ 2.0 and default rendering in Firefox

Welcome to the web. :/

Looks great in Safari. :)

2012-04-11T12:34:39.465-07:00

Looks great in Safari. :)

the equation didn't display correctly on Firef...

2012-04-11T12:29:05.441-07:00

The equation didn't display correctly on Firefox 11.