Comments on The Pith of Performance: Harmonic Averaging of Monitored Rate Data
Feed updated 2024-03-01T21:53:15.921-08:00. Blog author: Neil Gunther (http://www.blogger.com/profile/11441377418482735926).

Neil Gunther, 2013-11-24T10:46:11.001-08:00:
Added a Conclusions section as well as various tweaks throughout. In particular, added a section that addresses the question, "When should I apply the harmonic mean to aggregating monitored data?" Answer: When all of the following criteria are met:

<b>R</b> - Rate data is the monitored metric
<b>A</b> - Async (unequal) time intervals
<b>T</b> - Too low data values are of interest
<b>E</b> - Event data, not sampled data

Neil Gunther, 2013-05-19T13:52:35.118-07:00:
The <b>harmonic mean</b> made it as a blog topic in the Huffington Post (which claims to offer <i>"fresh takes and real-time analysis from HuffPost's signature lineup of contributors"</i>) under the slightly tortured title <a href="http://www.huffingtonpost.com/colm-mulcahy/mean-questions-with-harmonious-answers_b_2469351.html" rel="nofollow">Mean Questions With Harmonious Answers</a>.

As well as straining at popularism with statements like "The harmonic mean of two numbers A and B is the <b>flip</b> of the average of the <b>flips</b> of A and B" and the use of rather esoteric names like <a href="http://www.thinkbabynames.com/meaning/1/Aodh" rel="nofollow">Aodh</a> and <a href="http://www.rawstory.com/rs/2013/05/17/what-bea-arthur-naked-says-about-your-deep-seated-freudian-longings/" rel="nofollow">Bea</a>, it never gets beyond my Ad Nauseam Example in Section 2 of the above slides.

Neil Gunther, 2013-05-05T23:25:50.873-07:00:
Just uploaded with new Section 7 on Accommodating Zero Rates.

Neil Gunther, 2013-04-17T08:37:17.028-07:00:
Updated with new Section 6 on applying the weighted harmonic mean to time series.

Neil Gunther, 2013-04-13T12:59:03.842-07:00:
Updated with new Section 5: Application to Actual Time Series.

Bob (https://www.blogger.com/profile/17444865306417740475), 2013-04-10T16:16:53.313-07:00:
Neil---

Good start on a discussion of the means. Fights break out on the subject of which mean should be used for specific tasks. Since you are tackling this controversial subject, I think that you should also discuss the geometric mean and the weighted averages of all three classical means. When working with data in Graphite, I discovered that the preaggregated data was derived using a simple arithmetic average. One component comprised 90% of the workload and most of the other seventeen components comprised much less than 2%. A simple arithmetic mean reduced the 90% component to a 5.56% contribution (1/18 of the total). Components with a contribution less than 0.01% were increased to 5.56%. Needless to say, the performance picture was completely misrepresented.

Bob
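
The arithmetic behind Bob's 5.56% figure can be reproduced with a small Python sketch. Only the component count (eighteen) and the 90% share come from the comment; the shares of the seventeen small components and the per-component metric values below are invented for illustration, and this is not a reconstruction of how Graphite aggregates data.

```python
# Hypothetical workload shares: one dominant component at 90% (from the
# comment) and seventeen small ones splitting the remaining 10% evenly
# (an assumption made only for this illustration).
shares = [0.90] + [0.10 / 17] * 17

# Hypothetical per-component metric values (say, latency in ms); made up.
values = [200.0] + [10.0] * 17

n = len(values)

# A simple arithmetic mean implicitly gives every component the same weight.
equal_weight = 1.0 / n

simple_mean = sum(values) / n
weighted_mean = sum(w * v for w, v in zip(shares, values)) / sum(shares)

print(f"implicit weight per component: {equal_weight:.2%}")
print(f"simple arithmetic mean:        {simple_mean:.2f}")
print(f"workload-weighted mean:        {weighted_mean:.2f}")
```

With these made-up inputs the dominant component barely moves the simple mean, while the workload-weighted mean stays close to its value, which is the misrepresentation the comment describes.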

Neil Gunther, 2013-04-10T08:16:00.044-07:00:
I think I found 2 bugs in SlideShare:

1) If you update the original file with more slides than the original had, you don't see the added slides. It truncates at the original number of slides.

2) Internal hyperlinks don't work. External hyperlinks do work.

This occurs with slides in PDF format generated by LaTeX.

Neil Gunther, 2013-04-10T08:04:49.268-07:00:
I'm not sure that I understand the comment.

I thought I did say in Prop 4 that the AM is applicable to <b>sampled data</b>, meaning measurement (or reporting) intervals of fixed width. The total time window can be arbitrary. That will just produce a different AM value.

For HM to apply, the data will not be sampled data because the intervals will not have fixed width. See Prop 2. This will be the case for <b>event data</b> that are only reported when some threshold is reached (e.g., 5000 page views). That then triggers an event which reports that count. The total time window can be arbitrary and will just produce a different HM value.

Maybe you can give me a better example.

metasoft (https://www.blogger.com/profile/17149213781391733478), 2013-04-09T19:50:46.809-07:00:
For sampled count data, wouldn't AM make more sense, just as you point out that it's more like a frequency histogram where the bars are all the same width? The AM * the total duration would be equal to the sum of individual counts. The open question is whether HM is good for sampled rate data to determine the average rate for a given larger interval, that is, aggregating up, not to the total observation period but to a multiple of the sampling period. Hope this is making sense to you.
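
The distinction in this exchange between sampled data (fixed-width intervals) and event data (a fixed count threshold reported over unequal intervals) can be checked numerically. The following is a minimal Python sketch under stated assumptions, not code from the slides: the interval width, the counts, and the durations are invented, and only the 5000 page-view threshold comes from the comment above. It reads "AM" as the arithmetic mean of per-interval rates.

```python
from statistics import harmonic_mean, mean

# Sampled data: counts reported over fixed-width intervals.
interval = 60.0                      # seconds per sample (hypothetical)
counts = [1200, 3000, 600, 1800]     # events seen in each interval (hypothetical)
rates = [c / interval for c in counts]
total_time = interval * len(counts)

am = mean(rates)
# With equal-width intervals, AM * total duration recovers the total count.
print(am * total_time, sum(counts))

# Event data: a report fires each time a fixed count threshold is reached,
# so the elapsed time per report varies.
threshold = 5000                           # page views per report (from the comment)
durations = [50.0, 200.0, 125.0, 25.0]     # seconds per report (hypothetical)
event_rates = [threshold / t for t in durations]

hm = harmonic_mean(event_rates)
true_rate = threshold * len(durations) / sum(durations)  # total count / total time

# The HM matches the overall rate; the AM of the same rates overstates it.
print(hm, true_rate, mean(event_rates))
```

In the second case every report carries the same count, so the plain harmonic mean is enough; if the counts per report differed, a count-weighted harmonic mean would be the analogous calculation.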