Cache _log_ summaries

From: Martin Hamilton (martin@net.lut.ac.uk)
Date: Thu May 18 2000 - 11:44:46 MDT


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

I hope this isn't an abuse of this list, but since it's so quiet (and
has so many of the right people on :-) I thought nobody would mind...

It's been a bugbear of mine for a long time that although we have
(more or less) a standard log file format for proxy caches (well, OK,
plain vanilla Common Logfile and Squid), there isn't a commonly
accepted format for summaries of those log files.

In case it's not clear, I'm thinking in terms of stats _such as_ (but
not necessarily including or limited to!) a periodic breakdown of
sites visited (bytes and requests shipped vs. HTTP status codes) and
clients visiting (ditto), plus performance measures for the cache
itself (e.g. median and standard deviation for hit service times per N
time units). It's worth bearing in mind that this is also the sort of
thing most logfile analysis tools (e.g. Calamaris) have to collate
internally before they can do their thing.

With one of my hats on I have to write code to process ~80 million
proxy cache logfile entries/day (aren't Service Level Agreements a
wonderful innovation? :-), and would dearly like to make some or all
of the resulting summaries available for people doing caching
research. Of course, if lots of people were to do this independently
using incompatible file formats it would be a real pain.

So... anyone interested in getting together (metaphorically :-) to
work something out ? Mail me, or post to the list if you have points
which you think other people would like to hear... !

Cheers,

Martin

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.0.1 (GNU/Linux)
Comment: Processed by Mailcrypt 3.5.4 and Gnu Privacy Guard <http://www.gnupg.org/>

iD8DBQE5JCwAVw+hz3xBJfQRAuRBAJ40aXIOZqbWhcSYrmIlaQLcPiWIIACgl9Su
qJaA0jRJ4pwjXOhvNE93nPM=
=29oB
-----END PGP SIGNATURE-----



This archive was generated by hypermail 2b29 : Thu Nov 18 2004 - 11:21:28 MST