community-dev mailing list archives

From Rich Bowen <rbo...@rcbowen.com>
Subject Re: Standards for mail archive statistics gathering?
Date Tue, 28 Apr 2015 14:11:56 GMT


On 04/27/2015 09:36 AM, Shane Curcuru wrote:
> I'm interested in working on some visualizations of mailing list
> activity over time, in particular some simple analyses, like thread
> length/participants and the like.  Given that the raw data can all be
> precomputed from mbox archives, is there any semi-standard way to
> distill and save metadata about mboxes?
>
> If we had a generic static database of past mail metadata and statistics
> (i.e. not details of contents, but perhaps overall # of lines of text or
> something), it would be interesting to see what kinds of visualizations
> that different people would come up with.
>
> Anyone have pointers to either a data format or the best parsing library
> for this?  I'm trying to think ahead, and work on the parsing, storing
> statistics, and visualizations as separate pieces so it's easier for
> different people to collaborate on something.

Roberto posted to the list a month or so ago about his work on this
kind of thing. You might ping him.

--Rich


-- 
Rich Bowen - rbowen@rcbowen.com - @rbowen
http://apachecon.com/ - @apachecon
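
For readers who land on this thread: neither message names a format or library, so the following is only a minimal sketch of the "parse mboxes, store metadata, visualize separately" split described in the quoted question. It assumes Python's standard `mailbox` module as the parser and a list of plain dicts (easily dumped to JSON) as the storage format. The function name `summarize_mbox` and the record fields are hypothetical, chosen for illustration, and are not a standard anyone in the thread proposed.

```python
import mailbox
from email.utils import parseaddr


def summarize_mbox(path):
    """Return one metadata record per message; no body text is kept."""
    records = []
    box = mailbox.mbox(path)
    try:
        for msg in box:
            payload = msg.get_payload()
            # Multipart messages return a list of parts, so a line count
            # is only recorded for simple single-part text bodies.
            body_lines = len(payload.splitlines()) if isinstance(payload, str) else None
            records.append({
                "message_id": msg.get("Message-ID"),
                "in_reply_to": msg.get("In-Reply-To"),  # lets later tools rebuild threads
                "from": parseaddr(msg.get("From", ""))[1],
                "date": msg.get("Date"),
                "subject": msg.get("Subject"),
                "body_lines": body_lines,
            })
    finally:
        box.close()
    return records
```

Keeping `Message-ID` and `In-Reply-To` in each record means thread length and participant counts can be computed later from the saved metadata alone, without reparsing the archives.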
