incubator-ooo-dev mailing list archives

From Donald Whytock <>
Subject Re: Word cloud for ooo-dev post subjects
Date Wed, 27 Jun 2012 21:36:20 GMT
On Wed, Jun 27, 2012 at 4:54 PM, Rob Weir <> wrote:
> This looks at the top 1000 terms used in ooo-dev post subjects since
> this project moved to Apache in June 2011.  The only thing I removed
> was "Re:", since that would have dominated the cloud and is
> machine-written, not user-written.
> In this particular cloud, I used all posts, including responses.  So
> if a term was used in a thread that had many responses, it would have
> additional weight in this chart.
> Technologies used:
> Python's mailbox API to extract the post titles.  Could have done this
> with any number of command line text tools, but it is trivial in
> Python as well:
> import mailbox
> # fileName is the path to the list's mbox archive
> box = mailbox.mbox(fileName)
> for message in box:
>     print(message['Subject'])
> Then I used to generate the graphic.
> Based on the reaction given to the previous word cloud, I know that
> some list subscribers are curious to see how often we write about
> LibreOffice.  So I'll help you find it in this graphic.  Look for the
> big "AOO", then under that see the "COMMIT".  Under COMMIT you can
> make out LIBREOFFICE, to the left of USERS.
> Regards,
> -Rob

Somehow not as stylish in this font.

"Bug" is visible in this one.  No one tweets about bugs?
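
For anyone who would rather check the counts than squint at the cloud, here is a minimal sketch that extends Rob's snippet to tally terms from the subject lines. It assumes Python 3. The helper names (subject_terms, top_terms, subjects_from_mbox) and the simple word-splitting rule are my own assumptions, not what was actually used to build the graphic. As in Rob's description, only leading "Re:" markers are dropped and every post is counted, so terms in long threads carry extra weight.

```python
import mailbox
import re
from collections import Counter

# Hypothetical helpers for illustration; not from the original post.
# Matches one or more leading "Re:" markers, in any letter case.
RE_PREFIX = re.compile(r'^\s*(re:\s*)+', re.IGNORECASE)


def subject_terms(subject):
    """Strip leading "Re:" markers and split a subject into upper-case terms."""
    text = '' if subject is None else str(subject)
    cleaned = RE_PREFIX.sub('', text)
    return [term.upper() for term in re.findall(r"[A-Za-z0-9']+", cleaned)]


def top_terms(subjects, n=1000):
    """Count terms across all subjects; replies add weight to a thread's terms."""
    counts = Counter()
    for subject in subjects:
        counts.update(subject_terms(subject))
    return counts.most_common(n)


def subjects_from_mbox(path):
    """Yield the Subject header of each message in an mbox file."""
    for message in mailbox.mbox(path):
        yield message['Subject']
```

Calling top_terms(subjects_from_mbox(fileName)) with the path to a local mbox archive would give (term, count) pairs, most frequent first, which makes it easy to look up "BUG" or "LIBREOFFICE" directly.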

