Date: Fri, 29 Jun 2012 11:46:46 +0800
Subject: Re: Word cloud for ooo-dev post subjects
From: Kevin Grignon
To: ooo-dev@incubator.apache.org

KG01 - See comments inline.

On Thu, Jun 28, 2012 at 6:08 AM, Rob Weir wrote:

> On Wed, Jun 27, 2012 at 5:36 PM, Donald Whytock wrote:
> > On Wed, Jun 27, 2012 at 4:54 PM, Rob Weir wrote:
> >> http://people.apache.org/~robweir/ooo-dev-cloud.png
> >>
> >> This looks at the top 1000 terms used in ooo-dev post subjects since
> >> this project moved to Apache in June 2011. The only thing I removed
> >> was "Re:", since that would have dominated the cloud and is machine,
> >> not user written
> >>
>

KG01 - Great stuff, Rob. These simple analytics are really interesting.
*Would you be open to harvesting and sharing the extracted post titles
for both ooo-dev and ooo-users and sending them along to me?* The Python
bit is unfamiliar to me. The text files, like you did for the Twitter
feeds, would be great. Then I can go into Wordle and tweak, using a
variety of filters. Also, I am exploring other analytic tools to parse
the data. Thanks.

> >> In this particular cloud, I used all posts, including responses. So
> >> if a term was used in a thread that had many responses, it would have
> >> additional weight in this chart.
> >>
> >> Technologies used:
> >>
> >> Python's mailbox API to extract the post titles.
> >> Could have done this with any number of command line text tools as
> >> well, but it is trivial in Python as well:
> >>
> >> import mailbox
> >>
> >> box = mailbox.mbox(fileName)
> >>
> >> for message in box:
> >>     print(message['Subject'])
> >>
> >>
> >> Then I used Wordle.net to generate the graphic.
> >>
> >> Based on the reaction given to the previous word cloud, I know that
> >> some list subscribers are curious to see how often we write about
> >> LibreOffice. So I'll help you find it in this graphic. Look for the
> >> big "AOO", then under that see the "COMMIT". Under COMMIT you can
> >> make out LIBREOFFICE, to the left of USERS.
> >>
> >> Regards,
> >>
> >> -Rob
> >
> > Somehow not as stylish in this font.
> >
> > "Bug" is visible in this one. No one tweets about bugs?
> >
>
> I think this is a user/developer difference. Users talk in more
> direct terms, about how bugs impact them. So very few mention a
> "bug". But 18 mentions on Twitter of some form of
> crash/crashed/crashing. On the ooo-dev list we call these "bugs" or
> "issues". Users "lose all their work". We "debug an exception". The
> army "pacifies the village", etc.
>
> It is good to remember the difference in impact our work (good or bad)
> has on others, even though we use more clinical terms on this list.
>
> -Rob
> >
> > Don
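
For anyone who, like Kevin, finds the Python bit unfamiliar, the extraction Rob describes could be sketched roughly as below. This is only an illustration under assumptions, not Rob's actual script: the helper names (clean_subject, extract_subjects, top_terms) are hypothetical, the "Re:" stripping is one guess at how that prefix might be removed, and the term counting merely stands in for what Wordle does internally. Only mailbox.mbox and the 'Subject' lookup come from the thread.

```python
import mailbox
import re
from collections import Counter

# Matches one or more leading "Re:" markers, case-insensitively.
REPLY_PREFIX = re.compile(r"^(\s*re:\s*)+", re.IGNORECASE)


def clean_subject(subject):
    """Collapse whitespace in one subject and strip leading "Re:" markers."""
    if subject is None:
        return ""
    text = " ".join(str(subject).split())
    return REPLY_PREFIX.sub("", text)


def extract_subjects(mbox_path):
    """Yield the cleaned subject of every message in an mbox file."""
    # create=False makes a missing file an error instead of creating it.
    box = mailbox.mbox(mbox_path, create=False)
    try:
        for message in box:
            yield clean_subject(message["Subject"])
    finally:
        box.close()


def top_terms(subjects, limit=1000):
    """Count upper-cased, whitespace-separated terms, most common first."""
    counts = Counter()
    for subject in subjects:
        counts.update(subject.upper().split())
    return counts.most_common(limit)
```

Writing each value from extract_subjects("ooo-dev.mbox") (a hypothetical file name) to a text file, one subject per line, would give the kind of plain list that can be pasted into Wordle. Because every reply is counted, terms from long threads carry extra weight, as Rob notes above.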