Return-Path: X-Original-To: apmail-incubator-ooo-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-ooo-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5E4EF9FD2 for ; Mon, 26 Mar 2012 18:46:54 +0000 (UTC) Received: (qmail 793 invoked by uid 500); 26 Mar 2012 18:46:54 -0000 Delivered-To: apmail-incubator-ooo-dev-archive@incubator.apache.org Received: (qmail 723 invoked by uid 500); 26 Mar 2012 18:46:54 -0000 Mailing-List: contact ooo-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: ooo-dev@incubator.apache.org Delivered-To: mailing list ooo-dev@incubator.apache.org Received: (qmail 707 invoked by uid 99); 26 Mar 2012 18:46:54 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Mar 2012 18:46:54 +0000 X-ASF-Spam-Status: No, hits=0.7 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [205.178.146.62] (HELO omr12.networksolutionsemail.com) (205.178.146.62) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 26 Mar 2012 18:46:43 +0000 Received: from cm-omr12 (mail.networksolutionsemail.com [205.178.146.50]) by omr12.networksolutionsemail.com (8.13.8/8.13.8) with ESMTP id q2QIkLEM032703 for ; Mon, 26 Mar 2012 14:46:21 -0400 Authentication-Results: cm-omr12 smtp.user=drew@baseanswers.com; auth=pass (LOGIN) X-Authenticated-UID: drew@baseanswers.com Received: from [174.140.79.82] ([174.140.79.82:42974] helo=[192.168.1.143]) by cm-omr12 (envelope-from ) (ecelerity 2.2.2.41 r(31179/31189)) with ESMTPA id 23/61-15514-D79B07F4; Mon, 26 Mar 2012 14:46:21 -0400 Subject: Re: Google Analytics on download.openoffice.org From: drew To: ooo-dev@incubator.apache.org In-Reply-To: References: <4F6C9EC7.1020905@gmail.com> <87B80DCE-449A-4716-B777-FDBAF93350E6@gmail.com> Content-Type: text/plain; charset="UTF-8" Date: Mon, 26 Mar 2012 14:46:20 -0400 Message-ID: <1332787580.20875.20.camel@sybil-gnome> Mime-Version: 1.0 X-Mailer: Evolution 2.32.2 Content-Transfer-Encoding: 8bit On Mon, 2012-03-26 at 10:40 -0400, Rob Weir wrote: > On Mon, Mar 26, 2012 at 10:28 AM, Louis Suárez-Potts wrote: > > > Hi, > > > > > > On 2012-03-23, at 12:03 , Kay Schenk wrote: > > > > > > > > > > > On 03/21/2012 07:23 PM, Rob Weir wrote: > > >> I'd like to enable Google Analytics on our download page. > > >> > > >> This would allow us to collect some important data, such as the > > >> geographical distribution of download requests. This information has > > >> been sought for 3.4 mirror distribution planning. It can also provide > > >> continuity of our download statistics which we would otherwise lose > > >> when moving off of MirrorBrain. > > >> > > >> Of course, if some else is willing to implement an alternative way of > > >> collecting this info, then I'd love it hear it. But I think GA is the > > >> most direct method. > > >> > > >> Lazy consensus, 72 hours, etc. > > >> > > >> -Rob > > > This sounds fine with me. Yes, we should state our privacy policy on > > use, and at some point, if you do produce a public report, maybe nix IP > > addresses if that's a concern. > > > > I think nixing IP addresses is a necessity, if one were to publish this > > data, as is informing the downloader of the privacy issues. > > > > > > This part is really easy, since Google Analytics does not provide us with > IP addresses. It is giving us aggregate information, not per-user > information. > > FWIW, we used to use several means to track downloads of the binaries. None > > was particularly great and none satisfied the desires of corporate > > marketing. And all made some in the corporate hierarchy uncomfortable, if > > only because a download of a binary is hardly the same as a contribution to > > source. > > > > We used Google Analytics but also, as was then called, Omniture. Selected > > data were published in graphical form to the services wiki. > > > > In addition, more or less from the start, I published spreadsheets of > > downloads, and particularized it according to language but not region. (I > > also listed OS of version downloaded.) There were many problems to these > > spreadsheets, as I noted at http://stats.openoffice.org/, not least of > > which was spurious duplication and misleading numeration. > > > > What I always desired was a mechanism by which a successful download could > > "call home", thus supplying rather useful information. In the end, a > > version of just this was indeed done, via update calls, extensions, etc. > > However, there was no direct insertion of such a mechanism. If we were ever > > to do that, I would argue that we do need then to inform any would-be > > downloader of the privacy issues. > > > > -louis > > > > PS Roberto asked me about the old data and if it a) was extant and b) > > reflected geolocation. Answers: It was not extant, and I didn't keep the > > raw data. (I could probably find it stuffed into some archive, but why? As > > I pointed out to Roberto, the ODF Alliance information regarding ODF uptake > > is actually a better indicator, as most ODF implementations they track were > > or are based on OOo.) > > > > Howdy Louis, Kay, Rob, et al I've certainly delayed this long enough, longer then my intent in fact. There is no real question that analytics are important, Google is likely the fastest and easiest road to acquiring them, I suppose. I'm glad that Louis added some historic view to the subject, it might be worth noting that in the case of the Omniture data gathering campaign individual site users could opt out. The real real question is access IMO, Louis also broached the subject of his employer when he uses the phrase, "the desires of corporate marketing". Historically of course, SUN or Oracle, the analytics where the purview of the corporate owner. With Apache OpenOffice there of course is no corporate owner, analytics are then a resource of the Apache Software Foundation and from this flows, I would say, to the (P)PMC. This distinction I would submit means that the full analytics are not available to any specific employer of someone volunteering their time to the communal effort within AOO. However, another way to look at that would be that the analytics are available to all PPMC members. Of course as Rob points out, with Google this requires access to a specific account, so it would make sense that individual PPMC would need to request full access. I just want to emphasize that whomever is maintaining the Google account is doing so as a steward for the entire Apache OpenOffice (P)PMC, at least this is how I see it. Otherwise, it would make sense with regards to public access to not look much beyond what basic charting was available in the past, but to at least shot for building a system to deliver that. So, it seems everyone is ok with this and I certainly don't want to deter it anylonger - it's a +1 from me now. @Rob - if I can help with generating reports, let me know. //drew