lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doug Cutting <>
Subject Re: Apache logs and data
Date Mon, 19 Nov 2007 23:51:51 GMT
Chris Hostetter wrote:
> right ... i'm not suggesting we do this in an automatic un-human-involved 
> way; i'm suggesting that a "trusted" person generate this report, 
> ignore anything with a count less then some number (both to remove noise, 
> and eliminate most of the random "identifiable" queries), and then 
> manually remove anything that looks "personal"

I think the safest path is simply to not publish any queries, but rather 
to, e.g., permit committers to run experiments using them and publish 
the results of the experiments.  But no queries would be made available 
to the general public on a website.


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message