Return-Path: Delivered-To: apmail-legal-discuss-archive@www.apache.org Received: (qmail 31072 invoked from network); 7 Nov 2010 02:27:42 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 7 Nov 2010 02:27:42 -0000 Received: (qmail 21523 invoked by uid 500); 7 Nov 2010 02:28:14 -0000 Delivered-To: apmail-legal-discuss-archive@apache.org Received: (qmail 21314 invoked by uid 500); 7 Nov 2010 02:28:13 -0000 Mailing-List: contact legal-discuss-help@apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: Reply-To: legal-discuss@apache.org List-Id: Delivered-To: mailing list legal-discuss@apache.org Received: (qmail 21307 invoked by uid 99); 7 Nov 2010 02:28:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 07 Nov 2010 02:28:13 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=10.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of spamassassin@dostech.ca designates 207.164.80.200 as permitted sender) Received: from [207.164.80.200] (HELO mail.csolve.net) (207.164.80.200) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 07 Nov 2010 02:28:03 +0000 Received: from cpe687f741b330b-cm001692fb3602.cpe.net.cable.rogers.com ([99.251.172.93] helo=dilbert.dostech.net) by mail.csolve.net with esmtpsa (TLSv1:AES256-SHA:256) (Exim 4.69 (FreeBSD)) (envelope-from ) id 1PEuyq-000LbM-Lq for legal-discuss@apache.org; Sat, 06 Nov 2010 22:27:41 -0400 Received: from [10.145.1.112] ([10.145.1.112]) by dilbert.dostech.net (8.13.8/8.13.8) with ESMTP id oA72RFkr014304 for ; Sat, 6 Nov 2010 22:27:34 -0400 Message-ID: <4CD60E7E.4040209@dostech.ca> Date: Sat, 06 Nov 2010 22:27:10 -0400 From: "Daryl C. W. O'Shea" Organization: DOS Technologies User-Agent: Mozilla/5.0 (Windows; U; Windows NT 6.1; en-GB; rv:1.9.2.11) Gecko/20101013 Thunderbird/3.1.5 MIME-Version: 1.0 To: legal-discuss@apache.org Subject: Re: Fair-use data in svn References: <4CD3FAF0.8070909@apache.org> <201011050923.59468.dkulp@apache.org> In-Reply-To: <201011050923.59468.dkulp@apache.org> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Authenticated-Id: dos X-Virus-Checked: Checked by ClamAV on apache.org On 05/11/2010 9:23 AM, Daniel Kulp wrote: > On Friday 05 November 2010 8:39:12 am Sim IJskes wrote: >> You cannot copy verbatim. But you can create and publish the tools. You >> can also create a internal representation, say a neural net, or >> statistics, and provide annotations, as long as it something new. >> >> So if you crawl the net, and build a statistics model of it, you can >> distribute the staticstics model data as your own. > > That's kind of what I was thinking. Doesn't Spamassassin do something > similar. They have a zone/jail someplace that collects a lot of copyrighted > spam data and runs various analysis on it and such and then commits the > results of said analysis into the repository. Yeah, I suppose we do. We collect ham (which I suppose would be copyrighted) and spam (which in many cases is ilegal itself, so I'm not sure about copyright protection for that) and then run statistical analysis on it (rule hits, rule generation, etc) with rules and scores generated and published in the repository. I think our case differs a little more, though, in that people send us the data (via email)... we don't go out and collect it. In any case, though, we're not publishing the actual ham and spam mail. Daryl --------------------------------------------------------------------- To unsubscribe, e-mail: legal-discuss-unsubscribe@apache.org For additional commands, e-mail: legal-discuss-help@apache.org