www-legal-discuss mailing list archives

From Sim IJskes <sijs...@apache.org>
Subject Re: Fair-use data in svn
Date Fri, 05 Nov 2010 12:39:12 GMT
On 05-11-10 13:25, Benson Margulies wrote:
> Let me be clear on the regime that this discussion is heading for.

First, I hope I didn't and don't give you the impression that I'm a 
lawyer. :-)

So anything I say right now needs to be cleared by a lawyer.

You cannot copy verbatim. But you can create and publish the tools. You 
can also create an internal representation, say a neural net or 
statistics, and provide annotations, as long as it is something new.

So if you crawl the net and build a statistical model of it, you can 
distribute the statistical model data as your own.

A practical rule might be that it must be impossible to recreate the 
original crawled web pages of the news publishers from the published 
dataset. So you can count words, correlate them, score them, etc. But you 
cannot crawl the net, collect the source material, put it in an archive, 
and say, "here's the data I built the model with".
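
As a rough illustration of the "count words" idea, here is a minimal 
Python sketch. The function name word_statistics and the sample strings 
are hypothetical, and the sample strings stand in for crawled pages. Only 
aggregated counts are kept; the page text and the word order are thrown 
away. Whether counts like these are enough to satisfy the rule above is 
still a question for a lawyer.

```python
from collections import Counter
import re


def word_statistics(pages):
    """Aggregate word counts over all pages; the page text itself is not kept."""
    counts = Counter()
    for text in pages:
        # Lowercase and split into words; the Counter discards word order.
        counts.update(re.findall(r"[a-z']+", text.lower()))
    return counts


# Hypothetical stand-ins for crawled pages (not real publisher content).
pages = [
    "The market rose today.",
    "The market fell today.",
]

stats = word_statistics(pages)
print(stats.most_common(3))
```

The published dataset would then be stats (or something derived from 
it), not the pages list.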


Gr. Sim
