Return-Path: X-Original-To: apmail-legal-discuss-archive@www.apache.org Delivered-To: apmail-legal-discuss-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 340EE175FF for ; Sun, 10 May 2015 23:40:05 +0000 (UTC) Received: (qmail 69123 invoked by uid 500); 10 May 2015 23:40:00 -0000 Delivered-To: apmail-legal-discuss-archive@apache.org Received: (qmail 68904 invoked by uid 500); 10 May 2015 23:39:59 -0000 Mailing-List: contact legal-discuss-help@apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: Reply-To: legal-discuss@apache.org List-Id: Delivered-To: mailing list legal-discuss@apache.org Received: (qmail 68893 invoked by uid 99); 10 May 2015 23:39:59 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 10 May 2015 23:39:59 +0000 Date: Sun, 10 May 2015 23:39:59 +0000 (UTC) From: "Henri Yandell (JIRA)" To: legal-discuss@apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (LEGAL-90) What are the licensing implications for statistical information drawn from non-ASL2-licensed data, e.g. word frequency lists from Wikipedia dumps? MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/LEGAL-90?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14537425#comment-14537425 ] Henri Yandell commented on LEGAL-90: ------------------------------------ Is this still require Steve? I think you answered it yourself, though it's a bit circular: Loose language: "Can I generate a model from a non-category-A licensed work?" "Yes, as long as the model is not a derivative of the work". Whether it's a derivative will probably depend on both the license, but more importantly on what of the original work ends up in the model. > What are the licensing implications for statistical information drawn from non-ASL2-licensed data, e.g. word frequency lists from Wikipedia dumps? > -------------------------------------------------------------------------------------------------------------------------------------------------- > > Key: LEGAL-90 > URL: https://issues.apache.org/jira/browse/LEGAL-90 > Project: Legal Discuss > Issue Type: Question > Reporter: Steve Rowe > > I have generated word frequency lists from full Wikipedia dumps in several languages. For the purposes of inclusion in ASL2-licensed products, do I need to care about the license(s) covering the original text? > My interpretation (IANAL) of the [Creative Commons Attribution-ShareAlike 3.0 Unported license|http://creativecommons.org/licenses/by-sa/3.0/legalcode], under which [Wikipedia text is licensed|http://wikimediafoundation.org/wiki/Terms_of_Use], is that the license applies only to the Covered Works, Adaptations, and Collections, and that a word frequency list qualifies as none of these: Adaptations are "recognizably derived from the original"; and Collections "the Work is included in its entirety in unmodified form along with one or more other contributions". > My interpretation of the answer to the resolved question ["Can Apache projects include Creative Commons Attribution-Share Alike works?"|http://www.apache.org/legal/resolved.html#cc-sa] is that even if the CC-SA license applies to my word frequency lists, I can still include them in an ASL2-licensed product, as long as attribution is provided. > I'm also interested in the more general question, as posed in the issue summary: do the licenses covering arbitrary data, text or otherwise, have any bearing on stastical products created over the data? -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: legal-discuss-unsubscribe@apache.org For additional commands, e-mail: legal-discuss-help@apache.org