Return-Path: X-Original-To: apmail-incubator-ooo-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-ooo-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 490A297A6 for ; Thu, 22 Mar 2012 12:47:25 +0000 (UTC) Received: (qmail 76530 invoked by uid 500); 22 Mar 2012 12:47:25 -0000 Delivered-To: apmail-incubator-ooo-dev-archive@incubator.apache.org Received: (qmail 76450 invoked by uid 500); 22 Mar 2012 12:47:24 -0000 Mailing-List: contact ooo-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: ooo-dev@incubator.apache.org Delivered-To: mailing list ooo-dev@incubator.apache.org Received: (qmail 76442 invoked by uid 99); 22 Mar 2012 12:47:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Mar 2012 12:47:24 +0000 X-ASF-Spam-Status: No, hits=0.0 required=5.0 tests=SPF_PASS,WEIRD_PORT X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [87.253.162.5] (HELO server5.configcenter.info) (87.253.162.5) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Mar 2012 12:47:18 +0000 Received: from [9.155.131.22] (deibp9eh1--blueice3n2.emea.ibm.com [195.212.29.180]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) (Authenticated sender: web445p1) by server5.configcenter.info (Postfix) with ESMTP id 646971BB0638 for ; Thu, 22 Mar 2012 13:46:54 +0100 (CET) Message-ID: <4F6B1F3E.8000301@a-w-f.de> Date: Thu, 22 Mar 2012 13:46:54 +0100 From: Andre Fischer User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:10.0.2) Gecko/20120216 Thunderbird/10.0.2 MIME-Version: 1.0 To: ooo-dev@incubator.apache.org Subject: Re: Rat scan vs SGA References: <4F689FC6.3040800@a-w-f.de> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.0 (server5.configcenter.info [0.0.0.0]); Thu, 22 Mar 2012 13:46:54 +0100 (CET) X-server5-MailScanner-Information: Please contact the ISP for more information X-MailScanner-ID: 646971BB0638.A15EA X-server5-MailScanner: Found to be clean X-server5-MailScanner-From: af@a-w-f.de X-Virus-Checked: Checked by ClamAV on apache.org X-Old-Spam-Status: No On 22.03.2012 10:12, Armin Le Grand wrote: > [...] > > // No information, originally from > http://odur.let.rug.nl/~vannoord/TextCat/, adapted by Jocelyn MERAND > // delivered in libtextcat\prj\d.lst > // used in instsetoo_native, lingucomponent, scp2 (DEFAULT_CONF_FILE_NAME) > ?? libtextcat\data\new_fingerprints\ > > [...] The libtextcat version 2.2 library is under BSD license. It was released 2003 by WiseGuys Internet B.V. ([1]). No problem there. The data files in main/libtextcat/data/new_fingerprints are of unknown origin. However, there is a LICENSE file in this directory that in all but its title is a BSD license, with the exception of this additional paragraph: "- Neither the name of the WiseGuys Internet B.V. nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission." Therefore, I propose to assume a BSD license (category A) for the libtextcat data files. Plan B would be to delete the additional data files and rely on the one that are shipped with the library source code. Plan C is to create our own data files by feeding medium sized texts of each language to a tool that is shipped with the library. This tool "learns" language specific text patterns, the data files. Regards, Andre [1] http://software.wise-guys.nl:1080/libtextcat/index.html