incubator-ooo-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rob Weir <robw...@apache.org>
Subject Re: help requested - Re: update of license headers for data files in i18npool
Date Fri, 02 Dec 2011 13:43:52 GMT
On Fri, Dec 2, 2011 at 3:50 AM, Oliver-Rainer Wittmann
<orwittmann@googlemail.com> wrote:
> Hi,
>
> Thanks for the hint.
>
> Yesterday, late in the evening I have also found IBM's ftp server with the
> former ICU releases. But, I had not got the time to search for the original
> source files.
> Now, I found them - also consulting markmail to get hints for the ICU
> version. There are part of the ICU version 2.2, released 2002-08-15 found at
> [3].
> This ICU release is completely under ICU license.
>

Excellent.  A good idea would be to document this with a readme file
in the same directory, something that explains what the files are,
where they came from, give the URL, etc.  That will help anyone in the
future who has the same question.

> Best regards, Oliver.
>
> [3] ftp://ftp.software.ibm.com/software/globalization/icu/2.2/
>
>
> On 02.12.2011 01:42, Rob Weir wrote:
>>
>> On Thu, Dec 1, 2011 at 12:03 PM, Oliver-Rainer Wittmann
>> <orwittmann@googlemail.com>  wrote:
>>>
>>> Hi,
>>>
>>> I need some help here.
>>>
>>> It is about the following data files in folder
>>> i18npool/source/breakiterator/data/
>>> -- char_in.txt
>>> -- count_word*.txt
>>> -- dict_word*.txt
>>> -- edit_word*.txt
>>> -- line.txt
>>> -- sent.txt
>>>
>>> (A) I did not find the original sources of these data files on [2].
>>> Does somebody know the original source for these data files?
>>>
>>
>> Maybe try searching the old list archives:
>>
>> http://openoffice.markmail.org/
>>
>> When I typed in some file names, like dict_word.txt I see activity
>> going back to 2002 in the ancient CVS.  At that point it looks like it
>> was in the ICU component, or at least its placement in the tree
>> suggests that.  ICU came from IBM, as you know.
>>
>> Perhaps it would line up more with an earlier ICU version, like in the
>> 2.x series:
>>
>> ftp://ftp.software.ibm.com/software/globalization/icu/
>>
>>> (B) The data files count_word*.txt, dict_word*.txt and edit_word*.txt do
>>> not
>>> differ much. I assume that they are adapted from the original source for
>>> certain usages and languages.
>>> Can someone confirm this?
>>>
>>> (C) I have found files at [3] which correspond to these data files. The
>>> found files are named char.txt, line.txt, sent.txt and word.txt. Thus, it
>>> looks like that the original source of these data files is ICU. This
>>> would
>>> mean that the license for these files seems to be the ICU license.
>>> Can someone confirm this?
>>>
>>> Note: Eike Rathke stated in an posting made in June 2011 that these data
>>> files are taken from ICU and had been adpated for OOo.
>>>
>>> Thus again, can somebody help here?
>>>
>>> Best regards, Oliver.
>>>
>>>
>>> [3]
>>>
>>> http://www.opensource.apple.com/source/ICU/ICU-400.39/icuSources/data/brkitr/
>>> and
>>>
>>> http://www.opensource.apple.com/source/ICU/ICU-400.42/icuSources/data/brkitr/
>>>
>>> On 01.12.2011 14:48, Oliver-Rainer Wittmann wrote:
>>>>
>>>>
>>>> Hi,
>>>>
>>>> looking at our IP clearance wiki page showed that there is an entry for
>>>> which I
>>>> was volunteering, but which get out of my focus. Now, it gets back to my
>>>> attention.
>>>>
>>>> It is the issue regarding the license headers for the data files in
>>>> module
>>>> i18npool - see [1].
>>>>
>>>> Status update:
>>>> - Most data files are covered by Oracle's SGA
>>>> - The data files in folder i18npool/source/breakiterator/data/ which
>>>> have
>>>> an IBM
>>>> copyright does not have a proper license header.
>>>>
>>>> I will look at ICU [2] for an appropriate replacement.
>>>>
>>>> [1] https://cwiki.apache.org/confluence/display/OOOUSERS/IP_Clearance
>>>> [2] http://site.icu-project.org/

Mime
View raw message