lucene-java-user mailing list archives

From Andrzej Bialecki ...@getopt.org>
Subject Re: Lucene indexing on Hadoop distributed file system
Date Mon, 27 Mar 2006 17:31:21 GMT
Doug Cutting wrote:
> Igor Bolotin wrote:
>> If somebody is interested - I can post our changes in TermInfosWriter
>> and SegmentTermEnum code, although they are pretty trivial.
>
> Please submit this as a patch attached to a bug report.
>
> I contemplated making this change to Lucene myself, when writing 
> Nutch's FsDirectory, but thought that no one else would ever be 
> interested in using it.  Now that's been proven wrong!
>
> Note that any change to the file format must be back-compatible.

This could be solved by putting a marker value in the first 8 bytes (== 
-1L), which would indicate that the real length is at the end. This way 
the new implementation will be able to read old indexes.
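
A rough sketch of what I mean, in Java. This is not the actual
TermInfosWriter / SegmentTermEnum code: the class name, the method names
and the simplified layout (an 8-byte length field followed by an opaque
payload) are made up for illustration. It also assumes that old indexes
never store a negative length, so -1L cannot collide with a real value.

```java
import java.nio.ByteBuffer;

// Hypothetical illustration only; not Lucene's real file layout.
public class TrailingSizeSketch {
    // Marker stored in the first 8 bytes when the real length is at the end.
    static final long SIZE_AT_END = -1L;

    // Old layout: [length:8][payload]. Requires seeking back to fill in length.
    static byte[] writeOldFormat(long length, byte[] payload) {
        return ByteBuffer.allocate(8 + payload.length)
                .putLong(length)
                .put(payload)
                .array();
    }

    // Append-only layout: [-1L:8][payload][length:8]. No seek needed on write.
    static byte[] writeNewFormat(long length, byte[] payload) {
        return ByteBuffer.allocate(8 + payload.length + 8)
                .putLong(SIZE_AT_END)
                .put(payload)
                .putLong(length)
                .array();
    }

    // Reader that accepts both layouts.
    static long readLength(byte[] file) {
        ByteBuffer buf = ByteBuffer.wrap(file);
        long first = buf.getLong(0);
        if (first != SIZE_AT_END) {
            return first;                      // old index: length is up front
        }
        return buf.getLong(file.length - 8);   // new index: length is the last 8 bytes
    }

    public static void main(String[] args) {
        byte[] payload = {1, 2, 3};
        System.out.println(readLength(writeOldFormat(3L, payload)));
        System.out.println(readLength(writeNewFormat(3L, payload)));
    }
}
```

The writer never has to seek back to the start of the file, which is the
point on a file system that only supports appending, and the reader needs
just one extra check to tell the two layouts apart.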

-- 
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com




