lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andrzej Bialecki ...@getopt.org>
Subject Re: Text is not indexed when passed as a StringReader
Date Sat, 17 Sep 2005 08:02:42 GMT
Daniel Naber wrote:
> On Friday 16 September 2005 21:51, Matthias Bräuer wrote:
> 
> 
>>but
>>unstored field and the passed in Reader happens to be a StringReader
>>(e.g. when extracting Word documents using the Textmining library) the
>>field is not indexed at all. That means Luke shows no terms for this
>>field and, consequently, searches do not yield any result.
> 
> 
> Luke only shows terms if the field is *stored* (which it isn't for a 
> reader). You need to click the "Reconstruct & Edit" button to see if the 
> text really isn't *indexed*.

Caveat emptor - the "Restore" function just collects existing terms from 
the index. If the input text passed through an aggresive analyzer (like 
the StandardAnalyzer), many tokens will be missing and the reconstructed 
text will be incomplete.

-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message