lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ian Lea <ian....@gmail.com>
Subject Re: Index Field feeded from Reader that also stores cleartext
Date Fri, 03 Sep 2010 13:01:51 GMT
If you can't use one of the Reader based Field methods then no.
You'll have to convert the data to a string.  If you do it a doc at a
time and you still don't have enough memory then I don't know what you
can do.


--
Ian.


On Fri, Sep 3, 2010 at 10:23 AM, Gregor Dorfbauer
<gregor.dorfbauer@lagentz.at> wrote:
> Hi!
>
> I'm working on an indexer that should process documents on hard-disk which
> are of arbitrary size and type. I use Apache Tika for plain text extraction
> which offers the feature to stream the parsers output through a reader.
>
> My problem is following:
> Is there a possibility to generate a document field that gets its data from
> an Reader-instance and where the plain text is also stored into the index
> (like the Store.YES field denotes)?
> If I can't stream the data, memory usage is exceeding the limits of my
> machine.
>
>
> Thanks for your help,
> Gregor
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message