lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mike Snare <>
Subject Re: retrieve tokens
Date Wed, 22 Dec 2004 18:19:39 GMT
> But for the other issue on 'store lucene' vs 'store db'. Does anyone can
> provide me with some field experience on size?
> The system I'm developing will provide searching through some 2000
> pdf's, say some 200 pages each. I feed the plain text into Lucene on a
> Field.UnStored bases. I also store this plain text in the database for
> the sole purpose of presenting a context snippet.

Why not store the snippet in another field that is stored, but not
indexed?  You could then immediately retrieve the snippet from the
doc.  This would only increase your index by num_docs * size_snippet
and would save the db access time and complexity.


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message