Greetings,
I have a requirement to search documents page by page. For example, in a
500-page document, if someone searches for "foo", I need to return "Found
foo on pages 4, 6, 24, 100, 223, 401, and 455".
The way I've implemented this is to index each *page* separately, so my
500-page document is treated as 500 documents rather than one. Then, when I
get hits, I sort and group them by parent document to build the
per-document list of pages.
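To make the per-page approach concrete, here is a rough sketch of the grouping step. It is plain Java rather than Lucene API code, and PageHit, docId, and page are hypothetical names standing in for whatever stored fields each page-document carries in the real index.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

public class PageHitAggregator {

    /** Hypothetical hit: one indexed page that matched the query. */
    static final class PageHit {
        final String docId; // assumed stored field naming the parent document
        final int page;     // assumed stored field holding the page number

        PageHit(String docId, int page) {
            this.docId = docId;
            this.page = page;
        }
    }

    /** Groups page-level hits by parent document; pages come out sorted and de-duplicated. */
    static Map<String, SortedSet<Integer>> pagesByDocument(List<PageHit> hits) {
        Map<String, SortedSet<Integer>> result = new TreeMap<>();
        for (PageHit hit : hits) {
            result.computeIfAbsent(hit.docId, k -> new TreeSet<>()).add(hit.page);
        }
        return result;
    }

    /** Builds a summary line such as "Found foo on page 4,6,24". */
    static String summarize(String term, SortedSet<Integer> pages) {
        StringBuilder sb = new StringBuilder("Found ").append(term).append(" on page ");
        String separator = "";
        for (int page : pages) {
            sb.append(separator).append(page);
            separator = ",";
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Hits arrive in arbitrary (e.g. relevance) order, possibly with repeats.
        List<PageHit> hits = Arrays.asList(
                new PageHit("doc1", 24),
                new PageHit("doc1", 4),
                new PageHit("doc2", 7),
                new PageHit("doc1", 6));

        for (Map.Entry<String, SortedSet<Integer>> e : pagesByDocument(hits).entrySet()) {
            System.out.println(e.getKey() + ": " + summarize("foo", e.getValue()));
        }
    }
}
```

The sorted map and sorted sets do the "sort" part for free, so the cost is one pass over the hits plus the insertions. The sketch assumes all hits for a query can be collected before grouping.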
Is this the best way to do this? Is there a way to store location
information (such as a page number) with each term within a field? Note
that there can be thousands of documents, each containing thousands of pages.
Thanks in advance,
JMA