lucene-java-user mailing list archives

From "Mike Streeton" <mike.stree...@ardentia.co.uk>
Subject RE: Searching for a phrase which spans on 2 pages
Date Wed, 12 Jul 2006 11:43:31 GMT
The simplest solution is always the best: when storing a page, do not
break up sentences. The text indexed for a page is then every sentence
that occurs on it, in full. If a sentence starts on one page and
finishes on the next, it is included in both pages in the index, so a
phrase that crosses the page break matches both pages.
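
As a rough sketch of the idea (plain Java, no Lucene calls), the text to
index for each page could be built as below. The class and method names
are made up for illustration, and the sentence detection is deliberately
naive: it assumes a sentence ends at '.', '!' or '?' followed by
whitespace or the end of the text. Each returned string would then be
the content indexed for that page.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: builds the text to index for
// each page so that a sentence crossing a page break lands in both pages.
class PageSentenceOverlap {

    static List<String> indexTextPerPage(List<String> pages) {
        // Join all pages into one string, remembering each page's offsets.
        StringBuilder all = new StringBuilder();
        int[] pageStart = new int[pages.size()];
        int[] pageEnd = new int[pages.size()];
        for (int i = 0; i < pages.size(); i++) {
            if (i > 0) all.append(' ');          // separator between pages
            pageStart[i] = all.length();
            all.append(pages.get(i).trim());
            pageEnd[i] = all.length();
        }

        List<StringBuilder> perPage = new ArrayList<>();
        for (int i = 0; i < pages.size(); i++) perPage.add(new StringBuilder());

        int sentStart = 0;
        for (int pos = 0; pos < all.length(); pos++) {
            char c = all.charAt(pos);
            boolean last = pos == all.length() - 1;
            // Naive assumption: terminator followed by whitespace or end of text.
            boolean closes = (c == '.' || c == '!' || c == '?')
                    && (last || Character.isWhitespace(all.charAt(pos + 1)));
            if (!closes && !last) continue;

            int to = pos + 1;
            int from = sentStart;
            while (from < to && Character.isWhitespace(all.charAt(from))) from++;
            if (from < to) {
                String sentence = all.substring(from, to);
                // Add the whole sentence to every page it overlaps.
                for (int p = 0; p < pages.size(); p++) {
                    if (from < pageEnd[p] && to > pageStart[p]) {
                        StringBuilder sb = perPage.get(p);
                        if (sb.length() > 0) sb.append(' ');
                        sb.append(sentence);
                    }
                }
            }
            sentStart = to;
        }

        List<String> result = new ArrayList<>();
        for (StringBuilder sb : perPage) result.add(sb.toString());
        return result;
    }
}
```

For example, given the two pages "The quick brown fox. It jumps over"
and "the lazy dog. The end.", the sentence "It jumps over the lazy dog."
ends up in the text for both pages, so a phrase search for "over the
lazy" would hit both.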

Hope this helps

Mike

www.ardentia.com the home of NetSearch
-----Original Message-----
From: Mile Rosu [mailto:mile.rosu@level7.ro] 
Sent: 11 July 2006 15:55
To: java-user@lucene.apache.org
Subject: Searching for a phrase which spans on 2 pages

Hello,

I am working on an application similar to Google Books that allows
searching documents, each of which represents a scanned page. Of course,
one might search for a phrase that starts at the end of one page and
ends at the beginning of the next. I do not know how to handle this
case; both pages should be returned as hits.
Do you have any idea how this situation might be handled?

Thank you,
Mile Rosu

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

