lucene-java-user mailing list archives

From Ramesh Salla <ramesh.sa...@iptouch.com>
Subject RE: Searching for a phrase which spans on 2 pages
Date Thu, 13 Jul 2006 11:17:16 GMT
Yes, this can easily be done using the TokenStream class and hence getting
the BestTokens.
But of course you have to have this content in the index.

DONE>>>>

Ramesh Reddy



On Wed, 2006-07-12 at 12:43 +0100, Mike Streeton wrote:

> The simplest solution is always the best - when storing the page, do not
> break up sentences. So a page will be all the sentences that occur on
> it. If a sentence starts on one page and finishes on the next it will be
> included in both pages in the index.
> 
> Hope this helps
> 
> Mike
> 
> www.ardentia.com the home of NetSearch
> -----Original Message-----
> From: Mile Rosu [mailto:mile.rosu@level7.ro] 
> Sent: 11 July 2006 15:55
> To: java-user@lucene.apache.org
> Subject: Searching for a phrase which spans on 2 pages
> 
> Hello,
> 
> I am working on an application similar to google books which allows 
> searching on documents which represent a scanned page. Of course, one 
> might search for a phrase starting at the end of one page and ending at 
> the beginning of the next one. In this case I do not know how I might 
> treat this. Both pages should be returned as hit results.
> Do you have any idea on how this situation might be handled?
> 
> Thank you,
> Mile Rosu
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
> 
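
To make Mike's suggestion above concrete, here is a rough sketch of how the
text for each page could be assembled before it is handed to the indexer, so
that a sentence split across a page break ends up whole in both pages. It uses
only the JDK and does not call Lucene. The class and method names (PageOverlap,
textToIndex, head, tail) are made up for illustration, and the sentence
detection is a deliberately naive assumption (a sentence ends at '.', '!' or
'?'); java.text.BreakIterator.getSentenceInstance() would be a more robust
choice for real text.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: builds the text to index per page.
class PageOverlap {

    // Naive assumption: a sentence ends at '.', '!' or '?'.
    private static boolean isTerminator(char c) {
        return c == '.' || c == '!' || c == '?';
    }

    // True if the page's last sentence is complete (or the page is empty).
    private static boolean endsSentence(String page) {
        String t = page.trim();
        return t.isEmpty() || isTerminator(t.charAt(t.length() - 1));
    }

    // Unfinished fragment at the end of a page: text after the last terminator.
    static String tail(String page) {
        String t = page.trim();
        for (int i = t.length() - 1; i >= 0; i--) {
            if (isTerminator(t.charAt(i))) {
                return t.substring(i + 1).trim();
            }
        }
        return t; // no terminator at all: the whole page is one fragment
    }

    // Start of a page up to and including its first terminator.
    static String head(String page) {
        String t = page.trim();
        for (int i = 0; i < t.length(); i++) {
            if (isTerminator(t.charAt(i))) {
                return t.substring(0, i + 1);
            }
        }
        return t; // no terminator at all: the whole page is one fragment
    }

    // For each page, returns its text plus any sentence fragment borrowed
    // from the neighbouring page, so a split sentence appears in both.
    static List<String> textToIndex(List<String> pages) {
        List<String> out = new ArrayList<>();
        for (int i = 0; i < pages.size(); i++) {
            StringBuilder sb = new StringBuilder();
            // The previous page stopped mid-sentence: prepend its fragment.
            if (i > 0 && !endsSentence(pages.get(i - 1))) {
                sb.append(tail(pages.get(i - 1))).append(' ');
            }
            sb.append(pages.get(i).trim());
            // This page stops mid-sentence: append the rest from the next page.
            if (i + 1 < pages.size() && !endsSentence(pages.get(i))) {
                sb.append(' ').append(head(pages.get(i + 1)));
            }
            out.add(sb.toString());
        }
        return out;
    }
}
```

Each returned string would then be indexed as the content of the matching page
document, so a phrase query that crosses the page break matches both pages.
The sketch only borrows from the immediate neighbour, so a sentence that runs
across three or more pages is not fully covered.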
