lucene-java-user mailing list archives

From Paul Libbrecht
Subject Re: Sentence boundary storage
Date Fri, 28 Oct 2005 22:04:36 GMT
One thing that I know has bugged me is matching a phrase in which I 
would expect a mathematical formula (which is "just a subphrase"). I 
would have liked the phrase query to extend as far as it wishes, but not 
past a given token... would this be possible?
Presumably a period token combined with this feature would provide the same thing?
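
To make the idea concrete, here is a minimal sketch in plain Python (not Lucene code) of a match that may extend as far as it likes but never past a boundary token. The function name `bounded_match`, the `BOUNDARY` constant, and the choice of "." as the boundary token are all assumptions made for illustration; nothing here corresponds to an existing Lucene query class.

```python
from typing import List, Optional, Tuple

# Hypothetical boundary token that an analyzer would emit at sentence ends.
BOUNDARY = "."


def bounded_match(
    tokens: List[str], first: str, last: str, boundary: str = BOUNDARY
) -> Optional[Tuple[int, int]]:
    """Return (start, end) indices of the first span that begins with
    `first`, ends with `last`, and contains no boundary token in between.
    Return None if no such span exists."""
    for i, tok in enumerate(tokens):
        if tok != first:
            continue
        # Extend to the right as far as needed, but stop at a boundary.
        for j in range(i + 1, len(tokens)):
            if tokens[j] == boundary:
                break
            if tokens[j] == last:
                return (i, j)
    return None


tokens = "the formula x + y holds . therefore z".split()
print(bounded_match(tokens, "formula", "holds"))      # span inside one sentence
print(bounded_match(tokens, "formula", "therefore"))  # would cross the period
```

The first call finds a span because everything between "formula" and "holds" lies in one sentence; the second finds nothing because the period token stops the extension.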


On 28 Oct 2005, at 23:46, Grant Ingersoll wrote:

> Hi,
> Was wondering what people's experience is with storing sentence (or 
> other) boundary information in Lucene.  For instance, for phrase 
> queries, you may not want to match when two terms lie on either side 
> of a sentence boundary.  I know for phrase queries the common approach 
> is to make the position increment larger than one, which solves that 
> immediate problem, but I have other uses for such information, too.  
> Should I just store some type of boundary marker at the appropriate 
> position and check to see if I have a boundary marker when doing my 
> processing?  I know I need an Analyzer that can detect the boundaries, 
> for starters.  What other issues have people run up against?
> Thanks,
> Grant
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:
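
The position-increment approach mentioned in the quoted message can be illustrated with a small sketch in plain Python. It does not use Lucene: `assign_positions`, `phrase_match`, and the gap value of 10 are hypothetical names and values chosen for illustration. The first token of each new sentence gets a larger position increment, so an exact phrase (which needs consecutive positions) cannot match across the boundary.

```python
from typing import Dict, List, Set, Tuple


def assign_positions(sentences: List[List[str]], gap: int = 10) -> List[Tuple[str, int]]:
    """Assign a position to every token. Tokens normally advance by 1;
    the first token of each sentence after the first advances by `gap`."""
    positions = []
    pos = -1
    for s_idx, sentence in enumerate(sentences):
        for t_idx, term in enumerate(sentence):
            increment = gap if (s_idx > 0 and t_idx == 0) else 1
            pos += increment
            positions.append((term, pos))
    return positions


def phrase_match(positions: List[Tuple[str, int]], phrase: List[str]) -> bool:
    """Exact phrase match: the k-th phrase term must sit at start + k."""
    if not phrase:
        return False
    index: Dict[str, Set[int]] = {}
    for term, pos in positions:
        index.setdefault(term, set()).add(pos)
    return any(
        all(start + k in index.get(term, set()) for k, term in enumerate(phrase))
        for start in index.get(phrase[0], set())
    )


sentences = [["lucene", "is", "fast"], ["search", "works"]]
with_gap = assign_positions(sentences, gap=10)
print(with_gap)
print(phrase_match(with_gap, ["is", "fast"]))      # both terms in one sentence
print(phrase_match(with_gap, ["fast", "search"]))  # terms straddle the boundary
```

For sloppy phrase queries the gap would need to be larger than the slop, otherwise a match could still bridge two sentences. The gap also only prevents matches; it does not record where the boundary is, which is why an explicit marker token may still be useful for other processing.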
