lucene-java-user mailing list archives

From "Will Martin" <wmartin...@gmail.com>
Subject RE: A really hairy token graph case
Date Fri, 24 Oct 2014 21:44:35 GMT
Hi Benson:

The same situation comes up with n-gramming, although I imagine your logic for choosing
start positions is more complicated than most. Does that help get your ideas unblocked?

Will

-----Original Message-----
From: Benson Margulies [mailto:bimargulies@gmail.com] 
Sent: Friday, October 24, 2014 4:43 PM
To: java-user@lucene.apache.org
Subject: A really hairy token graph case

Consider a case where we have a token which can be subdivided in several ways. This can happen
in German. We'd like to represent this with positionIncrement/positionLength, but it does
not seem possible.

Once the position has moved out from one set of 'subtokens', we see no way to move it back
for the second set of alternatives.

Is this something that was considered?
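
To make the question concrete, here is a minimal sketch of one possible layout. It assumes
the alternatives may be emitted sorted by start position instead of one segmentation at a
time, so that positionIncrement never has to go backwards. The compound "Wachstube", the
Tok class and the helper names are illustrative only and do not come from this thread. The
code does not use the Lucene API; it only models the positionIncrement/positionLength
arithmetic.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class TokenGraphSketch {

    /** Hypothetical stand-in for a token's term, positionIncrement and positionLength. */
    static final class Tok {
        final String term;
        final int posInc;
        final int posLen;

        Tok(String term, int posInc, int posLen) {
            this.term = term;
            this.posInc = posInc;
            this.posLen = posLen;
        }
    }

    /** Illustrative stream: "Wachstube" read as Wach+Stube or as Wachs+Tube. */
    static List<Tok> wachstube() {
        return Arrays.asList(
            new Tok("Wachstube", 1, 3), // node 0 -> 3 (the whole compound)
            new Tok("Wach", 0, 1),      // node 0 -> 1
            new Tok("Wachs", 0, 2),     // node 0 -> 2
            new Tok("Stube", 1, 2),     // node 1 -> 3
            new Tok("Tube", 1, 1));     // node 2 -> 3
    }

    /** Lists every path through the graph implied by the increments and lengths. */
    static List<List<String>> paths(List<Tok> stream) {
        int n = stream.size();
        int[] start = new int[n];
        int[] end = new int[n];
        int pos = -1;  // the first token's increment of 1 puts it at node 0
        int last = 0;
        for (int i = 0; i < n; i++) {
            Tok t = stream.get(i);
            if (t.posInc < 0 || t.posLen < 1) {
                throw new IllegalArgumentException("bad token: " + t.term);
            }
            pos += t.posInc;            // positions only ever move forward
            start[i] = pos;
            end[i] = pos + t.posLen;    // the length says where the token ends
            last = Math.max(last, end[i]);
        }
        List<List<String>> out = new ArrayList<>();
        walk(stream, start, end, 0, last, new ArrayList<>(), out);
        return out;
    }

    private static void walk(List<Tok> stream, int[] start, int[] end, int node,
                             int last, List<String> prefix, List<List<String>> out) {
        if (node == last) {
            out.add(new ArrayList<>(prefix));
            return;
        }
        for (int i = 0; i < stream.size(); i++) {
            if (start[i] == node) {
                prefix.add(stream.get(i).term);
                walk(stream, start, end, end[i], last, prefix, out);
                prefix.remove(prefix.size() - 1);
            }
        }
    }

    public static void main(String[] args) {
        for (List<String> p : paths(wachstube())) {
            System.out.println(p);
        }
    }
}
```

In this layout each segmentation gets its own intermediate node, so only Wachstube,
Wach+Stube and Wachs+Tube are paths; the mixed readings Wach+Tube and Wachs+Stube are
not. Whether a real tokenizer can produce the tokens in this order is a separate question.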

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org




