lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Yonik Seeley <yo...@lucidimagination.com>
Subject Re: Proposal: Full support for multi-word synonyms at query time
Date Fri, 10 Aug 2012 18:15:20 GMT
On Fri, Aug 10, 2012 at 2:10 PM, Jack Krupansky <jack@basetechnology.com> wrote:
> The Javadoc says "positionLength determines how many positions this token
> spans". It's not obvious from the documentation how the full graph structure
> for nested multi-word synonyms can be expressed merely using that attribute.
> Is this detailed anywhere? (Maybe in Jira... but it is still down.) I mean,
> a multi-word synonym is multiple tokens. How does any of the "tokens" span
> more than one position?

You sort of do it in reverse I think... make the small token take up a
bigger amount of space.

so for
 (US | united states) gold medals

"US" would have a length of 2 so it would skip ahead to "gold", while
"united" and "states" would both have normal values of 1.

-Yonik
http://lucidworks.com

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message