lucene-dev mailing list archives

From "Jack Krupansky" <j...@basetechnology.com>
Subject Re: Proposal: Full support for multi-word synonyms at query time
Date Fri, 10 Aug 2012 18:10:59 GMT
The Javadoc says "positionLength determines how many positions this token 
spans". It's not obvious from the documentation how the full graph structure 
for nested multi-word synonyms can be expressed merely using that attribute. 
Is this detailed anywhere? (Maybe in Jira... but it is still down.) I mean, 
a multi-word synonym is multiple tokens. How does any one of those "tokens"
span more than one position?

-- Jack Krupansky

-----Original Message----- 
From: Robert Muir
Sent: Friday, August 10, 2012 1:44 PM
To: dev@lucene.apache.org
Subject: Re: Proposal: Full support for multi-word synonyms at query time

On Fri, Aug 10, 2012 at 1:36 PM, Jack Krupansky <jack@basetechnology.com> 
wrote:
> One of the ongoing potholes of Solr and Lucene is lack of full support for
> multi-word synonyms at query time. The root of the problem is twofold:
> individual terms are presented for analysis which precludes recognition of
> multi-term synonyms, and the output stream from the analysis process is a
> single, linear stream without regard to any graph/lattice structure for
> multiple synonyms.

But this is not true. PositionLengthAttribute was already added, which
makes it a graph.
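
To make the graph claim concrete, here is a minimal sketch of how a position
increment plus a position length could turn a flat token stream into arcs
between positions. It does not use Lucene itself: the class TokenGraphSketch,
its Token and Arc types, and the "dns" / "domain name system" example are all
hypothetical, chosen only for illustration. It assumes the usual convention
that a token starts at (previous position + positionIncrement) and ends
positionLength positions later, and it makes no claim about what any
particular synonym filter actually emits.

```java
import java.util.ArrayList;
import java.util.List;

public class TokenGraphSketch {

    /** Hypothetical stand-in for one analyzed token and its two position attributes. */
    static final class Token {
        final String term;
        final int positionIncrement; // distance from the previous token's start position
        final int positionLength;    // how many positions this token spans

        Token(String term, int positionIncrement, int positionLength) {
            this.term = term;
            this.positionIncrement = positionIncrement;
            this.positionLength = positionLength;
        }
    }

    /** One arc of the graph: the term labels an edge from node `from` to node `to`. */
    static final class Arc {
        final String term;
        final int from;
        final int to;

        Arc(String term, int from, int to) {
            this.term = term;
            this.from = from;
            this.to = to;
        }
    }

    /** Walks the flat stream once and derives each token's start and end node. */
    static List<Arc> toArcs(List<Token> tokens) {
        List<Arc> arcs = new ArrayList<>();
        int position = -1; // assumed convention: first token with increment 1 lands on 0
        for (Token t : tokens) {
            position += t.positionIncrement;
            arcs.add(new Arc(t.term, position, position + t.positionLength));
        }
        return arcs;
    }

    public static void main(String[] args) {
        // "dns" is stacked on "domain" (increment 0) but spans all three positions.
        List<Token> tokens = new ArrayList<>();
        tokens.add(new Token("dns", 1, 3));
        tokens.add(new Token("domain", 0, 1));
        tokens.add(new Token("name", 1, 1));
        tokens.add(new Token("system", 1, 1));

        for (Arc a : toArcs(tokens)) {
            System.out.println(a.from + " -> " + a.to + " : " + a.term);
        }
    }
}
```

Under these assumptions the single-token synonym "dns" becomes one arc from
node 0 to node 3, running in parallel with the three-arc path "domain",
"name", "system". That is the sense in which one token spans more than one
position: the multi-word side stays as several tokens of length 1, and the
alternative that replaces them all carries the longer position length.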

-- 
lucidimagination.com

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org 



