lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jack Krupansky" <j...@basetechnology.com>
Subject Re: Why PhraseQuery translate stopwords to "?"
Date Mon, 09 Dec 2013 22:02:00 GMT
The analyzer is generating holes for the stop words - the position of the 
subsequent term is incremented an extra time for each stop word so that 
their positions are maintained.

-- Jack Krupansky

-----Original Message----- 
From: Jean-Claude Dauphin
Sent: Monday, December 09, 2013 4:15 PM
To: java-user@lucene.apache.org
Subject: Why PhraseQuery translate stopwords to "?"

Hi,

My application uses an analyzer with a StopWordFilter. PhraseQuery
translates queries with stopwords by replacing stopwords to "?" characters.
For example, "Java and Lucene" is replaced by "Java ? Lucene" and "to
contribute" is replaced by "? contribute" . Sequence of terms are indexed
without stopwords. Query Searching works when the stopword starts the
phrase but no results when the "?"  is not at the beginning.

Searching for phrases without stopwords works well.

Any explanation/FAQ/user-list-message that explains why PhraseQuery
translate stopwords to "?" would be appreciated.

Thank you in advance

Jean-Claude Dauphin

-- 
Jean-Claude Dauphin

jc.dauphin@gmail.com
jc.dauphin@afus.unesco.org

http://kenai.com/projects/j-isis/
http://www.unesco.org/isis/
http://www.unesco.org/idams/
http://www.greenstone.org 


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message