lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Grant Ingersoll <gsing...@apache.org>
Subject Re: prefix matching
Date Thu, 23 Apr 2009 20:37:39 GMT
Hmm, did some poking around and this conversation rung a bell from the  
Lucene list see http://www.lucidimagination.com/search/document/3e4ce083206664d2/ngrams_and_positions#3e4ce083206664d2

Looks like Lucene would need to solve LUCENE-1224 and LUCENE-1225.

https://issues.apache.org/jira/browse/LUCENE-1224
https://issues.apache.org/jira/browse/LUCENE-1225

-Grant


On Apr 23, 2009, at 10:52 AM, Tom Morton wrote:

> Hi all,
>  I'm trying to use prefixes to match similar strings to a query  
> string.  I
> have the following field type:
>
>  <fieldtype name="prefix" stored="true" indexed="true"
> class="solr.TextField">
>      <analyzer>
>        <tokenizer class="solr.StandardTokenizerFactory"/>
>        <filter class="solr.LowerCaseFilterFactory"/>
>        <filter class="solr.StopFilterFactory"/>
>        <filter class="solr.EdgeNGramFilterFactory" minGramSize="2"
> maxGramSize="10"/>
>      </analyzer>
>  </fieldtype>
>
> field:
>   <field name="wordPrefix" type="prefix" indexed="true"  
> stored="true"/>
>
> copyField:
> <copyField source="word" dest="wordPrefix"/>
>
> If I apply this to an indexed string: "ipod shuffle" and query string:
> "shufle" (missing f) I get matching terms for "sh", "shu" "shuf"
> Index Analyzer  ipodshuffle  ipodshuffle  ipodshuffle   
> ipipoipodshshushuf
> shuffshufflshuffle Query Analyzer  shufle  shufle  shufle  
> shshushufshufl
> shufle
> However when I query for with "shufle" i get no results:
>
> http://localhost:8983/solr/select?q=wordPrefix%3Ashufle&fl=wordPrefix&qt=standard&debugQuery=on
>
> <lst name="debug">
> <str name="rawquerystring">wordPrefix:shufle</str>
> <str name="querystring">wordPrefix:shufle</str>
> -
> <str name="parsedquery">
> PhraseQuery(wordPrefix:"sh hu uf fl le shu huf ufl fle shuf hufl  
> ufle shufl
> hufle shufle")
> </str>
> -
> <str name="parsedquery_toString">
> wordPrefix:"sh hu uf fl le shu huf ufl fle shuf hufl ufle shufl hufle
> shufle"
> </str>
>
> This post suggests that I need to set the Position Increment for the  
> my
> token filter, but I'm not sure how to do that or if it's possible.
>
> http://www.lucidimagination.com/search/document/bc643c39f0b6e423/queryparser_and_ngrams#629b39ea39aa9cd4
>
> Thoughts?  Thanks...Tom

--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
using Solr/Lucene:
http://www.lucidimagination.com/search


Mime
View raw message