lucene-solr-user mailing list archives

From Ahmet Arslan <iori...@yahoo.com>
Subject Re: Which Tokeniser (and/or filter)
Date Tue, 07 Feb 2012 13:11:01 GMT
> I'm still finding matches across
> newlines
> 
> index...
> 
> i am fluent
> german racing
> 
> search...
> 
> "fluent german" 
> 
> Any suggestions?  

You can use a multiValued field for this. Split your document on newlines at the client
side and send each line as a separate value.

<arr>i am fluent</arr>
<arr>german racing</arr>

Setting positionIncrementGap="100" on the field type will then prevent the phrase query "fluent german" from matching across the two values.
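
To see why the gap helps, here is a rough Python model of the idea. It is not Solr code: the helper names (split_values, token_positions, phrase_matches) are made up for illustration, tokenization is plain whitespace splitting, and the position arithmetic is a simplification of what Lucene does.

```python
def split_values(document):
    """Split a document into one field value per non-empty line (client side)."""
    return [line.strip() for line in document.splitlines() if line.strip()]


def token_positions(values, gap=100):
    """Assign a position to every whitespace token.

    Simplified model of a multiValued field: tokens inside one value are one
    position apart, and `gap` extra positions are added between values.
    """
    positions = []
    pos = -1
    for i, value in enumerate(values):
        if i > 0:
            pos += gap  # stands in for positionIncrementGap
        for token in value.lower().split():
            pos += 1
            positions.append((token, pos))
    return positions


def phrase_matches(positions, phrase):
    """Return True if the phrase terms occur at consecutive positions."""
    terms = phrase.lower().split()
    index = {}
    for token, pos in positions:
        index.setdefault(token, set()).add(pos)
    return any(
        all(start + offset in index.get(term, set())
            for offset, term in enumerate(terms))
        for start in index.get(terms[0], set())
    )


values = split_values("i am fluent\ngerman racing")
print(values)
print(phrase_matches(token_positions(values, gap=100), "fluent german"))
print(phrase_matches(token_positions(values, gap=0), "fluent german"))
```

With a gap of 100, "fluent" and "german" end up far apart, so the exact phrase no longer matches; with no gap they are adjacent, which is the behaviour you are seeing now.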

Or maybe you can inject artificial tokens via

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.PatternReplaceCharFilterFactory

Your document then becomes: i am fluent NEWLINE german racing
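
The effect of such a replacement can be sketched in Python. This only imitates the idea with re.sub; in Solr the substitution would be configured on the char filter in the schema, and the function names and the NEWLINE marker below are just placeholders for illustration.

```python
import re


def inject_newline_token(text, marker="NEWLINE"):
    """Replace each line break with an artificial marker token."""
    return re.sub(r"\s*\n\s*", f" {marker} ", text.strip())


def contains_phrase(text, phrase):
    """Return True if the phrase terms appear as adjacent whitespace tokens."""
    tokens = text.lower().split()
    terms = phrase.lower().split()
    return any(
        tokens[i:i + len(terms)] == terms
        for i in range(len(tokens) - len(terms) + 1)
    )


original = "i am fluent\ngerman racing"
marked = inject_newline_token(original)
print(marked)
print(contains_phrase(original, "fluent german"))
print(contains_phrase(marked, "fluent german"))
```

Because the marker token sits between "fluent" and "german", the two words are no longer adjacent and the exact phrase does not match.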
