lucene-solr-user mailing list archives

From Walter Underwood <wun...@wunderwood.org>
Subject Re: WordDelimiter filter, expanding to multiple words, unexpected results
Date Tue, 30 Dec 2014 18:02:06 GMT
There are two ways to get the query “mixedCase” to match “mixed Case” in the original
document:

1. Add an index-time synonym.
2. Add a ShingleFilterFactory to the index analysis chain.
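
To illustrate option 2, here is a rough sketch in plain Python (not Solr code) of what index-time shingling buys you. It assumes whitespace tokenization, lowercasing, and two-word shingles joined with an empty separator; the function names are made up for this example.

```python
def analyze(text):
    """Toy stand-in for a tokenizer plus lowercase filter."""
    return [tok.lower() for tok in text.split()]


def index_terms(text):
    """Index-time terms: single words plus adjacent word pairs joined with no separator."""
    words = analyze(text)
    shingles = [a + b for a, b in zip(words, words[1:])]
    return set(words) | set(shingles)


def matches(query, document):
    """True if every query term appears among the document's index-time terms."""
    terms = index_terms(document)
    return all(tok in terms for tok in analyze(query))


print(matches("mixedCase", "mixed Case"))  # True: the shingle "mixedcase" is indexed
print(matches("mixedCase", "mixedCase"))   # True: the original token is indexed
```

Note that in this sketch the reverse case, the query "mixed Case" against a document containing "mixedCase", does not match. That direction is what the index-time WordDelimiter split handles.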

wunder
Walter Underwood
wunder@wunderwood.org
http://observer.wunderwood.org/


On Dec 30, 2014, at 9:50 AM, Michael Sokolov <msokolov@safaribooksonline.com> wrote:

> On 12/30/14 12:42 PM, Jonathan Rochkind wrote:
>> On 12/30/14 12:35 PM, Walter Underwood wrote:
>>> You want preserveOriginal="1".
>>> 
>>> You should only do this processing at index time.
>> 
>> If I only do this processing at index time, then "mixedCase" at query time will no
>> longer match "mixed Case" in the index/source material.
>> 
>> I think I'm having trouble explaining. Let's say the source material being indexed
>> included "mixed Case", not "mixedCase".  I want "mixedCase" in query to still match it.
>> 
>> But if the source material that went into the index contained "mixedCase", I still
>> want "mixedCase" in query to match it as well.
>> 
> I think the idea is like this:
> 
> index (with preserveOriginal=1):
> 
>   mixedCase -> mixed case | mixedcase
>   mixed Case -> mixed case
> 
> query (without preserveOriginal):
>   mixedCase -> mixed case
>   mixed Case -> mixed case
> 
> so both should match
> 
> -Mike

