lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jolinar13 <jolina...@gmail.com>
Subject Re: Very odd behaviour of FrenchAnalyzer with strings in capital letters
Date Mon, 28 May 2007 14:00:38 GMT

Thanks to Luke, I realized my terms were not parsed correctly, and this has
nothing to do with upper case!
It seems to happen when the word ends with "ni". For example "giovanni" is
parsed "giovann".
Something about this?
Florian


Jolinar13 wrote:
> 
> Hello Mark!
> Thank you a lot for your answer.
> You are right for the Luke part. My Luke version was too old. My bad.
> But with Luke I still observe the problem I described.
> Any idea how to sort this out?
> Maybe this has to do with the fact I use Compass?
> Thank you
> Florian
> 
>>>>> I got strange
>>>>> search results on strings in uppercase. (example : VEHICLE)
>>>>> When I search the string (in lower case), I get no result. I get
>>>>> results
>>>>> if
>>>>> I use "vehicle*" or "vehiclE", or "vehicLe" etc.
>>>>>
>>>>> What is odd is that it affects only some of the strings, not all of
>>>>> them.
> 
> 
> markrmiller wrote:
>> 
>> FrenchAnalyzer does lowercase and using it would not in anyway alter 
>> Lukes ability to read your index.
>> 
>> - Mark
>> 
>> Jolinar13 wrote:
>>> Hello Erick,
>>> Still no idea about my problem?
>>> Anybody here using the FrenchAnalyzer?
>>> Thanks,
>>> Florian
>>>
>>>
>>> Jolinar13 wrote:
>>>   
>>>> Hello,
>>>> Thank you for your quick answer.
>>>> I use Luke to examine the index, but since I switched to
>>>> FrenchAnalyzer,
>>>> it says 'Not a Lucene index'.
>>>> If I open the index files in a text viewer, the strings are in UPPER
>>>> case.
>>>> I do use the same analyzer to index and search.
>>>> So, do I have to specify the FrenchAnalyzer not to be case sensitive?
>>>> How
>>>> to do that?
>>>> Thanks a lot
>>>> Florian
>>>>
>>>>
>>>> Erick Erickson wrote:
>>>>     
>>>>> First have you gotten a copy of Luke to examine your index to see
>>>>> what's actually indexed?
>>>>>
>>>>> The default behavior is usually to lowercase everything, but I'm not
>>>>> entirely sure if the French analyzer does this. But I suspect so.
>>>>>
>>>>> Searches are case sensitive. To get caseless searching, you need
>>>>> to put everything in the same case. This is usually done for you with
>>>>> any of the standard analyzers, but check specifically.
>>>>>
>>>>> Are you using the same analyzer at index AND search time?
>>>>>
>>>>> Best
>>>>> Erick
>>>>>
>>>>> On 5/21/07, Jolinar13 <jolinar13@gmail.com> wrote:
>>>>>       
>>>>>> Hello,
>>>>>>
>>>>>> I tried org.apache.lucene.analysis.fr.FrenchAnalyzer and I got
>>>>>> strange
>>>>>> search results on strings in uppercase. (example : VEHICLE)
>>>>>> When I search the string (in lower case), I get no result. I get
>>>>>> results
>>>>>> if
>>>>>> I use "vehicle*" or "vehiclE", or "vehicLe" etc.
>>>>>>
>>>>>> What is odd is that it affects only some of the strings, not all
of
>>>>>> them.
>>>>>> Anyone who has ever experienced this problem?
>>>>>>
>>>>>> Thanks,
>>>>>> Florian
>>>>>> --
>>>>>> View this message in context:
>>>>>> http://www.nabble.com/Very-odd-behaviour-of-FrenchAnalyzer-with-strings-in-capital-letters-tf3789153.html#a10715673
>>>>>> Sent from the Lucene - Java Users mailing list archive at Nabble.com.
>>>>>>
>>>>>>
>>>>>> ---------------------------------------------------------------------
>>>>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>>>>>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>>>>>
>>>>>>
>>>>>>         
>>>>>       
>>>>     
>>>
>>>   
>> 
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>> 
>> 
>> 
> 
> 

-- 
View this message in context: http://www.nabble.com/Very-odd-behaviour-of-FrenchAnalyzer-with-strings-in-capital-letters-tf3789153.html#a10836694
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message