lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Gusenbauer Stefan <gusenba...@eduhi.at>
Subject Re: Confused about non-tokenized fields
Date Sat, 28 May 2005 02:10:52 GMT
Erik Hatcher wrote:

>
> On May 27, 2005, at 12:14 PM, Gusenbauer Stefan wrote:
>
>> Max Pfingsthorn wrote:
>>
>>
>>> Hi!
>>>
>>> Thanks for the reply. I figured already that fields are actually 
>>> not tokenized... I lost track of the filenames/dirnames and there 
>>> were some duplicates...
>>>
>>> About case-insensitivity: Okay, I can make my query lower case,  but
>>> my strings in the field are not... I guess I have to do that 
>>> manually during indexing? Or is there some nicer way?
>>>
>>>
>>>
>> I think this is not a problem. This should be done automatically when
>> you make a case insensitiv search so that you don't have to think  about
>> it. If it should become a problem write another email *g*
>
>
> If you index but do not tokenize, then case is preserved from the 
> original text.  It's the tokenization process, via the specified 
> Analyzer, that typically lowercases.
>
> So, yes, you would need to do that manually on the text you hand to a 
> Field for untokenized fields.
>
>     Erik
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
>
thanks that was new to me i will be more carefull before i give out some
suggestions
stefan


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message