lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Hatcher <e...@ehatchersolutions.com>
Subject Re: Confused about non-tokenized fields
Date Fri, 27 May 2005 20:03:22 GMT

On May 27, 2005, at 12:14 PM, Gusenbauer Stefan wrote:

> Max Pfingsthorn wrote:
>
>
>> Hi!
>>
>> Thanks for the reply. I figured already that fields are actually  
>> not tokenized... I lost track of the filenames/dirnames and there  
>> were some duplicates...
>>
>> About case-insensitivity: Okay, I can make my query lower case,  
>> but my strings in the field are not... I guess I have to do that  
>> manually during indexing? Or is there some nicer way?
>>
>>
>>
> I think this is not a problem. This should be done automatically when
> you make a case insensitiv search so that you don't have to think  
> about
> it. If it should become a problem write another email *g*

If you index but do not tokenize, then case is preserved from the  
original text.  It's the tokenization process, via the specified  
Analyzer, that typically lowercases.

So, yes, you would need to do that manually on the text you hand to a  
Field for untokenized fields.

     Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message