lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Michael J. Prichard" <michael_prich...@mac.com>
Subject Re: To Tokenize or Un_Tokenize?
Date Wed, 26 Jul 2006 20:43:10 GMT
karl wettin wrote:

>On Wed, 2006-07-26 at 16:33 -0400, Michael J. Prichard wrote:
>  
>
>>If I want to search an email address (i.e. michael@foo.com) do I need to 
>>Tokenize that field?
>>    
>>
>
>Do you want to match on the full address only, or on parts too? 
>
>If A, don't tokenize. 
>If B, tokenize. And write an analyzer that will handle it.
>
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>For additional commands, e-mail: java-user-help@lucene.apache.org
>
>  
>
I am using a StandardAnalyzer which (i think) keeps the email intact.   
I think the most I need to do is a PrefixQuery.  Will that work w/ 
UN_TOKENIZED?

Now, say I do want to search on parts...I guess my analyzer would have 
to break the email apart as follows:

michael@foo.com  -->  [michael] [foo] [com] 

??

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message