lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erik Hatcher <e...@ehatchersolutions.com>
Subject Re: Analyzer which distributes tokens to many fields
Date Tue, 16 May 2006 09:43:19 GMT

On May 16, 2006, at 3:02 AM, Mathias Keilbach wrote:
> I'm going to create a small application with Lucene, which analyze  
> diffenrent Strings. While analyzing the strings, patterns (like  
> emails or urls) shall be sort out and saved in a seperate index field.
> I'm not sure if I can handle this with a self implemented Analyzer  
> class. Afaik you can't distribute single tokens to different fields  
> while analyzing a field.
> At the worst I have to distribute the tokens in a first step and  
> then analyze them in a second step.
> Have anyone an idea how to handle this case with lucene in a better  
> way than mine?

Yeah, an Analyzer is not capable of splitting terms into separate  
fields dynamically.  As you've surmised, you'll need to _parse_ your  
text prior to analyzing it.

	Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message