lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From TimF <...@timflanders.com>
Subject Re: Custom Analyzer Help please
Date Wed, 28 Mar 2007 15:18:57 GMT

Grant,
Thanks for your reply and the pointer to the custom code sample. I will be
checking into that today. I did delve into the src for the OOTB analyzers
and was aware of what they did. Still, the StandardAnalyzer does not do what
I want. The real difference between my needs and the results of the
StandardAnalyzer is that what I want is the union of the StandardAnalyzer
and the StopAnalyzer. If you refer back to my original example...

An example of the data might be as follows:
   Hello XY&Z Corporation - abc@example.com 
I would like the following terms to come out of the analyzer:
 [hello]  [xy&z]  [corporation] [abc@example] [com]  //this is the
StandardAnalyzer output
as well as
  [xy] [z]  [abc] [example] 

I figured that creating a custom analyzer is the only way to do that, but
unfortunately I am not that familiar with how the analyzers "really" work
internally( I am more of a mathematician than a lexicon). 

If you have any other thoughts or ideas I would love to hear.
Thanks,
Tim

Grant Ingersoll-6 wrote:
> 
> So, I think the answer is that StandardAnalyzer already has what you  
> state you want.  Is it, perhaps, that certain stopwords that you are  
> interested in are not currently being stopped?
> 

-- 
View this message in context: http://www.nabble.com/Custom-Analyzer-Help-please-tf3469904.html#a9716016
Sent from the Lucene - Java Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message