lucene-java-user mailing list archives

From "Erick Erickson" <erickerick...@gmail.com>
Subject Re: is there such an analyzer?
Date Wed, 16 Aug 2006 20:03:34 GMT
I suspect you'll have to roll your own analyzer. I'd use the SynonymAnalyzer
from Lucene in Action (starting around page 129) as a model. I doubt Lucene
can do much for you out of the box for this specialized kind of tokenizing.
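
To make the idea concrete, here is a rough sketch in plain Java of the token
expansion the example quoted below seems to ask for. It is not a Lucene
Analyzer and uses no Lucene classes; it only shows the splitting logic you
would wrap in your own TokenFilter. The class name, the rules (split on
whitespace, lowercase, keep the whole token, then add hyphen-separated parts
with surrounding punctuation stripped), and especially the stop list are my
assumptions, picked so that the sample input yields the requested terms.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

class HybridTokenizerSketch {
    // Hypothetical stop list: chosen only so the sample sentence produces
    // the terms requested in the original question.
    static final Set<String> STOP_WORDS =
        new HashSet<>(Arrays.asList("i", "da", "who", "has", "a", "said"));

    // Returns the terms to index, in order of appearance.
    static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String raw : text.trim().split("\\s+")) {
            if (raw.isEmpty()) {
                continue;
            }
            // Whitespace-style token, lowercased and kept whole.
            String whole = raw.toLowerCase(Locale.ROOT);
            addTerm(terms, whole);
            // Sub-terms: hyphen-separated parts with leading and trailing
            // punctuation stripped. Inner dots (as in "mr.t") are left alone.
            for (String part : whole.split("-")) {
                String stripped = part.replaceAll("^[^a-z0-9]+|[^a-z0-9]+$", "");
                if (!stripped.equals(whole)) {
                    addTerm(terms, stripped);
                }
            }
        }
        return terms;
    }

    // Adds a term unless it is empty or a stop word.
    private static void addTerm(List<String> terms, String term) {
        if (!term.isEmpty() && !STOP_WORDS.contains(term)) {
            terms.add(term);
        }
    }

    public static void main(String[] args) {
        String input = "I-Pity-da-fool who has a 1\" ladder said MR.T";
        for (String term : tokenize(input)) {
            System.out.println(term);
        }
    }
}
```

In a real analyzer you would probably want the sub-terms emitted at the same
position as the whole token, the way the SynonymAnalyzer example stacks its
synonyms, so that phrase queries still line up. That part is left out here.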

Erick


On 8/16/06, Van Nguyen <vnguyen@ur.com> wrote:
>
>  I'm looking for a cross between a WhitespaceAnalyzer and
> StandardAnalyzer.  If I pass in:
>
> I-Pity-da-fool who has a 1" ladder said MR.T
>
> I want it to index these:
>
> i-pity-da-fool
> pity
> fool
> 1"
> 1
> ladder
> mr.t
