lucene-java-user mailing list archives

From "Erick Erickson" <erickerick...@gmail.com>
Subject Re: What is the best way to split substring words
Date Sat, 19 May 2007 21:50:55 GMT
You should probably write a custom analyzer and/or filter that breaks
your streams up into the custom tokens you want. Depending upon
what you're really trying to accomplish, you may well need to use the
same analyzer at BOTH index and search times, so that query terms are
split the same way as the indexed terms.
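
As a rough illustration of the splitting step only, here is a minimal
sketch in plain Java. It assumes the subwords only matter at the end of a
token and that the longest matching subword should win; neither is stated
in the question. The class and method names (SubwordSplitter, split) are
made up for this example, and the Lucene wiring is deliberately left out,
since in a real setup this logic would sit inside your custom filter.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Lucene: splits a token on a known suffix subword.
class SubwordSplitter {

    // Returns [prefix, subword] when the token ends with one of the subwords
    // and has a non-empty prefix; otherwise returns the token unchanged.
    // Assumption: only suffix matches count, and the longest subword wins.
    static List<String> split(String token, List<String> subwords) {
        String best = null;
        for (String sub : subwords) {
            boolean usable = !sub.isEmpty()
                    && token.length() > sub.length()   // keep "strasse" itself whole
                    && token.endsWith(sub);
            if (usable && (best == null || sub.length() > best.length())) {
                best = sub;
            }
        }
        List<String> out = new ArrayList<>();
        if (best == null) {
            out.add(token);
        } else {
            out.add(token.substring(0, token.length() - best.length()));
            out.add(best);
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> subwords = Arrays.asList("strasse", "gasse");
        for (String token : new String[] {"mainstrasse", "maingasse", "main"}) {
            System.out.println(token + " -> " + split(token, subwords));
        }
    }
}
```

If you run the same split over both the indexed text and the query text,
a search for "strasse" or for "main" can match a document that contained
"mainstrasse".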

Best
Erick


On 5/19/07, bhecht <bhecht@ams-sys.com> wrote:
>
>
> Hi there,
>
> I want to be able to split tokens by giving a list of substring words.
> So I can give a list of subwords like "strasse" and "gasse",
> and the token "mainstrasse" or "maingasse" will be split into 2 tokens:
> "main" and "strasse", or "main" and "gasse".
>
> Thanks
