lucene-java-user mailing list archives

From Soeren Pekrul
Subject Re: What is the best way to split substring words
Date Sun, 20 May 2007 07:02:04 GMT
bhecht wrote:
> I want to be able to split tokens by giving a list of substring words.
> So I can give a list of subwords like "strasse", "gasse",
> and the token "mainstrasse" or "maingasse" will be split into 2 tokens: "main"
> and "strasse" (or "gasse").

Imbemba, Pasqualino: A Splitter for German Compound Words. Free
University of Bolzano, Bozen, 2006.
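
For the simple case described in the question, a plain suffix match against the subword list may already be enough. The following is only a minimal sketch under stated assumptions, not the method from the thesis cited above and not a Lucene API: the class name SubwordSplitter and the method split are hypothetical, tokens are assumed to be lowercased already, and only a single trailing subword is split off (the longest match wins).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Lucene: splits a token at a known trailing subword.
public class SubwordSplitter {

    // Returns [prefix, subword] when the token ends with one of the subwords,
    // otherwise a single-element list holding the unchanged token.
    // Assumes token and subwords are already lowercased.
    public static List<String> split(String token, List<String> subwords) {
        String best = null;
        for (String sub : subwords) {
            // Require a non-empty prefix so that "strasse" itself is not split.
            boolean matches = !sub.isEmpty()
                    && token.length() > sub.length()
                    && token.endsWith(sub);
            if (matches && (best == null || sub.length() > best.length())) {
                best = sub; // prefer the longest matching subword
            }
        }

        List<String> parts = new ArrayList<String>();
        if (best == null) {
            parts.add(token);
            return parts;
        }
        int cut = token.length() - best.length();
        parts.add(token.substring(0, cut));
        parts.add(token.substring(cut));
        return parts;
    }

    public static void main(String[] args) {
        List<String> subwords = Arrays.asList("strasse", "gasse");
        System.out.println(split("mainstrasse", subwords)); // [main, strasse]
        System.out.println(split("maingasse", subwords));   // [main, gasse]
        System.out.println(split("strasse", subwords));     // [strasse]
    }
}
```

Inside an analysis chain, the same logic would sit in a custom token filter that emits the parts as separate tokens. Compounds with more than two parts, or with linking letters between the parts, would need a more careful approach such as the one in the reference above.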
