lucene-java-user mailing list archives

From Soeren Pekrul <soeren.pek...@gmx.de>
Subject Re: What is the best way to split substring words
Date Sun, 20 May 2007 07:02:04 GMT
bhecht wrote:
> I want to be able to split tokens by giving a list of substring words.
> So I can give a list of subwords like "strasse" and "gasse",
> and the token "mainstrasse" will be split into the 2 tokens "main" and
> "strasse", and "maingasse" into "main" and "gasse".

Imbemba, Pasqualino: A Splitter for German Compound Words. Free
University of Bolzano, Bozen, 2006.

http://www.gossamer-threads.com/lists/lucene/java-user/40164?do=post_view_threaded
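
For the simple case in the quoted question (a fixed list of trailing
subwords), a minimal sketch in plain Java might look like the following.
It is not the algorithm from the thesis cited above and it does not use any
Lucene API; the class name SubwordSplitter and the method split are
hypothetical names chosen for illustration. It assumes the subword always
sits at the end of the token and picks the longest matching one.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene: splits a token in two when it
// ends with one of the given subwords (assumption: subwords are suffixes).
final class SubwordSplitter {
    private SubwordSplitter() {}

    static List<String> split(String token, List<String> subwords) {
        String best = null;
        for (String sub : subwords) {
            int start = token.length() - sub.length();
            // start > 0 requires a non-empty prefix, so "strasse" alone stays whole.
            if (start > 0
                    && token.regionMatches(true, start, sub, 0, sub.length())
                    && (best == null || sub.length() > best.length())) {
                best = sub; // prefer the longest matching suffix
            }
        }
        List<String> parts = new ArrayList<>();
        if (best == null) {
            parts.add(token); // no known subword: keep the token unchanged
        } else {
            int cut = token.length() - best.length();
            parts.add(token.substring(0, cut));
            parts.add(token.substring(cut));
        }
        return parts;
    }
}
```

With the subword list "strasse", "gasse", calling split on "mainstrasse"
returns the two parts "main" and "strasse", and "maingasse" gives "main" and
"gasse". Inside an analyzer this logic would have to live in a token filter;
real German compounds (linking letters, more than two parts) need something
closer to the dictionary-based approach in the reference above.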
