lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Morus Walter <>
Subject Re: StopWord elimination pls. HELP
Date Mon, 18 Oct 2004 07:46:23 GMT
Miro Max writes:

> String cont = rs.getString("x");
> d.add(Field.Text("cont", cont));
> writer.addDocument(d);
> to get results from a database into lucene index. but
> when i check println(d) i can see the german stopwords
> too. how can i eliminate this?
Stopwords in an analyzer don't make the stopwords disappear from the document,
they only prevent them from beeing indexed.
So you will allways see stopwords in the document (before indexing and,
if the field is stored, when the document is retrieved from the index).

A meaningful check, if stopwords are recognized, would be to search for
a stopword. You shouldn't find anything...


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message