lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Miro Max <kikirik...@yahoo.de>
Subject Re: StopWord elimination pls. HELP
Date Mon, 18 Oct 2004 10:15:40 GMT
thans for your help

 --- Morus Walter <morus.walter@tanto.de> schrieb: 
> Miro Max writes:
> 
> > String cont = rs.getString("x");
> > d.add(Field.Text("cont", cont));
> > writer.addDocument(d);
> > 
> > to get results from a database into lucene index.
> but
> > when i check println(d) i can see the german
> stopwords
> > too. how can i eliminate this?
> > 
> Stopwords in an analyzer don't make the stopwords
> disappear from the document,
> they only prevent them from beeing indexed.
> So you will allways see stopwords in the document
> (before indexing and,
> if the field is stored, when the document is
> retrieved from the index).
> 
> A meaningful check, if stopwords are recognized,
> would be to search for
> a stopword. You shouldn't find anything...
> 
> HTH
> 	Morus
> 
>
---------------------------------------------------------------------
> To unsubscribe, e-mail:
> lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail:
> lucene-user-help@jakarta.apache.org
> 
>  


	

	
		
___________________________________________________________
Gesendet von Yahoo! Mail - Jetzt mit 100MB Speicher kostenlos - Hier anmelden: http://mail.yahoo.de

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message