lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Max Pfingsthorn" <>
Subject Implicit Stopping in StandardTokenizer??
Date Mon, 20 Jun 2005 14:41:22 GMT

I've been trying to make an Analyzer which works like the StandardAnalyzer but without stopping.
For some reason though, I still don't get words like "is" or "a" out of it... I checked with
Luke (one doc in one index with the contents "hello,this,is,a,keyword,hello!,nicetomeetyou".
This should tokenize into "hello this is a keyword hello nicetomeetyou", but actually it does
"hello keyword hello nicetomeetyou". Does anyone know why it drops those extra terms?

Best regards,

Max Pfingsthorn


Oosteinde 11
1017WT Amsterdam
The Netherlands
Tel  +31 (0)20 5224466
------------------------------------------------------------- /

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message