lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From yahootintin.11533...@bloglines.com
Subject Strange tokenization with StandardFilter
Date Mon, 21 Nov 2005 23:54:04 GMT
I'm using a StandardFilter and seeing some strange tokenization.

Here's
the input:
apache.org hosts lucene at apache.org.

Here's the tokens it
outputs:
 apache.org
 hosts
 lucene
 at 
 apacheorg

Is this a bug
that apache.org and apache.org. don't convert to the same token?

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message