lucene-java-user mailing list archives

From Andrzej Bialecki ...@getopt.org>
Subject Re: LetterTokenizer to allow digits
Date Fri, 05 Nov 2004 17:48:13 GMT
Peter Pimley wrote:

> 
> Hi everybody,
> 
> I have just found myself in the situation of having to subclass 
> CharTokenizer with a class that tests against 
> Character.isLetterOrDigit.  I would use a LetterTokenizer, but it's 
> important for me to allow numbers through, as the documents I'm indexing 
> often have dates such as '2000' or '1945'.
> 
> Obviously it's only a few lines to do this, but I'm sure I'm not the 
> first person to have had to do it.  May I make the feature request that 
> LetterTokenizer should have an 'AllowDigits' property?

I found myself in exactly the same situation, so I support your request.
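
For anyone reading this in the archive, here is a rough, standalone sketch of the behaviour being asked for: splitting text on anything that is not a letter or digit, so that years like '1945' survive as tokens. The class and method names (LetterOrDigitSplitter, tokenize) are made up for illustration and are not part of Lucene; the sketch deliberately avoids any Lucene dependency so it can be run on its own.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, NOT a Lucene class: it only illustrates the
// "letters and digits are token characters" rule discussed above.
class LetterOrDigitSplitter {

    // Returns maximal runs of characters for which
    // Character.isLetterOrDigit is true, in order of appearance.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetterOrDigit(c)) {
                // Still inside a token: keep accumulating.
                current.append(c);
            } else if (current.length() > 0) {
                // A non-token character ends the current token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        // Flush a token that runs to the end of the input.
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }
}
```

Inside Lucene the same rule would presumably live in a CharTokenizer subclass whose isTokenChar(char) returns Character.isLetterOrDigit(c), which is what Peter describes; the loop above only mimics that outside the library.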

-- 
Best regards,
Andrzej Bialecki

-------------------------------------------------
Software Architect, System Integration Specialist
CEN/ISSS EC Workshop, ECIMF project chair
EU FP6 E-Commerce Expert/Evaluator
-------------------------------------------------
FreeBSD developer (http://www.freebsd.org)



