lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Igal @ getRailo.org" <i...@getrailo.org>
Subject using CharFilter to inject a space
Date Sat, 03 Nov 2012 23:35:29 GMT
hi,

I want to make sure that every comma (,) and semi-colon (;) is followed 
by a space prior to tokenizing.

the idea is to then use a WhitespaceTokenizer which will keep commas but 
still split the phrase in a case like:

     "I bought red apples,green pears,and yellow oranges"

I'm thinking of extending CharFilter to "inject" a space after the 
comma.  my questions are:

     1) does it make sense or am I completely off here?

     2) are there any code examples of CharFilter implementations with 
injection of a char?

TIA

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message