lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Daisy <>
Subject Solr - Remove specific punctuation marks
Date Mon, 24 Sep 2012 09:08:08 GMT

I am working with apache-solr-3.6.0 on windows machine. I would like to
remove all punctuation marks before indexing except the colon and the

I tried:

<fieldType name="text_ar" class="solr.TextField" positionIncrementGap="100">
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
        <filter class="solr.PatternReplaceFilterFactory"
pattern="[\p{Punct}&&[^\.^\:]]" replacement="" replace="all"/>
But it didn't work. Any Ideas?

View this message in context:
Sent from the Solr - User mailing list archive at

View raw message