lucene-dev mailing list archives

From "Dawid Weiss (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-3185) PatternReplaceCharFilterFactory can't replace with ampersands in index
Date Thu, 01 Mar 2012 22:08:01 GMT

    [ https://issues.apache.org/jira/browse/SOLR-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13220421#comment-13220421 ]

Dawid Weiss commented on SOLR-3185:
-----------------------------------

Are there any other filters in the chain? PatternReplaceCharFilterFactory itself doesn't
replace any HTML entities, so that would be odd. Also, can you quote the XML verbatim? If you
have this:

{noformat}
<charFilter class="solr.PatternReplaceCharFilterFactory" 
                    pattern="(^\w)\s[&amp;]\s(\w)" 
                    replacement="$1&amp;amp;$2" />
{noformat}
then the XML parser unescapes the attribute once, so the replacement value will indeed be:
{noformat}
$1&amp;$2
{noformat}
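
To make the escaping concrete, here is a minimal sketch in plain Java. It is not Solr code: the class name AmpersandEscapingDemo and the helper apply are hypothetical, and String.replaceAll stands in for the char filter on the assumption that the filter hands the unescaped attribute values to java.util.regex in the same way. It parses a charFilter element, reads the attributes as the XML parser delivers them, and runs the replacement on 'A & B'.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;

public class AmpersandEscapingDemo {
    // The configuration quoted in the issue: the ampersand is escaped twice.
    static final String DOUBLE_ESCAPED =
        "<charFilter pattern=\"(^\\w)\\s[&amp;]\\s(\\w)\" replacement=\"$1&amp;amp;$2\" />";

    // The same configuration with the ampersand escaped once.
    static final String SINGLE_ESCAPED =
        "<charFilter pattern=\"(^\\w)\\s[&amp;]\\s(\\w)\" replacement=\"$1&amp;$2\" />";

    // Parse the element, then apply its pattern/replacement attributes to the input.
    // The XML parser has already unescaped the attribute values exactly once here.
    static String apply(String xml, String input) throws Exception {
        Element element = DocumentBuilderFactory.newInstance()
            .newDocumentBuilder()
            .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)))
            .getDocumentElement();
        String pattern = element.getAttribute("pattern");         // (^\w)\s[&]\s(\w)
        String replacement = element.getAttribute("replacement");
        return input.replaceAll(pattern, replacement);
    }

    public static void main(String[] args) throws Exception {
        System.out.println(apply(DOUBLE_ESCAPED, "A & B")); // prints A&amp;B
        System.out.println(apply(SINGLE_ESCAPED, "A & B")); // prints A&B
    }
}
```

In this sketch only the singly escaped replacement produces a bare ampersand; whether that matches what ends up in the index still depends on the rest of the filter chain.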
                
> PatternReplaceCharFilterFactory can't replace with ampersands in index
> ----------------------------------------------------------------------
>
>                 Key: SOLR-3185
>                 URL: https://issues.apache.org/jira/browse/SOLR-3185
>             Project: Solr
>          Issue Type: Bug
>          Components: Schema and Analysis
>    Affects Versions: 3.5
>            Reporter: Mike Spencer
>            Priority: Minor
>              Labels: PatternReplaceCharFilter, regex
>
> Using solr.PatternReplaceCharFilterFactory to replace 'A & B' (no quotes) with 'A&B'
> (no spaces) will result in 'A&amp;amp;B' being indexed. Query analysis will give the expected
> result of 'A&B'. I examined the index with both standalone Luke and the schema browser
> field and the index value is incorrect in both tools.
> This is the affected charFilter:
> <charFilter class="solr.PatternReplaceCharFilterFactory"
>                     pattern="(^\w)\s[&amp;]\s(\w)"
>                     replacement="$1&amp;amp;$2" />


