lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chantal Ackermann (JIRA)" <>
Subject [jira] Created: (SOLR-1498) RegexTransformer: sourceColName version not handling multiValued fields correctly
Date Thu, 08 Oct 2009 14:44:32 GMT
RegexTransformer: sourceColName version not handling multiValued fields correctly

                 Key: SOLR-1498
             Project: Solr
          Issue Type: Bug
          Components: contrib - DataImportHandler
    Affects Versions: 1.4
         Environment: Windows XP, JDK 6, Tomcat 6
Linux (RedHat), JDK, Tomcat 5
            Reporter: Chantal Ackermann

Versions in use/compared:
Solr 1.3
(Nightly 5th August)
Nightly 22nd September

As RegexTransformer is not different between the two nightlies, the
issue probably appeared before.

Using RegexTransformer with the 'sourceColName' notation will not populate
multiValued (actually containing multiple values) fields with a list but
instead add only one value per document.

The version with 'groupNames' does.

worked for 1.3 (regression):
<field column="participant" sourceColName="person" regex="([^\|]+)\|.*" />
<field column="role" sourceColName="person"
regex="[^\|]+\|\d+,\d+,\d+,(.*)" />

works for nightly 22nd Sept:
<field column="person" groupNames="participant,role"
regex="([^\|]+)\|\d+,\d+,\d+,(.*)" />

(Both fields are of type solr.StrField and multiValued.)

Comparing the source code of RegexTransformer 1.3 vs. 22nd Sept, I found:

for (Object result : results)
         row.put(col, result);

(lines 106-107 of transformRow() 22nd of Sept)

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message