lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Chantal Ackermann (JIRA)" <j...@apache.org>
Subject [jira] Created: (SOLR-1498) RegexTransformer: sourceColName version not handling multiValued fields correctly
Date Thu, 08 Oct 2009 14:44:32 GMT
RegexTransformer: sourceColName version not handling multiValued fields correctly
---------------------------------------------------------------------------------

                 Key: SOLR-1498
                 URL: https://issues.apache.org/jira/browse/SOLR-1498
             Project: Solr
          Issue Type: Bug
          Components: contrib - DataImportHandler
    Affects Versions: 1.4
         Environment: Windows XP, JDK 6, Tomcat 6
Linux (RedHat), JDK, Tomcat 5
            Reporter: Chantal Ackermann


Versions in use/compared:
Solr 1.3
(Nightly 5th August)
Nightly 22nd September

As RegexTransformer is not different between the two nightlies, the
issue probably appeared before.

ISSUE:
Using RegexTransformer with the 'sourceColName' notation will not populate
multiValued (actually containing multiple values) fields with a list but
instead add only one value per document.

The version with 'groupNames' does.

worked for 1.3 (regression):
<field column="participant" sourceColName="person" regex="([^\|]+)\|.*" />
<field column="role" sourceColName="person"
regex="[^\|]+\|\d+,\d+,\d+,(.*)" />


works for nightly 22nd Sept:
<field column="person" groupNames="participant,role"
regex="([^\|]+)\|\d+,\d+,\d+,(.*)" />

(Both fields are of type solr.StrField and multiValued.)


Comparing the source code of RegexTransformer 1.3 vs. 22nd Sept, I found:

for (Object result : results)
         row.put(col, result);

(lines 106-107 of transformRow() 22nd of Sept)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message