lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "James Dyer (JIRA)" <>
Subject [jira] [Created] (SOLR-2549) DIH LineEntityProcessor support for delimited & fixed-width files
Date Thu, 26 May 2011 21:53:47 GMT
DIH LineEntityProcessor support for delimited & fixed-width files

                 Key: SOLR-2549
             Project: Solr
          Issue Type: Improvement
          Components: contrib - DataImportHandler
    Affects Versions: 4.0
            Reporter: James Dyer
            Priority: Minor
         Attachments: SOLR-2549.patch

Provides support for Fixed Width and Delimited Files without needing to write a Transformer.

The following xml properties are supported with this version of LineEntityProcessor:

For fixed width files:
 - colDef[#]

For Delimited files:
 - fieldDelimiterRegex
 - firstLineHasFieldnames
 - delimitedFieldNames
 - delimitedFieldTypes

These properties are described in the api documentation.  See patch.

When combined with the cache improvements from SOLR-2382 this allows you to join a flat file
entity with other entities (sql, etc).

This message is automatically generated by JIRA.
For more information on JIRA, see:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message