streams-dev mailing list archives

From "Steve Blackmon (JIRA)" <j...@apache.org>
Subject [jira] [Created] (STREAMS-293) allow for missing metadata fields in streams-persist-hdfs
Date Tue, 24 Feb 2015 21:42:07 GMT
Steve Blackmon created STREAMS-293:
--------------------------------------

             Summary: allow for missing metadata fields in streams-persist-hdfs
                 Key: STREAMS-293
                 URL: https://issues.apache.org/jira/browse/STREAMS-293
             Project: Streams
          Issue Type: Improvement
            Reporter: Steve Blackmon


Currently the streams-persist-hdfs writer creates (and the reader expects) exactly four columns per line.
This could be made much more flexible without much effort.

Update the reader and writer to support additional use cases:
a) files with a field delimiter other than \t
b) files with a line delimiter other than \n
c) file paths containing one JSON document per line
d) file paths containing just an id and a JSON document on each line
e) file paths containing an id, a timestamp, and a JSON document on each line
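
One way the reader side of cases (a) through (e) might look is sketched below. This is only an illustration, not the streams-persist-hdfs code: the class FlexibleLineParser, its methods, and the field names ("id", "timestamp", "json") are all hypothetical. The sketch assumes the JSON document is always the last field on a line and that the line delimiter never appears inside a document.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of streams-persist-hdfs.
final class FlexibleLineParser {
    private final String fieldDelimiter;
    private final String lineDelimiter;
    private final List<String> fields;

    // fields lists the column names in file order, e.g. ["json"] for case (c),
    // ["id", "json"] for case (d), ["id", "timestamp", "json"] for case (e).
    FlexibleLineParser(String fieldDelimiter, String lineDelimiter, List<String> fields) {
        if (fields.isEmpty()) {
            throw new IllegalArgumentException("at least one field is required");
        }
        this.fieldDelimiter = fieldDelimiter;
        this.lineDelimiter = lineDelimiter;
        this.fields = new ArrayList<>(fields);
    }

    // Parses one line into a map keyed by the configured field names.
    Map<String, String> parseLine(String line) {
        // Pattern.quote makes delimiters such as "|" literal. The split limit
        // keeps any delimiter characters inside the last field (assumed to be
        // the JSON document) from producing extra columns.
        String[] parts = line.split(Pattern.quote(fieldDelimiter), fields.size());
        if (parts.length != fields.size()) {
            throw new IllegalArgumentException(
                "expected " + fields.size() + " fields but found " + parts.length);
        }
        Map<String, String> row = new LinkedHashMap<>();
        for (int i = 0; i < parts.length; i++) {
            row.put(fields.get(i), parts[i]);
        }
        return row;
    }

    // Splits a block of text on the configured line delimiter and parses
    // every non-empty record.
    List<Map<String, String>> parseAll(String text) {
        List<Map<String, String>> rows = new ArrayList<>();
        for (String line : text.split(Pattern.quote(lineDelimiter))) {
            if (!line.isEmpty()) {
                rows.add(parseLine(line));
            }
        }
        return rows;
    }
}
```

Under these assumptions, a parser built with field delimiter "\t", line delimiter "\n", and the single field "json" would cover case (c), while adding "id" and "timestamp" to the field list would cover (d) and (e). A writer could reuse the same three settings to join fields in the configured order.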






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
