hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jens Rabe (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HADOOP-11561) Join multiple files on the fly and read the records in order
Date Sat, 07 Feb 2015 14:20:34 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-11561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jens Rabe updated HADOOP-11561:
-------------------------------
    Description: In a scenario where there are many files which all share the same key/value
types, e.g., when dealing with measured data from sensors, it should be possible to chain-load
multiple files. That means, there should be a reader which can be supplied with one or more
directories containing files, and it should be possible to read the records of all files in
order.  (was: In a scenario where there are many MapFiles which all share the same key/value
types, e.g., when dealing with measured data from sensors, it should be possible to chain-load
multiple MapFiles. That means, there should be a reader which can be supplied with one or
more directories containing MapFiles, and it should be possible to read the records of all
files in order.)
        Summary: Join multiple files on the fly and read the records in order  (was: It should
be possible to chain-load multiple MapFiles on the fly and read the records in an ascending
order)

> Join multiple files on the fly and read the records in order
> ------------------------------------------------------------
>
>                 Key: HADOOP-11561
>                 URL: https://issues.apache.org/jira/browse/HADOOP-11561
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Jens Rabe
>            Assignee: Jens Rabe
>            Priority: Minor
>              Labels: composite
>         Attachments: HADOOP-11561.patch
>
>   Original Estimate: 96h
>  Remaining Estimate: 96h
>
> In a scenario where there are many files which all share the same key/value types, e.g.,
when dealing with measured data from sensors, it should be possible to chain-load multiple
files. That means, there should be a reader which can be supplied with one or more directories
containing files, and it should be possible to read the records of all files in order.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message