hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hadoop QA (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-8597) FsShell's Text command should be able to read avro data files
Date Mon, 10 Sep 2012 22:08:07 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13452506#comment-13452506

Hadoop QA commented on HADOOP-8597:

+1 overall.  Here are the results of testing the latest attachment 
  against trunk revision .

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 1 new or modified test files.

    +1 javac.  The applied patch does not increase the total number of javac compiler warnings.

    +1 javadoc.  The javadoc tool did not generate any warning messages.

    +1 eclipse:eclipse.  The patch built with eclipse:eclipse.

    +1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) warnings.

    +1 release audit.  The applied patch does not increase the total number of release audit

    +1 core tests.  The patch passed unit tests in hadoop-common-project/hadoop-common.

    +1 contrib tests.  The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-HADOOP-Build/1429//testReport/
Console output: https://builds.apache.org/job/PreCommit-HADOOP-Build/1429//console

This message is automatically generated.
> FsShell's Text command should be able to read avro data files
> -------------------------------------------------------------
>                 Key: HADOOP-8597
>                 URL: https://issues.apache.org/jira/browse/HADOOP-8597
>             Project: Hadoop Common
>          Issue Type: New Feature
>          Components: fs
>    Affects Versions: 2.0.0-alpha
>            Reporter: Harsh J
>            Assignee: Ivan Vladimirov Ivanov
>              Labels: newbie
>         Attachments: HADOOP-8597-2.patch, HADOOP-8597.patch, HADOOP-8597.patch, HADOOP-8597.patch
> Similar to SequenceFiles are Apache Avro's DataFiles. Since these are getting popular
as a data format, perhaps it would be useful if {{fs -text}} were to add some support for
reading it, like it reads SequenceFiles. Should be easy since Avro is already a dependency
and provides the required classes.
> Of discussion is the output we ought to emit. Avro DataFiles aren't simple as text, nor
have they the singular Key-Value pair structure of SequenceFiles. They usually contain a set
of fields defined as a record, and the usual text emit, as available from avro-tools via http://avro.apache.org/docs/current/api/java/org/apache/avro/tool/DataFileReadTool.html,
is in proper JSON format.
> I think we should use the JSON format as the output, rather than a delimited form, for
there are many complex structures in Avro and JSON is the easiest and least-work-to-do way
to display it (Avro supports json dumping by itself).

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

View raw message