hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xiaoyong Zhu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HADOOP-3315) New binary file format
Date Wed, 16 Sep 2015 01:12:49 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14746623#comment-14746623

Xiaoyong Zhu commented on HADOOP-3315:

Hi guys,

We want to check if there's a TFile parser in non-JVM languages? For example in Python/.NET,
etc., or anyone has some prototypes that we could reference...Our scenario is that we want
to parse the TFile in non-Hadoop machines offline - besides what I mentioned above, do you
have any suggestions? Also, which file should we look at for the implementation details....?


> New binary file format
> ----------------------
>                 Key: HADOOP-3315
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3315
>             Project: Hadoop Common
>          Issue Type: New Feature
>          Components: io
>            Reporter: Owen O'Malley
>            Assignee: Hong Tang
>             Fix For: 0.20.1
>         Attachments: HADOOP-3315_20080908_TFILE_PREVIEW_WITH_LZO_TESTS.patch, HADOOP-3315_20080915_TFILE.patch,
TFile Specification 20081217.pdf, hadoop-3315-0507.patch, hadoop-3315-0509-2.patch, hadoop-3315-0509.patch,
hadoop-3315-0513.patch, hadoop-3315-0514.patch, hadoop-3315-0601.patch, hadoop-3315-0602.patch,
hadoop-3315-0605.patch, hadoop-3315-0612.patch, hadoop-3315-0623-2.patch, hadoop-3315-0701-yhadoop-20.patch,
hadoop-3315-0710-1-hadoop-20.patch, hadoop-trunk-tfile.patch, hadoop-trunk-tfile.patch
> SequenceFile's block compression format is too complex and requires 4 codecs to compress
or decompress. It would be good to have a file format that only needs 

This message was sent by Atlassian JIRA

View raw message