Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 24186 invoked from network); 24 Sep 2008 17:06:07 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 24 Sep 2008 17:06:07 -0000 Received: (qmail 89830 invoked by uid 500); 24 Sep 2008 17:06:04 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 89417 invoked by uid 500); 24 Sep 2008 17:06:03 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 89401 invoked by uid 99); 24 Sep 2008 17:06:03 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Sep 2008 10:06:02 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Sep 2008 17:05:10 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 969E7234C226 for ; Wed, 24 Sep 2008 10:05:44 -0700 (PDT) Message-ID: <1863673739.1222275944616.JavaMail.jira@brutus> Date: Wed, 24 Sep 2008 10:05:44 -0700 (PDT) From: "Doug Cutting (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-3315) New binary file format In-Reply-To: <774412089.1209158875801.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12634201#action_12634201 ] Doug Cutting commented on HADOOP-3315: -------------------------------------- > this jira is not supposed to be a replacement for MapFile That's unfortunate. I thought that was the point of including indexes in the file, rather than in a side file. I wonder whether tfile should start out in contrib until it is more full-featured? Without support for java comparators or random access it is not yet a replacement for SequenceFile. It also doesn't yet have any inputformats, so it cannot be used from mapreduce. Nor does it yet have bindings for other programming languages. So my preference is that, until tfile is proven to be of general utility to Hadoop applications, it should live in contrib. We don't want code in core that's not both widely usable and actually used. > New binary file format > ---------------------- > > Key: HADOOP-3315 > URL: https://issues.apache.org/jira/browse/HADOOP-3315 > Project: Hadoop Core > Issue Type: New Feature > Components: io > Reporter: Owen O'Malley > Assignee: Amir Youssefi > Attachments: HADOOP-3315_20080908_TFILE_PREVIEW_WITH_LZO_TESTS.patch, HADOOP-3315_20080915_TFILE.patch, TFile Specification Final.pdf > > > SequenceFile's block compression format is too complex and requires 4 codecs to compress or decompress. It would be good to have a file format that only needs -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.