Date: Wed, 24 Sep 2008 11:59:44 -0700 (PDT)
From: "Owen O'Malley (JIRA)"
To: core-dev@hadoop.apache.org
Reply-To: core-dev@hadoop.apache.org
Subject: [jira] Commented: (HADOOP-3315) New binary file format

    [ https://issues.apache.org/jira/browse/HADOOP-3315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12634254#action_12634254 ]

Owen O'Malley commented on HADOOP-3315:
---------------------------------------

I think
that a reasonable path to getting this committed is to make it useful to map/reduce. Specifically, that would require:

  1. Support for user comparators.
  2. An ObjectFile class that extends (or wraps) TFile and uses the serialization libraries to provide objects.
  3. {In,Out}putFormats that use ObjectFile.
  4. A test case that has a map/reduce job that reads and writes ObjectFiles.

> New binary file format
> ----------------------
>
>                 Key: HADOOP-3315
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3315
>             Project: Hadoop Core
>          Issue Type: New Feature
>          Components: io
>            Reporter: Owen O'Malley
>            Assignee: Amir Youssefi
>         Attachments: HADOOP-3315_20080908_TFILE_PREVIEW_WITH_LZO_TESTS.patch, HADOOP-3315_20080915_TFILE.patch, TFile Specification Final.pdf
>
>
> SequenceFile's block compression format is too complex and requires 4 codecs to compress or decompress. It would be good to have a file format that only needs

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.