Return-Path: Delivered-To: apmail-hadoop-mapreduce-dev-archive@minotaur.apache.org Received: (qmail 76607 invoked from network); 9 Aug 2010 17:52:42 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 9 Aug 2010 17:52:42 -0000 Received: (qmail 56396 invoked by uid 500); 9 Aug 2010 17:52:42 -0000 Delivered-To: apmail-hadoop-mapreduce-dev-archive@hadoop.apache.org Received: (qmail 56073 invoked by uid 500); 9 Aug 2010 17:52:41 -0000 Mailing-List: contact mapreduce-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-dev@hadoop.apache.org Delivered-To: mailing list mapreduce-dev@hadoop.apache.org Received: (qmail 55946 invoked by uid 99); 9 Aug 2010 17:52:41 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 09 Aug 2010 17:52:40 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 09 Aug 2010 17:52:38 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o79HqHPZ014294 for ; Mon, 9 Aug 2010 17:52:17 GMT Message-ID: <12067933.237811281376337208.JavaMail.jira@thor> Date: Mon, 9 Aug 2010 13:52:17 -0400 (EDT) From: "David Rosenstrauch (JIRA)" To: mapreduce-dev@hadoop.apache.org Subject: [jira] Created: (MAPREDUCE-2001) Enhancement to SequenceFileOutputFormat to allow user to set MetaData MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org Enhancement to SequenceFileOutputFormat to allow user to set MetaData --------------------------------------------------------------------- Key: MAPREDUCE-2001 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2001 Project: Hadoop Map/Reduce Issue Type: Improvement Affects Versions: 0.20.2 Reporter: David Rosenstrauch Priority: Minor The org.apache.hadoop.mapreduce.lib.output.SequenceFileOutputFormat class currently does not provide a way for the user to pass in a MetaData object to be written to the SequenceFile. Currently he only way for a developer to implement this functionality appears to be to create a subclass which overrides the SequenceFileOutputFormat's getRecordWriter() method, which is a bit of a kludge. This seems to be a common enough request to warrant a fix of some sort. (It's already been brought up twice in the past year: http://www.mail-archive.com/common-user@hadoop.apache.org/msg02198.html and http://www.mail-archive.com/mapreduce-user@hadoop.apache.org/msg00904.html) A couple of possible solutions: 1) provide a static method SequenceFileOutputFormat.setMetaData(Job, MetaData) 2) Provide a (non-static) setMetaData() method on the SequenceFileOutputFormat class. The user would create a subclass of SequenceFileOutputFormat which, say, implements Configurable. Then in the setConf() method, the user could create the MetaData object (using data from the Configuration), and then call setMetaData. The SequenceFileOutputFormat would then use this MetaData object when creating the SequenceFile. (Note that the user would have to create a subclass of SequenceFileOutputFormat to make this solution work.) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.