Return-Path: Delivered-To: apmail-hadoop-mapreduce-issues-archive@minotaur.apache.org Received: (qmail 52339 invoked from network); 29 Oct 2009 17:42:23 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 29 Oct 2009 17:42:23 -0000 Received: (qmail 99655 invoked by uid 500); 29 Oct 2009 17:42:23 -0000 Delivered-To: apmail-hadoop-mapreduce-issues-archive@hadoop.apache.org Received: (qmail 99595 invoked by uid 500); 29 Oct 2009 17:42:23 -0000 Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-issues@hadoop.apache.org Delivered-To: mailing list mapreduce-issues@hadoop.apache.org Received: (qmail 99562 invoked by uid 99); 29 Oct 2009 17:42:23 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Oct 2009 17:42:23 +0000 X-ASF-Spam-Status: No, hits=-10.5 required=5.0 tests=AWL,BAYES_00,RCVD_IN_DNSWL_HI X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 29 Oct 2009 17:42:19 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id D088E234C4AE for ; Thu, 29 Oct 2009 10:41:59 -0700 (PDT) Message-ID: <688146464.1256838119852.JavaMail.jira@brutus> Date: Thu, 29 Oct 2009 17:41:59 +0000 (UTC) From: "Hudson (JIRA)" To: mapreduce-issues@hadoop.apache.org Subject: [jira] Commented: (MAPREDUCE-1017) Compression and output splitting for Sqoop In-Reply-To: <1231720017.1253579475993.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/MAPREDUCE-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12771501#action_12771501 ] Hudson commented on MAPREDUCE-1017: ----------------------------------- Integrated in Hadoop-Mapreduce-trunk #127 (See [http://hudson.zones.apache.org/hudson/job/Hadoop-Mapreduce-trunk/127/]) > Compression and output splitting for Sqoop > ------------------------------------------ > > Key: MAPREDUCE-1017 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1017 > Project: Hadoop Map/Reduce > Issue Type: New Feature > Components: contrib/sqoop > Reporter: Aaron Kimball > Assignee: Aaron Kimball > Fix For: 0.22.0 > > Attachments: MAPREDUCE-1017.2.patch, MAPREDUCE-1017.3.patch, MAPREDUCE-1017.4.patch, MAPREDUCE-1017.patch > > > Sqoop "direct mode" writing will generate a single large text file in HDFS. It is important to be able to compress this data before it reaches HDFS. Due to the difficulty in splitting compressed files in HDFS for use by MapReduce jobs, data should also be split at compression time. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.