Date: Sat, 29 Jun 2013 00:01:20 +0000 (UTC)
From: "Josh Wills (JIRA)"
To: crunch-dev@incubator.apache.org
Reply-To: dev@crunch.apache.org
Subject: [jira] [Updated] (CRUNCH-228) FileTargetImpl cuts off extensions of output files

     [ https://issues.apache.org/jira/browse/CRUNCH-228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Josh Wills updated CRUNCH-228:
------------------------------

    Attachment: CRUNCH-229.patch

The patch I came up with, which does the error reporting in MSCROutputHandler.

> FileTargetImpl cuts off extensions of output files
> --------------------------------------------------
>
>                 Key: CRUNCH-228
>                 URL: https://issues.apache.org/jira/browse/CRUNCH-228
>             Project: Crunch
>          Issue Type: Bug
>            Reporter: Dave Beech
>         Attachments: CRUNCH-228.patch, CRUNCH-228.patch, CRUNCH-229.patch
>
>
> Compressed files written by MapReduce often have extensions, e.g. '.deflate', '.gz' or '.snappy'. Crunch currently cuts off these extensions when it moves output files to their final destination, which is fine in some circumstances but causes problems in others.
> For example, running 'hadoop fs -text myfile.deflate' shows the decompressed text on screen, but running 'hadoop fs -text myfile' on a deflate-compressed file with no extension prints unreadable compressed data instead.
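
To illustrate the behavior the issue asks for, here is a minimal sketch of carrying a source file's extension over to its destination name. It is not the code from any of the attached patches and not Crunch's actual FileTargetImpl logic: the class OutputFileNames, both helper methods, and the sample file names are hypothetical, chosen only for illustration.

```java
// Hypothetical helper, not part of Crunch: shows one way to keep a
// compression extension such as ".deflate" when renaming an output file.
public final class OutputFileNames {
    private OutputFileNames() {}

    /** Returns the extension of fileName including the leading dot, or "" if there is none. */
    public static String extensionOf(String fileName) {
        int dot = fileName.lastIndexOf('.');
        // A dot at index 0 marks a hidden file rather than an extension.
        return dot > 0 ? fileName.substring(dot) : "";
    }

    /** Builds the destination name from destBaseName, keeping the source file's extension. */
    public static String destinationName(String sourceName, String destBaseName) {
        return destBaseName + extensionOf(sourceName);
    }

    public static void main(String[] args) {
        // Sample names are assumptions made for this example.
        System.out.println(destinationName("out0-r-00000.deflate", "part-r-00000"));
        System.out.println(destinationName("out0-r-00000", "part-r-00000"));
    }
}
```

With the extension preserved, a tool that picks a decompression codec from the file name, as 'hadoop fs -text' does in the example quoted above, can still recognize the moved file.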