crunch-dev mailing list archives

From "Dave Beech (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CRUNCH-228) FileTargetImpl cuts off extensions of output files
Date Fri, 28 Jun 2013 14:14:20 GMT
Dave Beech created CRUNCH-228:
---------------------------------

             Summary: FileTargetImpl cuts off extensions of output files
                 Key: CRUNCH-228
                 URL: https://issues.apache.org/jira/browse/CRUNCH-228
             Project: Crunch
          Issue Type: Bug
            Reporter: Dave Beech


Compressed files written by MapReduce often carry extensions, e.g. '.deflate', '.gz' or '.snappy'.
Crunch currently strips these extensions when it moves output files to their final
destination, which is fine in some circumstances but causes problems in others.

For example, running 'hadoop fs -text myfile.deflate' prints the decompressed text on screen,
but running 'hadoop fs -text myfile' on a deflate-compressed file with no extension prints
unreadable compressed data instead.
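
To illustrate the behaviour being asked for, here is a minimal sketch of carrying the
extension over when a destination file name is built. It is not the actual FileTargetImpl
code: the class name OutputFileNames, both method names and the 'part-NNNNN' destination
pattern are hypothetical and chosen only for this example.

```java
import java.util.Locale;

// Hypothetical helper, not part of Crunch: shows one way to keep a
// compression extension when renaming an output file.
final class OutputFileNames {
    private OutputFileNames() {}

    /** Returns the extension including the leading dot, or "" if there is none. */
    static String extensionOf(String fileName) {
        int dot = fileName.lastIndexOf('.');
        // A dot at index 0 marks a hidden file such as ".crc", not an extension.
        return dot > 0 ? fileName.substring(dot) : "";
    }

    /** Builds a destination name (assumed pattern) that keeps the source extension. */
    static String destinationName(String sourceName, int index) {
        return String.format(Locale.ROOT, "part-%05d%s", index, extensionOf(sourceName));
    }
}
```

With a helper like this, a source file named 'part-m-00003.deflate' would keep '.deflate'
in its final name, so 'hadoop fs -text' could still be run against it as in the first
example above.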

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
