avro-dev mailing list archives

From "Mike Hurley (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (AVRO-1862) AvroOutputFormat saves compressed avro files without respecting codec's default extension
Date Wed, 13 Jul 2016 18:06:20 GMT

    [ https://issues.apache.org/jira/browse/AVRO-1862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15375461#comment-15375461 ]

Mike Hurley commented on AVRO-1862:
-----------------------------------

Is this really that useful? For instance, with zip files I would never name a file ".stored.zip"
or ".deflate.zip". Some people use names like ".h264.mp4" for videos, but that seems kind
of rare. I guess it really doesn't hurt, since it's a new feature that somebody explicitly
has to opt in to.

> AvroOutputFormat saves compressed avro files without respecting codec's default extension
> -----------------------------------------------------------------------------------------
>
>                 Key: AVRO-1862
>                 URL: https://issues.apache.org/jira/browse/AVRO-1862
>             Project: Avro
>          Issue Type: Improvement
>          Components: java
>            Reporter: Piotr Wikieł
>            Priority: Minor
>         Attachments: AVRO-1862-1.patch, AVRO-1862.patch
>
>
> A common pattern in naming compressed files is to give them an extension derived from the
> compression codec, for example: {{.gz}}, {{.zip}}, {{.bz2}}.
> {{AvroOutputFormat}} currently does not respect this convention.
> I've adapted some code from Hadoop's {{TextOutputFormat}} in a backward-compatible manner,
> adding the following {{JobConf}} property:
> {{avro.mapred.output.extension.from-codec}} ({{boolean}}, default: {{false}}) - when
> set to {{true}}, the extension will be changed according to the above rule.
> EDIT: Please take a look at the first comment for an update. The file extension will be
> something like {{.gz.avro}} or {{.snappy.avro}} when the above property is set to {{true}}.
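
The naming rule described above can be illustrated with a small, dependency-free Java sketch. It is not the code from the attached patches: the class and method names (ExtensionSketch, outputExtension) are hypothetical, and the codec extension is passed in as a plain string rather than looked up from a Hadoop codec. Only the property name and the resulting {{.gz.avro}} / {{.snappy.avro}} pattern come from the issue text.

```java
// Illustrative sketch only; not the implementation from AVRO-1862.patch.
class ExtensionSketch {
    // Property name as given in the issue description.
    static final String EXT_FROM_CODEC_KEY = "avro.mapred.output.extension.from-codec";
    // Plain Avro file extension used when the option is off.
    static final String AVRO_EXT = ".avro";

    // Hypothetical helper: returns the file extension to use.
    // extFromCodec   - value of the boolean property (default false)
    // codecExtension - the codec's default extension, e.g. ".gz" or ".snappy"
    static String outputExtension(boolean extFromCodec, String codecExtension) {
        if (!extFromCodec || codecExtension == null || codecExtension.isEmpty()) {
            // Backward-compatible behaviour: keep the bare ".avro" extension.
            return AVRO_EXT;
        }
        // Opt-in behaviour: prefix ".avro" with the codec's extension.
        return codecExtension + AVRO_EXT;
    }

    public static void main(String[] args) {
        System.out.println(outputExtension(false, ".snappy")); // .avro
        System.out.println(outputExtension(true, ".snappy"));  // .snappy.avro
        System.out.println(outputExtension(true, ".gz"));      // .gz.avro
    }
}
```

In a real job the flag would presumably be set on the {{JobConf}} (for example via setBoolean with the property name above), but how the patch reads it and resolves the codec extension is not shown in this message.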



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
