avro-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Piotr Wikieł (JIRA) <j...@apache.org>
Subject [jira] [Updated] (AVRO-1862) AvroOutputFormat saves compressed avrò files without respecting codec's default extension
Date Wed, 15 Jun 2016 07:29:09 GMT

     [ https://issues.apache.org/jira/browse/AVRO-1862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Piotr Wikieł updated AVRO-1862:
-------------------------------
    Attachment: AVRO-1862.patch

> AvroOutputFormat saves compressed avrò files without respecting codec's default extension
> -----------------------------------------------------------------------------------------
>
>                 Key: AVRO-1862
>                 URL: https://issues.apache.org/jira/browse/AVRO-1862
>             Project: Avro
>          Issue Type: Improvement
>          Components: java
>            Reporter: Piotr Wikieł
>            Priority: Minor
>         Attachments: AVRO-1862.patch
>
>
> Common pattern in naming compressed files is giving them extension derived from compression
codec, for example: {.gz}, {.zip}, {.bz2}. 
> {AvroOutputFormat} currently does not respect this convention. 
> I've adapted some code from Hadoop's {TextOutputFormat} in backward-compatible manner
adding following {JobConf} property:
> {avro.mapred.output.extension.from-codec} ({boolean}, default: {false}) - when set to
{true}, extension will be changed according to above rule.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message