avro-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Patrick Linehan (JIRA)" <j...@apache.org>
Subject [jira] Updated: (AVRO-684) Java tool for altering the codec of an Avro data file stream.
Date Fri, 29 Oct 2010 04:51:19 GMT

     [ https://issues.apache.org/jira/browse/AVRO-684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Patrick Linehan updated AVRO-684:
---------------------------------

    Attachment: AVRO-684.patch

I've finished a first draft.  Still to be done:

* Write the test.
* Preserve file metadata.
* Implement the concatenation described by Scott Carey.

I'm assuming that for concatenation, the following would be considered reasonable behavior:

* Only the metadata from the first input file is written to the output file.
* The schema from the first input file becomes the schema of the output file.  The remaining
input file schemas only need to resolve with said schema, not be identical.

Anyway, the first draft is here in case anyone gets the urge to finish it for me :)  Otherwise
I hope to finish it in the next few weeks.

> Java tool for altering the codec of an Avro data file stream.
> -------------------------------------------------------------
>
>                 Key: AVRO-684
>                 URL: https://issues.apache.org/jira/browse/AVRO-684
>             Project: Avro
>          Issue Type: New Feature
>          Components: java
>            Reporter: Patrick Linehan
>         Attachments: AVRO-684.patch
>
>
> An example is worth a thousand words:
>   cat infile.avro | avro-tools recodec deflate - - > outfile.avro
> The above example would create a new file, "outfile.avro", with the same contents as
"infile.avro".  However, the codec of "outfile.avro" would be "deflate", regardless of the
codec of "infile.avro".
> Proposed features:
> * The tool should preserve any metadata present in the input file.
> * Supported codecs will be "deflate" and "null".
> * Optionally add support for specifying the deflation level, perhaps with syntax as follows:
 "deflate:N" where N is the deflation level, e.g. "deflate:4".
> Does this proposal sound reasonable?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message