avro-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vyacheslav Zholudev (Created) (JIRA)" <j...@apache.org>
Subject [jira] [Created] (AVRO-1035) Add the possibility to append to existing avro files
Date Wed, 22 Feb 2012 15:35:49 GMT
Add the possibility to append to existing avro files  

                 Key: AVRO-1035
                 URL: https://issues.apache.org/jira/browse/AVRO-1035
             Project: Avro
          Issue Type: New Feature
            Reporter: Vyacheslav Zholudev

Currently it is not possible to append to avro files that were written and closed. 

Here is a Scott Carey's reply on the mailing list:
It is not possible without modifying DataFileWriter. Please open a JIRA

It could not simply append to an OutputStream, since it must either:
* Seek to the start to validate the schemas match and find the sync
marker, or
* Trust that the schemas match and find the sync marker from the last block

DataFileWriter cannot refer to Hadoop classes such as FileSystem, but we
could add something to the mapred module that takes a Path and FileSystem
and returns
something that implemements an interface that DataFileWriter can append
to.  This would be something that is both a
and an OutputStream, or has both an InputStream from the start of the
existing file and an OutputStream at the end.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


View raw message