avro-user mailing list archives

From Sean Busbey <bus...@cloudera.com>
Subject Re: Is Avro Splittable?
Date Fri, 26 Jun 2015 14:11:39 GMT
Avro Container Files are always splittable[1]. They're the way you will
commonly interact with Avro serialized data.

Data serialized with Avro's raw binary encoding is not splittable by itself,
because the encoding includes no markers[2] that a reader could use to find
a record boundary from an arbitrary offset. This may be the source of the
disconnect you're finding in online docs.
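
To make the distinction concrete, here is a rough Python sketch of the idea,
not the real Avro library. It assumes records with schema "long", a fixed
16-byte sync marker, and no compression codec, and it replaces the real file
header (magic bytes plus metadata) with a bare sync marker. The names
write_file and read_split are made up for illustration.

```python
SYNC_SIZE = 16


def encode_long(n: int) -> bytes:
    """Avro long: zig-zag, then variable-length base-128 encoding."""
    z = (n << 1) ^ (n >> 63)
    out = bytearray()
    while z & ~0x7F:
        out.append((z & 0x7F) | 0x80)
        z >>= 7
    out.append(z)
    return bytes(out)


def decode_long(buf: bytes, pos: int):
    """Decode one long starting at pos; return (value, next_pos)."""
    shift = 0
    z = 0
    while True:
        b = buf[pos]
        pos += 1
        z |= (b & 0x7F) << shift
        if not b & 0x80:
            break
        shift += 7
    return (z >> 1) ^ -(z & 1), pos


def write_file(blocks, sync: bytes) -> bytes:
    """Simplified container: each block is count, byte size, records, sync."""
    out = bytearray(sync)  # stand-in for the header, which ends with the sync marker
    for records in blocks:
        payload = b"".join(encode_long(r) for r in records)
        out += encode_long(len(records)) + encode_long(len(payload)) + payload + sync
    return bytes(out)


def read_split(data: bytes, sync: bytes, start: int, end: int):
    """Read every block whose preceding sync marker begins in [start, end)."""
    records = []
    idx = data.find(sync, start)  # scan forward to the next sync marker
    while idx != -1 and idx < end and idx + SYNC_SIZE < len(data):
        pos = idx + SYNC_SIZE
        count, pos = decode_long(data, pos)
        size, pos = decode_long(data, pos)
        block_end = pos + size
        for _ in range(count):
            value, pos = decode_long(data, pos)
            records.append(value)
        if data[block_end:block_end + SYNC_SIZE] != sync:
            raise ValueError("missing sync marker after block")
        idx = block_end
    return records


sync = bytes(range(0xA0, 0xB0))  # real files use a randomly generated marker
blocks = [[1, 2, 3], [4, 5], [6, 7, 8, 9]]
data = write_file(blocks, sync)

# Two readers given arbitrary byte ranges together see every record once.
cut = len(data) // 2
first = read_split(data, sync, 0, cut)
second = read_split(data, sync, cut, len(data))

# Raw binary encoding has nothing to scan for: starting mid-value
# silently decodes the tail byte of 64 as a different number.
raw = encode_long(1) + encode_long(64)
bogus, _ = decode_long(raw, 2)
```

Each reader skips ahead to the next sync marker and stops once it passes the
end of its range, so the cut point does not need to line up with a block
boundary. With the raw encoding there is no equivalent marker to look for.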

[1]: http://avro.apache.org/docs/1.7.7/spec.html#Object+Container+Files
[2]: http://avro.apache.org/docs/1.7.7/spec.html#Data+Serialization

On Thu, Jun 25, 2015 at 12:54 AM, Ankur Jain <ankur.jain@yash.com> wrote:

>  Hello,
> I am reading various forums and docs; some say that Avro is splittable
> and others say that it is not.
> So which one is right?
> Regards,
> Ankur