avro-user mailing list archives

From Russell Jurney <russell.jur...@gmail.com>
Subject Re: Can serialized Avro records be efficiently compared without deserializing?
Date Tue, 22 May 2012 21:43:26 GMT

I need this kind of access too, so I can roll back Avro records that fail to
finish writing when Python dies from a UTF error.

Russell Jurney http://datasyndrome.com

On May 22, 2012, at 1:22 PM, Jonathan Coveney <jcoveney@gmail.com> wrote:

> Imagine I use Avro to serialize an object (without loss of generality, let's say an array
> of longs). I'm curious whether it is possible to compare those arrays without deserializing,
> i.e. look at the bytes in memory or on disk and do the comparison based on those bytes (i.e.
> the raw comparison that Hadoop does in the shuffle sort).
> I poked around the documentation but wasn't sure where to look.
> Thanks for your help!
> Jon
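
For the array-of-longs case in the question, here is a rough sketch of what a raw comparison could look like. It is not Avro's own code: the function names are made up for illustration, and it assumes the standard Avro binary encoding (longs as zigzag varints, arrays as counted blocks ending in a zero count). It walks both byte buffers in step and stops at the first differing element, so neither array is ever built in memory. The Java library appears to offer the same idea as BinaryData.compare, but check that against the version you use.

```python
def write_long(n, out):
    """Append one long in Avro's zigzag varint encoding (helper for the demo)."""
    raw = (n << 1) ^ (n >> 63)  # zigzag: small magnitudes become small unsigned ints
    while raw & ~0x7F:
        out.append((raw & 0x7F) | 0x80)
        raw >>= 7
    out.append(raw)


def encode_long_array(values):
    """Encode a list of longs as a single-block Avro array (helper for the demo)."""
    out = bytearray()
    if values:
        write_long(len(values), out)
        for v in values:
            write_long(v, out)
    write_long(0, out)  # a zero count terminates the array
    return bytes(out)


def read_long(buf, pos):
    """Decode one zigzag varint at buf[pos]; return (value, next_pos)."""
    raw, shift = 0, 0
    while True:
        b = buf[pos]
        pos += 1
        raw |= (b & 0x7F) << shift
        if not b & 0x80:
            break
        shift += 7
    return (raw >> 1) ^ -(raw & 1), pos


def iter_long_array(buf, pos=0):
    """Lazily yield the longs of an encoded array, one block at a time."""
    while True:
        count, pos = read_long(buf, pos)
        if count == 0:
            return
        if count < 0:
            # A negative count is followed by the block's size in bytes; skip it.
            count = -count
            _, pos = read_long(buf, pos)
        for _ in range(count):
            value, pos = read_long(buf, pos)
            yield value


_END = object()  # sentinel marking an exhausted array


def compare_long_arrays(a, b):
    """Return -1, 0, or 1, comparing element by element from the raw bytes."""
    items_a, items_b = iter_long_array(a), iter_long_array(b)
    while True:
        x = next(items_a, _END)
        y = next(items_b, _END)
        if x is _END or y is _END:
            # If one array is a prefix of the other, the shorter one sorts first.
            return (y is _END) - (x is _END)
        if x != y:
            return -1 if x < y else 1


a = encode_long_array([1, 2, 3])
b = encode_long_array([1, 2, 4])
print(compare_long_arrays(a, b))
```

Note that a plain byte-wise memcmp is not enough here: zigzag encoding maps -5 to a larger byte value than 3, so the varints have to be decoded on the fly even though the array itself is never materialized.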
