avro-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doug Cutting <cutt...@apache.org>
Subject Re: Avro and Hadoop streaming
Date Fri, 03 Jun 2011 08:43:35 GMT
Miki,

Have you looked at AvroAsTextInputFormat?

http://avro.apache.org/docs/current/api/java/org/apache/avro/mapred/AvroAsTextInputFormat.html

Also, release 1.5.2 will include AvroTextOutputFormat:

https://issues.apache.org/jira/browse/AVRO-830

Are these perhaps what you're looking for?

Doug

On 06/02/2011 11:30 PM, Miki Tebeka wrote:
> Greetings,
> 
> I'd like to use hadoop streaming with Avro files.
> My plan is to write an inputformat class that emits json records, one
> per line. This way the streaming application can read one record per
> line.
> (http://hadoop.apache.org/common/docs/r0.15.2/streaming.html#Specifying+Other+Plugins+for+Jobs)
> 
> I couldn't find any documentation/help about writing inputformat
> classes. Can someone point me to the right direction?
> 
> Thanks,
> --
> Miki

Mime
View raw message