flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Newport, Billy" <Billy.Newp...@gs.com>
Subject RE: Avro Parquet/Flink/Beam
Date Mon, 12 Dec 2016 16:29:57 GMT
I don't mind writing one, is there a fork for the ParquetIO works that's already been done
or is it in trunk?

The ParquetIO is independent of the runner being used? Is that right?

Thanks

-----Original Message-----
From: Jean-Baptiste Onofré [mailto:jb@nanthrax.net] 
Sent: Monday, December 12, 2016 11:25 AM
To: user@flink.apache.org
Subject: Re: Avro Parquet/Flink/Beam

Hi,

Beam provides a AvroCoder/AvroIO that you can use, but not yet a 
ParquetIO (I created a Jira about that and started to work on it).

You can use the Avro reader to populate the PCollection and then use a 
custom DoFn to create the Parquet (waiting for the ParquetIO).

Regards
JB

On 12/12/2016 05:19 PM, Newport, Billy wrote:
> Are there any examples showing the use of beam with avro/parquet and a
> flink runner? I see an avro reader for beam, is it a matter of writing
> another one for avro-parquet or does this need to use the flink
> HadoopOutputFormat for example?
>
>
>
> Thanks
>
> Billy
>
>
>

-- 
Jean-Baptiste Onofré
jbonofre@apache.org
https://urldefense.proofpoint.com/v2/url?u=http-3A__blog.nanthrax.net&d=DgID-g&c=7563p3e2zaQw0AB1wrFVgyagb2IE5rTZOYPxLxfZlX4&r=rlkM70D3djmDN7dGPzzbVKG26ShcTFDMKlX5AWucE5Q&m=wsZfFaIgCU4OQCJzjCyCLIVFFKeRBjbv4lB3kSqYRjw&s=AnmdxwKDl7BYeuvQ001GrywGxW0Kvnwtgs3ikrNou8Y&e=

Talend - https://urldefense.proofpoint.com/v2/url?u=http-3A__www.talend.com&d=DgID-g&c=7563p3e2zaQw0AB1wrFVgyagb2IE5rTZOYPxLxfZlX4&r=rlkM70D3djmDN7dGPzzbVKG26ShcTFDMKlX5AWucE5Q&m=wsZfFaIgCU4OQCJzjCyCLIVFFKeRBjbv4lB3kSqYRjw&s=5T8pN5Tz5hIpwH9uf77csajX0wJLjHzJ3kyqSzxQ2Xw&e=


Mime
View raw message