flink-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aljoscha Krettek (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (FLINK-6022) Don't serialise Schema when serialising Avro GenericRecord
Date Wed, 08 Nov 2017 12:41:12 GMT

     [ https://issues.apache.org/jira/browse/FLINK-6022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Aljoscha Krettek updated FLINK-6022:
------------------------------------
    Summary: Don't serialise Schema when serialising Avro GenericRecord  (was: Improve support
for Avro GenericRecord)

> Don't serialise Schema when serialising Avro GenericRecord
> ----------------------------------------------------------
>
>                 Key: FLINK-6022
>                 URL: https://issues.apache.org/jira/browse/FLINK-6022
>             Project: Flink
>          Issue Type: Improvement
>          Components: Type Serialization System
>            Reporter: Robert Metzger
>            Assignee: Stephan Ewen
>            Priority: Blocker
>             Fix For: 1.4.0
>
>
> Currently, Flink is serializing the schema for each Avro GenericRecord in the stream.
> This leads to a lot of overhead over the wire/disk + high serialization costs.
> Therefore, I'm proposing to improve the support for GenericRecord in Flink by shipping
the schema to each serializer  through the AvroTypeInformation.
> Then, we can only support GenericRecords with the same type per stream, but the performance
will be much better.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message