flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stephan Ewen <se...@apache.org>
Subject Re: Efficient datatypes?
Date Thu, 19 Feb 2015 10:11:42 GMT
Hey!

All data types are always kept serialized for caching/hashing/sorting.
Deserialization is sometimes needed in the internal algorithms (on hash
collisions and sort-prefix collisions). The most efficient data types for
that are actually Tuples. POJOs and other data types are a little less
efficient for that, but we are trying to improve that.

Stephan


On Thu, Feb 19, 2015 at 10:23 AM, Kruse, Sebastian <Sebastian.Kruse@hpi.de>
wrote:

>  Hi everyone,
>
>
>
> I think that during one of the meetups, it was mentioned that Flink can in
> some cases operate on serialized data. Given I understood that correctly,
> which cases that would be, i.e, which data types and operators support such
> a feature?
>
>
>
> Cheers,
>
> Sebastian
>
>
>
>
>
> ---
>
> Sebastian Kruse
>
> Doktorand am Fachbereich Information Systems Group
>
> Hasso-Plattner-Institut an der Universität Potsdam
> Prof.-Dr.-Helmert-Str. 2-3, D-14482 Potsdam
> Tel +49 331 5509 240
>
> Amtsgericht Potsdam, HRB 12184
> Geschäftsführung: Prof. Dr. Christoph Meinel
>
>
>
>
>

Mime
View raw message