flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Anirudh Mallem <anirudh.mal...@247-inc.com>
Subject Re: Query regarding state backend for Custom Map Function
Date Thu, 01 Dec 2016 19:29:42 GMT
Thanks a lot Stefan. I got what I was looking for. Is the MapState functionality coming as
a part of the 1.2 release?

From: Stefan Richter
Reply-To: "user@flink.apache.org<mailto:user@flink.apache.org>"
Date: Thursday, December 1, 2016 at 2:53 AM
To: "user@flink.apache.org<mailto:user@flink.apache.org>"
Subject: Re: Query regarding state backend for Custom Map Function


using the ValueState and RocksDB to store a map inside the value state means that you will
have a different map for each key, which is automatically swapped on a per record basis, depending
on the record’s key. If you are using a map and Checkpointed, there is only one map and
your code is responsible for dispatching state between different keys.

If you use a map and Checkpointed, the map will be on the heap and the checkpoint will go
directly against the filesystem; this is independent of the chosen backend, so no RocksDB
is involved.

On a further note, we are working on an alternative to ValueState that is like a MapState.
In contrast to ValueState, MapState does not deserialize the whole map on each access, but
can access individual key/value pairs. This might be what you are looking for.


Am 01.12.2016 um 09:35 schrieb Anirudh Mallem <anirudh.mallem@247-inc.com<mailto:anirudh.mallem@247-inc.com>>:

Hi Everyone,
I am trying to understand the Working With State feature page of the Flink documentation.
 My question is in case I am using a ValueState in my CustomMap class to store my states with
the RocksDb as my state backend then it is clear that every state value is stored in RocksDb.
Now instead of a ValueState if I just use a normal Java Hashmap to store my states and implement
the Checkpointed interface then will the entire HashMap reside on the RocksDb backend or will
the HashMap be in memory and just the snapshots sent to RocksDb? I am trying to see what will
I lose/gain if I have my own data structure to do state maintenance. Thanks.


View raw message