Return-Path: X-Original-To: apmail-flink-user-archive@minotaur.apache.org Delivered-To: apmail-flink-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7F4D0192C5 for ; Tue, 12 Apr 2016 16:21:58 +0000 (UTC) Received: (qmail 33623 invoked by uid 500); 12 Apr 2016 16:21:57 -0000 Delivered-To: apmail-flink-user-archive@flink.apache.org Received: (qmail 33512 invoked by uid 500); 12 Apr 2016 16:21:57 -0000 Mailing-List: contact user-help@flink.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@flink.apache.org Delivered-To: mailing list user@flink.apache.org Received: (qmail 33502 invoked by uid 99); 12 Apr 2016 16:21:57 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Apr 2016 16:21:57 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id D56A4C0224 for ; Tue, 12 Apr 2016 16:21:56 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.002 X-Spam-Level: X-Spam-Status: No, score=-0.002 tagged_above=-999 required=6.31 tests=[RCVD_IN_DNSWL_NONE=-0.0001, RP_MATCHES_RCVD=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id 7H5dG9Iz2qXS for ; Tue, 12 Apr 2016 16:21:54 +0000 (UTC) Received: from zimbra.tngtech.com (zimbra.tngtech.com [212.204.93.107]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 99E415FAEA for ; Tue, 12 Apr 2016 16:21:54 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by zimbra.tngtech.com (Postfix) with ESMTP id 5FAABB00210 for ; Tue, 12 Apr 2016 18:21:47 +0200 (CEST) Received: from zimbra.tngtech.com ([127.0.0.1]) by localhost (zimbra.tngtech.com [127.0.0.1]) (amavisd-new, port 10032) with ESMTP id 9mqW0Injj5OD; Tue, 12 Apr 2016 18:21:47 +0200 (CEST) Received: from localhost (localhost [127.0.0.1]) by zimbra.tngtech.com (Postfix) with ESMTP id 14827B0051A; Tue, 12 Apr 2016 18:21:47 +0200 (CEST) X-Virus-Scanned: amavisd-new at tngtech.com Received: from zimbra.tngtech.com ([127.0.0.1]) by localhost (zimbra.tngtech.com [127.0.0.1]) (amavisd-new, port 10026) with ESMTP id Pau5MArDc8gf; Tue, 12 Apr 2016 18:21:47 +0200 (CEST) Received: from [192.168.43.179] (ip-109-43-1-102.web.vodafone.de [109.43.1.102]) by zimbra.tngtech.com (Postfix) with ESMTPSA id A9342B00210; Tue, 12 Apr 2016 18:21:46 +0200 (CEST) To: user@flink.apache.org Cc: Michael Pisula From: Konstantin Knauf Subject: RocksDB Statebackend Message-ID: <570D2097.8040502@tngtech.com> Date: Tue, 12 Apr 2016 18:21:43 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.6.0 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Hi everyone, my experience with RocksDBStatebackend have left me a little bit confused. Maybe you guys can confirm that my epxierence is the expected behaviour ;): I have run a "performancetest" twice, once with FsStateBackend and once RocksDBStatebackend in comparison. In this particular test the state saved is generally not large (in a production scenario it will be larger.) These are my observations: 1. Minimal Checkpoint Size (no records) with RocksDB was 33MB compared to <<1MB with the FSStatebackend. 2. Throughput dropped from 28k/s -> 18k/s on a small cluster. 3. Checkpoint sizes as reported in the Dashboard was ca. 1MB for FsStatebackend but >100MB for RocksDbStatebackend. I hope the difference gets smaller for very large state. Can you confirm? 4. Checkpointing Times as reported in the Dashboard were 26secs for RocksDB during the test and <1 second for FsStatebackend. Does the reported time correspond to the sync. + asynchronous part of the checkpointing in case of RocksDB? Is there any way to tell how long the synchronous part takes? Form these first observations RocksDB does seem to bring a large overhead for state < 1GB, I guess? Is this expected? Cheers, Konstantin