From dev-return-38651-archive-asf-public=cust-asf.ponee.io@ignite.apache.org Tue Sep 4 16:16:19 2018 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id 82DA0180629 for ; Tue, 4 Sep 2018 16:16:18 +0200 (CEST) Received: (qmail 7552 invoked by uid 500); 4 Sep 2018 14:16:17 -0000 Mailing-List: contact dev-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ignite.apache.org Delivered-To: mailing list dev@ignite.apache.org Received: (qmail 7541 invoked by uid 99); 4 Sep 2018 14:16:17 -0000 Received: from mail-relay.apache.org (HELO mailrelay1-lw-us.apache.org) (207.244.88.152) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Sep 2018 14:16:17 +0000 Received: from mail-oi0-f41.google.com (mail-oi0-f41.google.com [209.85.218.41]) by mailrelay1-lw-us.apache.org (ASF Mail Server at mailrelay1-lw-us.apache.org) with ESMTPSA id 81654D26 for ; Tue, 4 Sep 2018 14:16:16 +0000 (UTC) Received: by mail-oi0-f41.google.com with SMTP id k12-v6so6958952oiw.8 for ; Tue, 04 Sep 2018 07:16:16 -0700 (PDT) X-Gm-Message-State: APzg51AZlefMX8Fw7vdheLlf4/W2fBtB7hQVC7vfW8MZTx7vklWifNSM 0RJcC1WbysncvRK5TpeC8W7oelflP+xs/SBqSa/sCw== X-Google-Smtp-Source: ANB0VdaAnGwpag0WnXi4b8LSTK32S/jZDHvNtiN7jSykoRgF3azViu5nH9WQzxXJoBWh0batPYj+UGAtt2IfzhC9bGc= X-Received: by 2002:aca:5a45:: with SMTP id o66-v6mr10352222oib.155.1536070575739; Tue, 04 Sep 2018 07:16:15 -0700 (PDT) MIME-Version: 1.0 Received: by 2002:ac9:2927:0:0:0:0:0 with HTTP; Tue, 4 Sep 2018 07:15:34 -0700 (PDT) In-Reply-To: References: <1535996189536-0.post@n4.nabble.com> From: Dmitriy Setrakyan Date: Tue, 4 Sep 2018 07:15:34 -0700 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Compression prototype To: dev Content-Type: multipart/alternative; boundary="000000000000047b3605750c4da7" --000000000000047b3605750c4da7 Content-Type: text/plain; charset="UTF-8" On Tue, Sep 4, 2018 at 2:55 AM, Ilya Kasnacheev wrote: > Hello! > > Each node has a local dictionary (per node currently, per cache planned). > Dictionary is never shared between nodes. As data patterns shift, > dictionary rotation is also planned. > > With Zstd, the best dictionary size seems to be 1024 bytes. I imagine It is > enough to store common BinaryObject boilerplate, and everything else is > compressed on the fly. The source sample is 16k records. > > Thanks, Ilya, understood. I think per-cache is a better idea. However, I have a question about dictionary size. Ignite stores TBs of data. How do you plan the dictionary to fit in 1K bytes? D. --000000000000047b3605750c4da7--