From user-return-1358-archive-asf-public=cust-asf.ponee.io@kudu.apache.org Tue May 8 18:29:22 2018 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id 0EFC218063B for ; Tue, 8 May 2018 18:29:21 +0200 (CEST) Received: (qmail 19764 invoked by uid 500); 8 May 2018 16:29:21 -0000 Mailing-List: contact user-help@kudu.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@kudu.apache.org Delivered-To: mailing list user@kudu.apache.org Received: (qmail 19754 invoked by uid 99); 8 May 2018 16:29:20 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 May 2018 16:29:20 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 04DCAC018B for ; Tue, 8 May 2018 16:29:20 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.879 X-Spam-Level: * X-Spam-Status: No, score=1.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=cloudera.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id aZ63a01buLt7 for ; Tue, 8 May 2018 16:29:19 +0000 (UTC) Received: from mail-lf0-f42.google.com (mail-lf0-f42.google.com [209.85.215.42]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id D8D375F178 for ; Tue, 8 May 2018 16:29:18 +0000 (UTC) Received: by mail-lf0-f42.google.com with SMTP id o123-v6so46750625lfe.8 for ; Tue, 08 May 2018 09:29:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=cloudera.com; s=google; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=mesBwuwo8fNGQ8oXTxuxy8IWjhtGWJjoWxGFS2fq9+U=; b=R5Sjfd6xx9z7avB7jpxZrlrnonxp2McLUl9XCHKFMy/tZyCHNuW595KK0NnjpfukVm ZxF+bd87snb7sP/HWuaTvFq74KvKi2vHLufwke/PKI3BzpcTO9xxx1ejNkIf+LLuMu2z oD56CRtOdiIh9HGo6lL8VBA+fugtY8J/hBMTbHS66vrn3wQo+Z6GM2reWBAtSwqFQjP5 m7t6mHkDZlm1MrMahbX6iUdkiLXWoJL+U0pwH46YCUbibBGS3W5xDxn1FV3hli3duUML sf+Yn2KMA+BaeccUPjeTLR/yoHSDQA0MQ1LprjUcCGtXntjNuum/kl5nvpRmwCBquST5 hk1Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=mesBwuwo8fNGQ8oXTxuxy8IWjhtGWJjoWxGFS2fq9+U=; b=aL0A2DYs3A0OI9IGBLe4GBSC2BEh16JFPDY9QgAB0n1dHDEXlQ84BtyhMx6QcxjfoM Q+apb0/4XDdndrdZMtdcnEjmjr1XFloxufrM6I1LJlGSAli5WCeVhPSCaHlNW0JvaA9E xRLTApsf9+uXZ0JbdxShTVgCxGGYXSUimHZ1ZC5/hP5tszo3ezND3B7DCqRh1yAWpxl9 UsbYIuPO8pHQgRZs1nvibO+QN42GWv/uKNBal1sG97PTymJKFi7agK/bS7rT0dbcwLze p4+dhkNFTjZvqRCLwc+QXpj1IIurgxOOlqQLoL2YRkbi0RBXE8ULm3nDGtA4DJWD92rS d0kA== X-Gm-Message-State: ALQs6tAONj8qqcG9D7qiaf7vcDt9aU7uF/IcxeyHnduND0Q8VsG95zOg rA35CEaPa89YlsQKpoWPdxm8hWk3SCwLstpLlDVGU29N X-Google-Smtp-Source: AB8JxZoVYgrr9pgQbw6eNV+bKxlbajBvanquJbjwYqdRuba2NeJ/xL6jR9M4nS2UuTnD1h7tH7daZFHuZW5TIbL2ho4= X-Received: by 2002:a19:6d02:: with SMTP id i2-v6mr27845943lfc.81.1525796951472; Tue, 08 May 2018 09:29:11 -0700 (PDT) MIME-Version: 1.0 Received: by 10.46.82.138 with HTTP; Tue, 8 May 2018 09:28:50 -0700 (PDT) In-Reply-To: References: From: Todd Lipcon Date: Tue, 8 May 2018 09:28:50 -0700 Message-ID: Subject: Re: Column Compression and Encoding To: user@kudu.apache.org Content-Type: multipart/alternative; boundary="0000000000004af6f8056bb449b2" --0000000000004af6f8056bb449b2 Content-Type: text/plain; charset="UTF-8" On Tue, May 8, 2018 at 9:25 AM, Saeid Sattari wrote: > Hi Todd, > > Thanks for these tips. Does compressing (LZ4,..) primary key's columns > cause performance loss? > If you have a composite primary key, Kudu already creates an internal combined column for their encoded concatenation. That internal column is already automatically compressed using PREFIX_ENCODING (because it's stored sorted, this is almost always a win) and using LZ4 (because there may be compressible patterns in non-prefix components of the composite key). So, if a column is part of the PK but not the entire PK, it will only be used on the read path when that actual column is selected, and it has the same performance impact (positive or negative) as any other column in the row. -Todd -- Todd Lipcon Software Engineer, Cloudera --0000000000004af6f8056bb449b2 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable


--
T= odd Lipcon
Software Engineer, Cloudera
--0000000000004af6f8056bb449b2--