From: Cliff Resnick
Date: Sat, 15 Dec 2018 22:16:51 -0500
Subject: using Kudu binary column in Impala
To: user@kudu.apache.org, user@impala.apache.org
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="000000000000363064057d1b1945"

--000000000000363064057d1b1945
Content-Type: text/plain; charset="UTF-8"

We're testing storing HyperLogLog synopses in Kudu.
It works well in Spark, but the hope is to also query it through Impala with a UDF. Spark would remain the writer, with Impala read-only.

To work with Impala, I'm wondering whether it's best to define the HLL data as a Kudu string type with plain encoding, or whether it's possible to keep it as binary but declare it as string in an external table definition. I presume the latter is not possible, since Kudu's generated external table script does not do this.

Please forgive me for not conducting my own experimentation, but I figured someone here has run up against this before. If so, please let me know!

-Cliff

--000000000000363064057d1b1945--