From: yu feng
Date: Fri, 3 Nov 2017 14:32:43 +0800
Subject: extend impala to query from Elasticsearch
To: dev@impala.incubator.apache.org

Hi all,

We are trying to query data in Elasticsearch from Impala. The goal is to combine the speed of Impala's execution engine with Elasticsearch's fast filtering and aggregation. I would like to do it as follows:

1. Add a new table type (in the metadata) called ES Table.
2. Add two new ExecNodes, ESScanNode and ESAggregation, to execute queries against ES.
3. When a query targets an ES Table and the execution plan contains an Aggregation node (parent) directly above an ESScanNode (child), rewrite that pair into a single ESAggregation node.

This way, I think both the scan and the aggregation can be done by ES.

I would like to know what you think of this combination, and whether there is a better way to implement it.

Thanks a lot.
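
The plan rewrite proposed in this message (an Aggregation parent over an ESScanNode child collapsing into one ESAggregation node) can be illustrated with a minimal sketch. This is an illustration only, not Impala code: the Python classes below are hypothetical stand-ins, the fields `index`, `filters`, `group_by` and `agg_exprs` are assumptions, and only the names ESScanNode and ESAggregation come from the message itself. A real rewrite would presumably also need to check that each aggregate expression can actually be evaluated by Elasticsearch; the sketch skips that check.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PlanNode:
    """Hypothetical stand-in for a node in an execution plan tree."""
    children: List["PlanNode"] = field(default_factory=list)


@dataclass
class ESScanNode(PlanNode):
    """Scans an ES index; field names are assumptions for illustration."""
    index: str = ""
    filters: List[str] = field(default_factory=list)


@dataclass
class AggregationNode(PlanNode):
    """Generic aggregation evaluated by the query engine itself."""
    group_by: List[str] = field(default_factory=list)
    agg_exprs: List[str] = field(default_factory=list)


@dataclass
class ESAggregation(PlanNode):
    """Scan plus aggregation, both delegated to ES (hypothetical layout)."""
    index: str = ""
    filters: List[str] = field(default_factory=list)
    group_by: List[str] = field(default_factory=list)
    agg_exprs: List[str] = field(default_factory=list)


def push_down_aggregation(node: PlanNode) -> PlanNode:
    """Replace every Aggregation(parent)/ESScanNode(child) pair with ESAggregation."""
    # Rewrite the children first so nested pairs are handled bottom-up.
    node.children = [push_down_aggregation(child) for child in node.children]

    is_candidate = (
        isinstance(node, AggregationNode)
        and len(node.children) == 1
        and isinstance(node.children[0], ESScanNode)
    )
    if not is_candidate:
        return node

    scan = node.children[0]
    # Merge the scan's index and filters with the parent's grouping and aggregates.
    return ESAggregation(
        index=scan.index,
        filters=list(scan.filters),
        group_by=list(node.group_by),
        agg_exprs=list(node.agg_exprs),
    )


# Example plan: root -> Aggregation -> ESScanNode
plan = PlanNode(children=[
    AggregationNode(
        children=[ESScanNode(index="logs", filters=["status = 500"])],
        group_by=["host"],
        agg_exprs=["count(*)"],
    )
])
rewritten = push_down_aggregation(plan)
```

After the rewrite, the root's only child is an ESAggregation carrying the scan's index and filters together with the grouping and aggregate expressions. An aggregation whose child is not an ESScanNode is left unchanged.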