Mailing-List: contact dev-help@hawq.incubator.apache.org; run by ezmlm
Reply-To: dev@hawq.incubator.apache.org
MIME-Version: 1.0
References: <1476738705923.97577@ig.com>
From: Kyle Dunn
Date: Tue, 18 Oct 2016 02:08:39 +0000
Subject: Re: HAWQ Perfomance.
To: dev@hawq.incubator.apache.org
Content-Type: multipart/alternative; boundary=001a1149ae126a963c053f1a2c70
archived-at: Tue, 18 Oct 2016 02:08:57 -0000

--001a1149ae126a963c053f1a2c70
Content-Type: text/plain; charset=UTF-8

I'm also in strong agreement here. Codegen is a logical next step in my
mind. There are multiple inherent benefits, ranging from vectorised
processing to runtime GPU offload support. I think data locality and PXF
performance are important, although in pure cloud deployments compute is,
above all, what we influence most. Not to mention, the Greenplum team is
showing good potential with codegen; we should incorporate that work, in
any way possible, into HAWQ.

-Kyle

On Mon, Oct 17, 2016, 20:26 Hong Wu wrote:

> Strong +1 on this.
>
> Performance is one of the reasons why our customers choose HAWQ; the
> existing leading performance might come from the C implementation and
> the Postgres implementation, I think. HAWQ will definitely focus on
> some performance improvements, but frankly speaking the plan/roadmap
> should be shaped and discussed in detail, as in this thread. Below are
> some of our early considerations and our to-do list:
>
> - Codegen tech to optimize executor efficiency.
> - Data-skipping tech to optimize I/O performance.
> - Optimize external table access, especially PXF.
> - Some vectorized refactor.
> - Optimize data locality.
> - Optimize distributed resource organization and management.
> - Optimize communication module of interconnect.
> - GPUs, SSDs
> - ...
>
> We are running performance tests in several cluster environments for
> HAWQ every week and continue paying attention to the latest performance
> updates from our competitors and from research papers. But we need
> more people to join us and focus on performance features. Developers
> from the HAWQ open-source community are very welcome to join us on the
> performance work.
>
> Best
> xunzhang
>
> 2016-10-18 5:11 GMT+08:00 Michael Pearce:
>
> > Hi All,
> >
> > HAWQ is now being caught up by some competitors in terms of
> > real-world performance, and in some cases outperformed, most notably
> > by Spark 2.0, where some queries now run faster for us since Project
> > Tungsten.
> >
> > Obviously HAWQ still has the SQL completeness advantage, but this is
> > also a slowly changing space, where Spark and others are improving.
> >
> > Are there any plans to start looking at improving the execution
> > performance of HAWQ further, with Parquet vectorisation and
> > whole-stage codegen?
> >
> > http://www.slideshare.net/databricks/spark-performance-whats-next
> >
> > http://blog.2ndquadrant.com/postgresql-10-roadmap/
> >
> > On the note of the Postgres 10 roadmap: are there any plans to update
> > compatibility / the fork of Postgres to later versions (back
> > merging)? AFAIK HAWQ is a fork of 8.x, which is quite dated.
> >
> > I'm sure all of these questions have already been answered/discussed,
> > but it would be great to get some visibility into the roadmap for
> > these areas for HAWQ.
> >
> > Cheers
> >
> > Mike
> >
> > The information contained in this email is strictly confidential and
> > for the use of the addressee only, unless otherwise indicated. If you
> > are not the intended recipient, please do not read, copy, use or
> > disclose to others this message or any attachment. Please also notify
> > the sender by replying to this email or by telephone (+44(020 7896
> > 0011) and then delete the email and any copies of it. Opinions,
> > conclusion (etc) that do not relate to the official business of this
> > company shall be understood as neither given nor endorsed by it. IG
> > is a trading name of IG Markets Limited (a company registered in
> > England and Wales, company number 04008957) and IG Index Limited (a
> > company registered in England and Wales, company number 01190902).
> > Registered address at Cannon Bridge House, 25 Dowgate Hill, London
> > EC4R 2YA. Both IG Markets Limited (register number 195355) and IG
> > Index Limited (register number 114059) are authorised and regulated
> > by the Financial Conduct Authority.
> >
>
--
*Kyle Dunn | Data Engineering | Pivotal*
Direct: 303.905.3171 | Email: kdunn@pivotal.io
--001a1149ae126a963c053f1a2c70--