Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 22E2A200B4B for ; Thu, 7 Jul 2016 05:08:03 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 215BE160A73; Thu, 7 Jul 2016 03:08:03 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 6982B160A64 for ; Thu, 7 Jul 2016 05:08:02 +0200 (CEST) Received: (qmail 96056 invoked by uid 500); 7 Jul 2016 03:08:01 -0000 Mailing-List: contact dev-help@hawq.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hawq.incubator.apache.org Delivered-To: mailing list dev@hawq.incubator.apache.org Received: (qmail 96044 invoked by uid 99); 7 Jul 2016 03:08:01 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2016 03:08:01 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id C232A1A001F for ; Thu, 7 Jul 2016 03:08:00 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.28 X-Spam-Level: * X-Spam-Status: No, score=1.28 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=pivotal-io.20150623.gappssmtp.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id oLEKnnHaOvWx for ; Thu, 7 Jul 2016 03:07:56 +0000 (UTC) Received: from mail-qt0-f179.google.com (mail-qt0-f179.google.com [209.85.216.179]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 9A31E5FB6F for ; Thu, 7 Jul 2016 03:07:56 +0000 (UTC) Received: by mail-qt0-f179.google.com with SMTP id m2so2627927qtd.1 for ; Wed, 06 Jul 2016 20:07:56 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=pivotal-io.20150623.gappssmtp.com; s=20150623; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=6oKkj3vEfRbnR5ZOkDGmdAKqwyRX0bqIAMXfGwhx4ww=; b=cy9ykn8k11RqFzG+RnwNeGiZSZMzHzuizBREwVIogu3LhsaB+WtncxrC/Wv2VBb65R k3Hpy2TPFOp/1azTUq7RahHl4pahCFybstMBUwyi+/V4JQen79u2T9lGqi9cA9Dkzjxl uV55mcdzNyl+UjQF20RY5U17x2hZfR5pAxZlTZPKKSauzZjL5KKEtt0RDuBn5/nvl8fl e3eA1RMdrqJ0m3Pc0FjtnoJmYRAvwO2FnVKRptZB/X0NUlaZROrX3X6Us07GFiZjFwR+ n8bSoR/RX+5eAvy624UcGO0loAXTuuzwO5bRegKjgDRlF/TFvqF78yfJBCoXo4NmpFsa 6eaA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=6oKkj3vEfRbnR5ZOkDGmdAKqwyRX0bqIAMXfGwhx4ww=; b=Nx4P34yEV+uiLIjMJDz1UOKfowB4EUtFX04YonCxPY74tIrXOw+8tt1MbmP5ZpVwtY ibkLZAgkDeC1XsVM+mXmFoxt7Ljudzyyqzop4+UD1pVZOknUosAWRWzk8TXx56S5SxFQ 97A+roYgxUUIAVqe7jCEAVKq+2Yuakk6Ox9Wn120zOy5SVuUhrwYNOdsH3QcroulNWit dx/mxnm8mYn6F9lZbqko1Ebs8CaY0AemgHREsXuGZI4cMOKu1UvpCYUGY8NtyTlPEvIc H9aT08KepJUPl6kxgBVSRwUivxewvtXlkjXdowJwSTdQOYGwsszaerWPqwDZL15y0/NG X+gg== X-Gm-Message-State: ALyK8tIsfWtSHPHwQASaDhxm+ZJAxwNwCCps6H4WMLPA5LgiDwy+i/J8KFjqPlcZ/jEIXzmcNoziLQko3Eq+4GK2 X-Received: by 10.237.55.168 with SMTP id j37mr40164728qtb.102.1467860875371; Wed, 06 Jul 2016 20:07:55 -0700 (PDT) MIME-Version: 1.0 Received: by 10.200.56.113 with HTTP; Wed, 6 Jul 2016 20:07:54 -0700 (PDT) In-Reply-To: References: From: Hubert Zhang Date: Thu, 7 Jul 2016 11:07:54 +0800 Message-ID: Subject: Re: [Propose] New PXF profile optimized for ORC (predicate pushdown) To: dev@hawq.incubator.apache.org Content-Type: multipart/alternative; boundary=001a1142d3dc0e81a8053702feb7 archived-at: Thu, 07 Jul 2016 03:08:03 -0000 --001a1142d3dc0e81a8053702feb7 Content-Type: text/plain; charset=UTF-8 +1 for lazy reader, It can save a lot of decompression and deserialization(CPU bound) time. On Wed, Jul 6, 2016 at 7:00 AM, Roman Shaposhnik wrote: > On Tue, Jul 5, 2016 at 12:01 PM, Shivram Mani > wrote: > > I've created the following jira HAWQ-866 > > which is focussed on > > improving/enhancing the existing PXF profile to read ORC files. The goal > is > > to make use of the underlying ORC reader's capability of supporting > > predicate push-down among others. > > > > Presto has also contributed an alternative ORC reader which provides both > > predicate push down and Lazy reads > > > https://code.facebook.com/posts/370832626374903/even-faster-data-at-the-speed-of-presto-orc/ > > . > > > > Will be evaluating both the options as part of this effort. > > Great to see this effort! Do you plan to come up with any kind of > benchmark to > be able to compare the native ORC reader vs. PXF ORC reader performance > and capabilities? > > Or does it really all just boil down to TPC? > > Thanks, > Roman. > -- Thanks Hubert Zhang --001a1142d3dc0e81a8053702feb7--