Date: Thu, 19 Oct 2017 18:18:07 +0000
From: "Tianyi Wang (Code Review)"
To: Thomas Tauber-Marshall, Sailesh Mukil, Tim Armstrong, impala-cr@cloudera.com, reviews@impala.incubator.apache.org
Reply-To: twang@cloudera.com, impala-cr@cloudera.com, sailesh@cloudera.com, marcelk@gmail.com, tarmstrong@cloudera.com, tmarshall@cloudera.com, reviews@impala.incubator.apache.org
Subject: [Impala-ASF-CR] IMPALA-5789: Add always_false flag in bloom filter
Message-Id: <201710191818.v9JII8bP004760@ip-10-146-233-104.ec2.internal>
Mailing-List: contact reviews-help@impala.incubator.apache.org; run by ezmlm
X-Gerrit-Commit: 1cbd672d382b6772ada294e5a216594ed6ef80b2
User-Agent: Gerrit/2.14.2

Hello Thomas Tauber-Marshall, Sailesh Mukil, Tim Armstrong,

I'd like you to reexamine a change. Please visit

    http://gerrit.cloudera.org:8080/8170

to look at the new patch set (#7).

Change subject: IMPALA-5789: Add always_false flag in bloom filter
......................................................................

IMPALA-5789: Add always_false flag in bloom filter

This patch adds an always_false flag to bloom filters. The flag is
set if nothing has been inserted into the bloom filter. HdfsScanner
uses this flag to terminate the scan early at file and split
granularities.

Testing:
It passes existing tests. Two test cases are added checking that an
always-false runtime filter can filter out files and splits. In
single-node perf tests, time spent on primitive_empty_build_join_1
is reduced by 75%.

Change-Id: If680240a3cd4583fc97c3192177d86d9567c4f8d
---
M be/src/common/global-flags.cc
M be/src/exec/base-sequence-scanner.cc
M be/src/exec/base-sequence-scanner.h
M be/src/exec/filter-context.cc
M be/src/exec/filter-context.h
M be/src/exec/hdfs-parquet-scanner.cc
M be/src/exec/hdfs-scan-node-base.cc
M be/src/exec/hdfs-scan-node.cc
M be/src/exec/hdfs-scanner.cc
M be/src/runtime/coordinator-backend-state.cc
M be/src/runtime/coordinator-backend-state.h
M be/src/runtime/coordinator-filter-state.h
M be/src/runtime/coordinator.cc
M be/src/runtime/runtime-filter-ir.cc
M be/src/runtime/runtime-filter.h
M be/src/runtime/runtime-filter.inline.h
M be/src/util/bloom-filter-ir.cc
M be/src/util/bloom-filter.cc
M be/src/util/bloom-filter.h
M common/thrift/ImpalaInternalService.thrift
A tests/custom_cluster/test_always_false_filter.py
M tests/query_test/test_runtime_filters.py
22 files changed, 200 insertions(+), 197 deletions(-)

  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/70/8170/7
--
To view, visit http://gerrit.cloudera.org:8080/8170
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: If680240a3cd4583fc97c3192177d86d9567c4f8d
Gerrit-Change-Number: 8170
Gerrit-PatchSet: 7
Gerrit-Owner: Tianyi Wang
Gerrit-Reviewer: Sailesh Mukil
Gerrit-Reviewer: Thomas Tauber-Marshall
Gerrit-Reviewer: Tianyi Wang
Gerrit-Reviewer: Tim Armstrong
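
For readers unfamiliar with the idea, the always_false behaviour described in the commit message can be illustrated with a toy sketch. This is not Impala's code: the class `BloomFilter`, the helper `splits_to_scan`, the hashing scheme, and all parameter names are hypothetical and exist only to show the idea that a filter with no insertions can reject every probe, so a scanner may skip its work entirely.

```python
import hashlib


class BloomFilter:
    """Toy Bloom filter with an always_false flag (hypothetical, not Impala's class)."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for clarity
        self.always_false = True  # nothing inserted yet, so no probe can match

    def _positions(self, key):
        # Derive num_hashes bit positions from salted SHA-256 digests.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def insert(self, key):
        self.always_false = False  # the first insert clears the flag
        for pos in self._positions(key):
            self.bits[pos] = 1

    def find(self, key):
        if self.always_false:
            return False  # short-circuit without hashing
        return all(self.bits[pos] for pos in self._positions(key))


def splits_to_scan(splits, filters):
    """Return the splits worth scanning: none if any filter is always false."""
    if any(f.always_false for f in filters):
        return []
    return list(splits)
```

In this sketch an empty build side leaves `always_false` set, so `splits_to_scan` returns an empty list and no rows are read. That is the situation the primitive_empty_build_join_1 measurement in the commit message appears to target.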