Date: Fri, 20 Oct 2017 03:59:49 +0000
From: "Impala Public Jenkins (Code Review)"
To: Tianyi Wang, impala-cr@cloudera.com, reviews@impala.incubator.apache.org
Subject: [Impala-ASF-CR] IMPALA-5789: Add always_false flag in bloom filter

Impala Public Jenkins has submitted this change and it was merged. ( http://gerrit.cloudera.org:8080/8170 )

Change subject: IMPALA-5789: Add always_false flag in bloom filter
......................................................................

IMPALA-5789: Add always_false flag in bloom filter

This patch adds an always_false flag in bloom filters. The flag is set if
nothing has been inserted into the bloom filter. HdfsScanner uses this flag
to early terminate the scan at file and split granularities.

Testing: It passes existing tests. Two test cases are added checking that
an always-false runtime filter can filter out files and splits. In single
node perf tests, time spent on primitive_empty_build_join_1 is reduced by 75%.

Change-Id: If680240a3cd4583fc97c3192177d86d9567c4f8d
Reviewed-on: http://gerrit.cloudera.org:8080/8170
Reviewed-by: Tim Armstrong
Tested-by: Impala Public Jenkins
---
M be/src/common/global-flags.cc
M be/src/exec/base-sequence-scanner.cc
M be/src/exec/base-sequence-scanner.h
M be/src/exec/filter-context.cc
M be/src/exec/filter-context.h
M be/src/exec/hdfs-parquet-scanner.cc
M be/src/exec/hdfs-scan-node-base.cc
M be/src/exec/hdfs-scan-node.cc
M be/src/exec/hdfs-scanner.cc
M be/src/runtime/coordinator-backend-state.cc
M be/src/runtime/coordinator-backend-state.h
M be/src/runtime/coordinator-filter-state.h
M be/src/runtime/coordinator.cc
M be/src/runtime/runtime-filter-ir.cc
M be/src/runtime/runtime-filter.h
M be/src/runtime/runtime-filter.inline.h
M be/src/util/bloom-filter-ir.cc
M be/src/util/bloom-filter.cc
M be/src/util/bloom-filter.h
M common/thrift/ImpalaInternalService.thrift
A tests/custom_cluster/test_always_false_filter.py
M tests/query_test/test_runtime_filters.py
22 files changed, 200 insertions(+), 197 deletions(-)

Approvals:
  Tim Armstrong: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/8170

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: If680240a3cd4583fc97c3192177d86d9567c4f8d
Gerrit-Change-Number: 8170
Gerrit-PatchSet: 9
Gerrit-Owner: Tianyi Wang
Gerrit-Reviewer: Impala Public Jenkins
Gerrit-Reviewer: Sailesh Mukil
Gerrit-Reviewer: Thomas Tauber-Marshall
Gerrit-Reviewer: Tianyi Wang
Gerrit-Reviewer: Tim Armstrong
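
For readers who want to see the idea in the commit message in miniature, here is a toy sketch in Python. It is not Impala's C++ implementation: the class `BloomFilter`, the methods `insert` and `may_contain`, the helper `scan_splits`, and the SHA-256 based hashing are all hypothetical names and choices made for illustration. It only assumes what the message states: the flag is set while nothing has been inserted, and a scanner can stop early when it sees the flag.

```python
import hashlib


class BloomFilter:
    """Toy Bloom filter with an always_false flag (illustrative only)."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8)
        # True until the first insert: an empty filter can match nothing.
        self.always_false = True

    def _positions(self, value):
        # Derive num_hashes bit positions from salted SHA-256 digests
        # (an assumption for this sketch, not Impala's hash scheme).
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{value}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def insert(self, value):
        self.always_false = False
        for pos in self._positions(value):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def may_contain(self, value):
        if self.always_false:
            return False  # short-circuit without touching the bit array
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(value)
        )


def scan_splits(splits, bloom_filter):
    """Return (rows passing the filter, number of splits actually read)."""
    rows, splits_read = [], 0
    for split in splits:
        if bloom_filter.always_false:
            break  # no row can pass, so skip the remaining splits
        splits_read += 1
        rows.extend(v for v in split if bloom_filter.may_contain(v))
    return rows, splits_read
```

With a filter that never received an insert, `scan_splits([[1, 2], [3]], BloomFilter())` returns `([], 0)`: no split is read at all, which is the kind of saving the empty-build-side join benchmark in the message would exercise.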