Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 9D3B1200D1F for ; Fri, 13 Oct 2017 22:34:14 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 9BAAD160BE5; Fri, 13 Oct 2017 20:34:14 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E0ADA1609CA for ; Fri, 13 Oct 2017 22:34:13 +0200 (CEST) Received: (qmail 96292 invoked by uid 500); 13 Oct 2017 20:34:13 -0000 Mailing-List: contact reviews-help@impala.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list reviews@impala.incubator.apache.org Received: (qmail 96280 invoked by uid 99); 13 Oct 2017 20:34:12 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 13 Oct 2017 20:34:12 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 1D22EC2A60 for ; Fri, 13 Oct 2017 20:34:12 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.363 X-Spam-Level: ** X-Spam-Status: No, score=2.363 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=2, RDNS_DYNAMIC=0.363, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id WvNQ754FkFKL for ; Fri, 13 Oct 2017 20:34:11 +0000 (UTC) Received: from ip-10-146-233-104.ec2.internal (ec2-75-101-130-251.compute-1.amazonaws.com [75.101.130.251]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id 1BA0F5F523 for ; Fri, 13 Oct 2017 20:34:09 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by ip-10-146-233-104.ec2.internal (8.14.4/8.14.4) with ESMTP id v9DKY6tR014621; Fri, 13 Oct 2017 20:34:06 GMT Message-Id: <201710132034.v9DKY6tR014621@ip-10-146-233-104.ec2.internal> X-Gerrit-PatchSet: 4 Date: Fri, 13 Oct 2017 20:34:05 +0000 From: "Tim Armstrong (Code Review)" To: Tianyi Wang , impala-cr@cloudera.com, reviews@impala.incubator.apache.org CC: Sailesh Mukil , Thomas Tauber-Marshall X-Gerrit-MessageType: comment Subject: =?UTF-8?Q?=5BImpala-ASF-CR=5D_IMPALA-5789=3A_Add_always_false_flag_in_bloom_filter=0A?= X-Gerrit-Change-Id: If680240a3cd4583fc97c3192177d86d9567c4f8d X-Gerrit-Change-Number: 8170 X-Gerrit-ChangeURL: X-Gerrit-Commit: 413d6e455ae01f02ce84528a17362423c44868e7 In-Reply-To: References: X-Gerrit-Comment-Date: Fri, 13 Oct 2017 20:34:05 +0000 Reply-To: tarmstrong@cloudera.com, impala-cr@cloudera.com, sailesh@cloudera.com, twang@cloudera.com, marcelk@gmail.com, tmarshall@cloudera.com, reviews@impala.incubator.apache.org MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Content-Disposition: inline User-Agent: Gerrit/2.14.2 Content-Type: multipart/alternative; boundary="EucaTb946aw="; charset=UTF-8 archived-at: Fri, 13 Oct 2017 20:34:14 -0000 --EucaTb946aw= Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Tim Armstrong has posted comments on this change=2E ( http://gerrit=2Ecloud= era=2Eorg:8080/8170 ) Change subject: IMPALA-5789: Add always_false flag i= n bloom filter =2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E= =2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E= =2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E=2E= Patch Set 4: (5 comments) http://gerrit=2Ecloudera=2Eorg:8080/#/c/8170= /4//COMMIT_MSG Commit Message: http://gerrit=2Ecloudera=2Eorg:8080/#/c/817= 0/4//COMMIT_MSG@15 PS4, Line 15: Did you do any perf runs? It would be goo= d to verify that the extra flag checking doesn't affect perf (I suspect it = doesn't)=2E It would also be good to confirm that testdata/workloads/targe= ted-perf/queries/primitive_empty_build_join_1=2Etest gets faster (I think i= t should!)=2E http://gerrit=2Ecloudera=2Eorg:8080/#/c/8170/4/be/src/exec/= hdfs-parquet-scanner=2Ecc File be/src/exec/hdfs-parquet-scanner=2Ecc: http= ://gerrit=2Ecloudera=2Eorg:8080/#/c/8170/4/be/src/exec/hdfs-parquet-scanner= =2Ecc@403 PS4, Line 403: std:: Shouldn't need std:: - it's imported in comm= on/names=2Eh http://gerrit=2Ecloudera=2Eorg:8080/#/c/8170/4/be/src/exec/h= dfs-scanner=2Ecc File be/src/exec/hdfs-scanner=2Ecc: http://gerrit=2Ecloud= era=2Eorg:8080/#/c/8170/4/be/src/exec/hdfs-scanner=2Ecc@107 PS4, Line 107: = if (!BaseSequenceScanner::FileFormatIsSequenceBased( Might be more read= able to factor subexpression into variable e=2Eg=2E is_sequence_based=2E = http://gerrit=2Ecloudera=2Eorg:8080/#/c/8170/4/tests/custom_cluster/test_al= ways_false_filter=2Epy File tests/custom_cluster/test_always_false_filter= =2Epy: http://gerrit=2Ecloudera=2Eorg:8080/#/c/8170/4/tests/custom_cluster= /test_always_false_filter=2Epy@53 PS4, Line 53: seq_table_suffixes =3D ['= _avro', '_rc', '_seq'] We'd normally create a test matrix based on these=2E= Is the idea here to avoid restarting the cluster for each file format? If = so it would be good to leave a comment so that readers understand why=2E = http://gerrit=2Ecloudera=2Eorg:8080/#/c/8170/4/tests/custom_cluster/test_al= ways_false_filter=2Epy@82 PS4, Line 82: def test_skip_file(self, cursor):= Does this need to be a custom cluster test? I=2Ee=2E does it need a specia= l minicluster to execute=2E It's best to make things query tests if possibl= e since starting a cluster is slow and the tests aren't parallelisable=2E = -- To view, visit http://gerrit=2Ecloudera=2Eorg:8080/8170 To unsubscrib= e, visit http://gerrit=2Ecloudera=2Eorg:8080/settings Gerrit-Project: Impa= la-ASF Gerrit-Branch: master Gerrit-MessageType: comment Gerrit-Change-Id: = If680240a3cd4583fc97c3192177d86d9567c4f8d Gerrit-Change-Number: 8170 Gerrit= -PatchSet: 4 Gerrit-Owner: Tianyi Wang Gerrit-Review= er: Sailesh Mukil Gerrit-Reviewer: Thomas Tauber-M= arshall Gerrit-Reviewer: Tianyi Wang Gerrit-Reviewer: Tim Armstrong Gerr= it-Comment-Date: Fri, 13 Oct 2017 20:34:05 +0000 Gerrit-HasComments: Yes --EucaTb946aw=--