Date: Thu, 29 Sep 2016 07:17:27 +0000
From: "Michael Ho (Code Review)"
To: Tim Armstrong, impala-cr@cloudera.com, reviews@impala.incubator.apache.org
CC: Juan Yu, Chen Huang, Mostafa Mokhtar, Alex Behm, Dan Hecht
Reply-To: kwho@cloudera.com
Subject: [Impala-ASF-CR] IMPALA-4026: Implement double-buffering for BlockingQueue

Hello Tim Armstrong,

I'd like you to reexamine a change. Please visit

    http://gerrit.cloudera.org:8080/4350

to look at the new patch set (#11).

Change subject: IMPALA-4026: Implement double-buffering for BlockingQueue
......................................................................

IMPALA-4026: Implement double-buffering for BlockingQueue

With recent changes to improve the parquet scanner's efficiency, row batches are produced more quickly, leading to higher contention in the blocking queue shared between scanner threads and the scan node. The contention happens between different producers (i.e. the scanner threads) and also, to a lesser extent, between the scanner threads and the scan node.

This change addresses the contention between the scanner threads and the scan node by splitting the queue into a 'get_list_' and a 'put_list_'. The consumers will consume from 'get_list_' until it's exhausted while the producers will enqueue into 'put_list_' until it's full.
When 'get_list_' is exhausted, the consumer will atomically swap 'get_list_' with 'put_list_'. This reduces the contention: 'get_list_' and 'put_list_' are protected by two different locks, so callers of BlockingGet() only contend for 'put_lock_' when 'get_list_' is empty.

With this change, primitive_filter_bigint_non_selective improves by 33.9%, going from 1.60s to 1.06s.

Change-Id: Ib9f4cf351455efefb0f3bb791cf9bc82d1421d54
---
M be/src/common/compiler-util.h
M be/src/exec/hdfs-scan-node.cc
M be/src/exec/hdfs-scan-node.h
M be/src/util/blocking-queue.h
A be/src/util/condition-variable.h
M be/src/util/thread-pool.h
6 files changed, 222 insertions(+), 66 deletions(-)

git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/50/4350/11
--
To view, visit http://gerrit.cloudera.org:8080/4350
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib9f4cf351455efefb0f3bb791cf9bc82d1421d54
Gerrit-PatchSet: 11
Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-Owner: Michael Ho
Gerrit-Reviewer: Alex Behm
Gerrit-Reviewer: Chen Huang
Gerrit-Reviewer: Dan Hecht
Gerrit-Reviewer: Juan Yu
Gerrit-Reviewer: Michael Ho
Gerrit-Reviewer: Mostafa Mokhtar
Gerrit-Reviewer: Tim Armstrong
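
For readers who want to see the shape of the idea, the double-buffering scheme described in the commit message can be sketched roughly as below. This is a minimal illustration and not the code from the patch: the class name DoubleBufferedQueue, the Shutdown() method, the member names other than 'get_list_', 'put_list_' and 'put_lock_', and the use of std::mutex and std::condition_variable are all assumptions made for the example. The real change lives in be/src/util/blocking-queue.h and may differ in every detail.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <deque>
#include <mutex>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical, simplified double-buffered queue (not Impala's BlockingQueue).
template <typename T>
class DoubleBufferedQueue {
 public:
  explicit DoubleBufferedQueue(size_t max_elements) : max_elements_(max_elements) {}

  // Producer side: blocks while 'put_list_' is full.
  // Returns false if the queue has been shut down.
  bool BlockingPut(T val) {
    std::unique_lock<std::mutex> put_lock(put_lock_);
    put_cv_.wait(put_lock,
                 [this] { return shutdown_ || put_list_.size() < max_elements_; });
    if (shutdown_) return false;
    put_list_.push_back(std::move(val));
    put_lock.unlock();
    get_cv_.notify_one();
    return true;
  }

  // Consumer side: drains 'get_list_' under 'get_lock_' only. 'put_lock_' is
  // taken just when 'get_list_' is empty and the two lists must be swapped.
  // Returns false once the queue is shut down and fully drained.
  bool BlockingGet(T* out) {
    std::unique_lock<std::mutex> get_lock(get_lock_);
    if (get_list_.empty()) {
      std::unique_lock<std::mutex> put_lock(put_lock_);
      get_cv_.wait(put_lock, [this] { return shutdown_ || !put_list_.empty(); });
      if (put_list_.empty()) return false;  // shut down, nothing left
      assert(get_list_.empty());
      get_list_.swap(put_list_);  // producers are excluded by 'put_lock_'
      put_lock.unlock();
      put_cv_.notify_all();  // 'put_list_' is empty again
    }
    *out = std::move(get_list_.front());
    get_list_.pop_front();
    return true;
  }

  // Wakes all waiters; elements already enqueued can still be consumed.
  void Shutdown() {
    {
      std::lock_guard<std::mutex> put_lock(put_lock_);
      shutdown_ = true;
    }
    get_cv_.notify_all();
    put_cv_.notify_all();
  }

 private:
  const size_t max_elements_;  // capacity of 'put_list_' only
  std::mutex get_lock_;        // protects 'get_list_'
  std::deque<T> get_list_;
  std::mutex put_lock_;        // protects 'put_list_' and 'shutdown_'
  std::deque<T> put_list_;
  bool shutdown_ = false;
  std::condition_variable put_cv_;  // signalled when 'put_list_' has room
  std::condition_variable get_cv_;  // signalled when 'put_list_' has data
};

// Usage sketch: two producers enqueue 0..49 each; one consumer sums everything.
long DemoSum() {
  DoubleBufferedQueue<int> queue(8);
  std::vector<std::thread> producers;
  for (int p = 0; p < 2; ++p) {
    producers.emplace_back([&queue] {
      for (int i = 0; i < 50; ++i) queue.BlockingPut(i);
    });
  }
  long sum = 0;
  std::thread consumer([&queue, &sum] {
    int v;
    while (queue.BlockingGet(&v)) sum += v;
  });
  for (auto& t : producers) t.join();
  queue.Shutdown();
  consumer.join();
  return sum;
}
```

In this sketch a consumer holds only 'get_lock_' on the fast path, so it does not interfere with producers until 'get_list_' runs dry. One consequence of the simplification is that the queue can briefly hold up to twice max_elements_ items, since the bound is applied to 'put_list_' alone.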