Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 87546200B44 for ; Thu, 30 Jun 2016 01:35:45 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 860A8160A6E; Wed, 29 Jun 2016 23:35:45 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id CB0C6160A57 for ; Thu, 30 Jun 2016 01:35:44 +0200 (CEST) Received: (qmail 20521 invoked by uid 500); 29 Jun 2016 23:35:44 -0000 Mailing-List: contact dev-help@impala.incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@impala.incubator.apache.org Delivered-To: mailing list dev@impala.incubator.apache.org Received: (qmail 20510 invoked by uid 99); 29 Jun 2016 23:35:43 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Jun 2016 23:35:43 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id 45530CDAE0 for ; Wed, 29 Jun 2016 23:35:43 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.362 X-Spam-Level: X-Spam-Status: No, score=0.362 tagged_above=-999 required=6.31 tests=[RDNS_DYNAMIC=0.363, SPF_PASS=-0.001] autolearn=disabled Received: from mx2-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id 17iMzxxIBfPj for ; Wed, 29 Jun 2016 23:35:41 +0000 (UTC) Received: from ip-10-146-233-104.ec2.internal (ec2-75-101-130-251.compute-1.amazonaws.com [75.101.130.251]) by mx2-lw-eu.apache.org (ASF Mail Server at mx2-lw-eu.apache.org) with ESMTPS id C17275F242 for ; Wed, 29 Jun 2016 23:35:40 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by ip-10-146-233-104.ec2.internal (8.14.4/8.14.4) with ESMTP id u5TNZdBo000584; Wed, 29 Jun 2016 23:35:39 GMT Message-Id: <201606292335.u5TNZdBo000584@ip-10-146-233-104.ec2.internal> Date: Wed, 29 Jun 2016 23:35:39 +0000 From: "Juan Yu (Code Review)" To: impala-cr@cloudera.com, dev@impala.incubator.apache.org CC: Henry Robinson , Alan Choi , Dan Hecht , Huaisi Xu , Sailesh Mukil Reply-To: jyu@cloudera.com X-Gerrit-MessageType: comment Subject: =?UTF-8?Q?=5BImpala-CR=5D=28cdh5-trunk=29_IMPALA-3575=3A_Add_retry_to_backend_connection_request_and_rpc_timeout=0A?= X-Gerrit-Change-Id: Id6723cfe58df6217f4a9cdd12facd320cbc24964 X-Gerrit-ChangeURL: X-Gerrit-Commit: 18fd40b98e90738857f000f0b9e727d759868f98 In-Reply-To: References: MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Content-Disposition: inline User-Agent: Gerrit/2.12.2 archived-at: Wed, 29 Jun 2016 23:35:45 -0000 Juan Yu has posted comments on this change. Change subject: IMPALA-3575: Add retry to backend connection request and rpc timeout ...................................................................... Patch Set 15: (11 comments) http://gerrit.cloudera.org:8080/#/c/3343/15/be/src/runtime/client-cache.h File be/src/runtime/client-cache.h: Line 97: void DestroyClient(ClientKey* client_key); > needs a comment Done PS15, Line 197: hasRpcError_ > C++-style naming Done PS15, Line 239: IsTimeoutTException > should now be called IsRecvTimeoutTException() Done PS15, Line 240: RPC_TIMEOUT > should be RPC_RECV_TIMEOUT Done PS15, Line 246: if (strstr(e.what(),"unknown result") != NULL) > this seems very brittle, and at least should be wrapped in an IsXXXExceptio Done PS15, Line 268: release > back pressure is added by a blocking RPC, not released. Done PS15, Line 273: timeout > what's the unit? Done PS15, Line 272: Status DoRpcTimedWait(const F& f, const Request& request, Response* response, : const G& wait_func, uint64_t timeout, RuntimeState* state, bool* can_retry = NULL) > This seems like it breaks abstractions: not all RPCs happen in the context I agree with you. I did it this way in patch#6. but had to duplicate the wait response loop for several rpc calls so I changed it later. PS15, Line 282: bool no_timeout = timeout == 0; > this can be simplified by having a deadline variable: Done Line 290: if (!IsTimeoutTException(e)) { > Just curious, which function in recv_TransmitData() throws timeout exceptio The exception is thrown by thrift, see TSocket.cpp read() recv_TransmitData() will call thrift api to read data from server side. PS15, Line 309: bool hasRpcError_; > the role of this variable is not clear. Please add a comment. Done -- To view, visit http://gerrit.cloudera.org:8080/3343 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-MessageType: comment Gerrit-Change-Id: Id6723cfe58df6217f4a9cdd12facd320cbc24964 Gerrit-PatchSet: 15 Gerrit-Project: Impala Gerrit-Branch: cdh5-trunk Gerrit-Owner: Juan Yu Gerrit-Reviewer: Alan Choi Gerrit-Reviewer: Dan Hecht Gerrit-Reviewer: Henry Robinson Gerrit-Reviewer: Huaisi Xu Gerrit-Reviewer: Juan Yu Gerrit-Reviewer: Sailesh Mukil Gerrit-HasComments: Yes