Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 56049 invoked from network); 19 Mar 2008 17:22:18 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 19 Mar 2008 17:22:18 -0000 Received: (qmail 65346 invoked by uid 500); 19 Mar 2008 17:22:15 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 65318 invoked by uid 500); 19 Mar 2008 17:22:14 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 65309 invoked by uid 99); 19 Mar 2008 17:22:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 19 Mar 2008 10:22:14 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 19 Mar 2008 17:21:44 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 2BC3F234C0AF for ; Wed, 19 Mar 2008 10:20:27 -0700 (PDT) Message-ID: <1830939507.1205947227178.JavaMail.jira@brutus> Date: Wed, 19 Mar 2008 10:20:27 -0700 (PDT) From: "Hairong Kuang (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-2188) RPC should send a ping rather than use client timeouts MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-2188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12580455#action_12580455 ] Hairong Kuang commented on HADOOP-2188: --------------------------------------- Raghu, thanks for the review comments. > Why does the patch remove purging the connections on server? The purpose of this jira is not to throw away any calls. Otherwise the applications will wait for the result for ever. I do not think we should concern about the number of pending responses so much because hadoop-2910 throttle the client write rate and in general a client's read rate should not be slower than the write rate. If this is really a concern, what we can do is to limit the total number of calls in the server including unserved ones, being served ones, and pending responding ones. HADOOP-2910 only limits the unserved calls. > RPC should send a ping rather than use client timeouts > ------------------------------------------------------ > > Key: HADOOP-2188 > URL: https://issues.apache.org/jira/browse/HADOOP-2188 > Project: Hadoop Core > Issue Type: Improvement > Components: dfs, ipc > Reporter: Owen O'Malley > Assignee: Hairong Kuang > Attachments: ipc-timeout.patch, ipc-timeout1.patch, ipc-timeout2.patch, ipc-timeout3.patch, rpc-to.patch > > > Current RPC (really IPC) relies on client side timeouts to find "dead" sockets. I propose that we have a thread that once a minute (if the connection has been idle) writes a "ping" message to the socket. The client can detect a dead socket by the resulting error on the write, so no client side timeout is required. Also note that the ipc server does not need to respond to the ping, just discard it. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.