Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 86409 invoked from network); 3 Jun 2008 21:39:09 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 3 Jun 2008 21:39:09 -0000 Received: (qmail 75276 invoked by uid 500); 3 Jun 2008 21:39:11 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 75256 invoked by uid 500); 3 Jun 2008 21:39:11 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 75241 invoked by uid 99); 3 Jun 2008 21:39:11 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jun 2008 14:39:11 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Jun 2008 21:38:23 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 5AD4A234C130 for ; Tue, 3 Jun 2008 14:38:45 -0700 (PDT) Message-ID: <610946310.1212529125370.JavaMail.jira@brutus> Date: Tue, 3 Jun 2008 14:38:45 -0700 (PDT) From: "Hairong Kuang (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-2909) Improve IPC idle connection management In-Reply-To: <206790041.1204138791150.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12602086#action_12602086 ] Hairong Kuang commented on HADOOP-2909: --------------------------------------- Ankur, thanks for reviewing this patch. Yes, this patch mainly handles those crashed clients that have no outstanding requests. Crashed clients with outstanding requests are detected by the responder when writing replies. Slow clients with outstanding requests are handled by Server.Responder.doPurge. > Improve IPC idle connection management > -------------------------------------- > > Key: HADOOP-2909 > URL: https://issues.apache.org/jira/browse/HADOOP-2909 > Project: Hadoop Core > Issue Type: Bug > Components: ipc > Affects Versions: 0.16.0 > Reporter: Hairong Kuang > Assignee: Hairong Kuang > Fix For: 0.18.0 > > Attachments: idleConn.patch > > > IPC server determines if a connection is idle or not by checking if the connection does not have any IO activity for a predefined max idle time. An idle connection will be closed even if the connection still has outstanding requests or replies. This causes RPC failures when a server becomes slow or if a request takes a long time to be served. In jira, I'd like to propose the following changes to IPC idle management: > 1. Add data structures to the IPC server that keep track of outstanding requests. > 2. IPC server does not close a connection that has outstanding requests/replies even when it has no IO activities for a while. > 3. The default client-side max idle time should be in several minutes not 1 second. > 4. The server-side max idle time should be greater than the client-side max idle time, for example, twice of the client-side max idle time. So server mainly deals with clients that are crashed without closing > its connections. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.