Return-Path: Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: (qmail 7899 invoked from network); 3 Aug 2010 17:11:42 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 3 Aug 2010 17:11:42 -0000 Received: (qmail 35408 invoked by uid 500); 3 Aug 2010 17:11:42 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 34887 invoked by uid 500); 3 Aug 2010 17:11:41 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 34695 invoked by uid 99); 3 Aug 2010 17:11:41 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Aug 2010 17:11:41 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Aug 2010 17:11:39 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o73HBHki016864 for ; Tue, 3 Aug 2010 17:11:17 GMT Message-ID: <31498708.140061280855477198.JavaMail.jira@thor> Date: Tue, 3 Aug 2010 13:11:17 -0400 (EDT) From: "Tsz Wo (Nicholas), SZE (JIRA)" To: hdfs-issues@hadoop.apache.org Subject: [jira] Updated: (HDFS-1325) DFSClient(DFSInputStream) release the persistent connection with datanode when no data have been read for a long time In-Reply-To: <13496559.110881280715077054.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HDFS-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsz Wo (Nicholas), SZE updated HDFS-1325: ----------------------------------------- Hadoop Flags: [Incompatible change] After the timeout, could the client read again? > DFSClient(DFSInputStream) release the persistent connection with datanode when no data have been read for a long time > --------------------------------------------------------------------------------------------------------------------- > > Key: HDFS-1325 > URL: https://issues.apache.org/jira/browse/HDFS-1325 > Project: Hadoop HDFS > Issue Type: Improvement > Components: hdfs client > Reporter: jinglong.liujl > Fix For: 0.20.3 > > Attachments: dfsclient.patch, toomanyconnction.patch > > > When you use Hbase over hadoop. We found during scanning over a large table ( which has many regions and each region has many store files), there're too many connections has been kept between regionserver (act as DFSClient) and datanode. Even if the store file has been complete to scanning, the connections can not be closed. > In our cluster, too many extra connections cause too many system resource has been wasted, which cause system cpu on region server reach to a high level, then bring this region server down. > After investigating, we found the number of active connection is very small, and the most connection is idle. We add a timeout checker thread into DFSClient, to close this connection. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.