Return-Path: X-Original-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 41A77D3CD for ; Wed, 5 Sep 2012 06:27:09 +0000 (UTC) Received: (qmail 27967 invoked by uid 500); 5 Sep 2012 06:27:08 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 27928 invoked by uid 500); 5 Sep 2012 06:27:08 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 27904 invoked by uid 99); 5 Sep 2012 06:27:08 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Sep 2012 06:27:08 +0000 Date: Wed, 5 Sep 2012 17:27:08 +1100 (NCT) From: "Vinay (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <1145366792.37646.1346826428421.JavaMail.jiratomcat@arcas> In-Reply-To: <4007362.39761288982082841.JavaMail.jira@thor> Subject: [jira] [Assigned] (HDFS-1490) TransferFSImage should timeout MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HDFS-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinay reassigned HDFS-1490: --------------------------- Assignee: Vinay (was: Dmytro Molkov) > TransferFSImage should timeout > ------------------------------ > > Key: HDFS-1490 > URL: https://issues.apache.org/jira/browse/HDFS-1490 > Project: Hadoop HDFS > Issue Type: Bug > Components: name-node > Reporter: Dmytro Molkov > Assignee: Vinay > Priority: Minor > Fix For: 3.0.0, 2.2.0-alpha > > Attachments: HDFS-1490.patch, HDFS-1490.patch, HDFS-1490.patch, HDFS-1490.patch > > > Sometimes when primary crashes during image transfer secondary namenode would hang trying to read the image from HTTP connection forever. > It would be great to set timeouts on the connection so if something like that happens there is no need to restart the secondary itself. > In our case restarting components is handled by the set of scripts and since the Secondary as the process is running it would just stay hung until we get an alarm saying the checkpointing doesn't happen. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira