Return-Path: Delivered-To: apmail-hadoop-hdfs-issues-archive@minotaur.apache.org Received: (qmail 78173 invoked from network); 9 Feb 2011 07:00:25 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 9 Feb 2011 07:00:25 -0000 Received: (qmail 72741 invoked by uid 500); 9 Feb 2011 07:00:25 -0000 Delivered-To: apmail-hadoop-hdfs-issues-archive@hadoop.apache.org Received: (qmail 72627 invoked by uid 500); 9 Feb 2011 07:00:21 -0000 Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hdfs-issues@hadoop.apache.org Delivered-To: mailing list hdfs-issues@hadoop.apache.org Received: (qmail 72609 invoked by uid 99); 9 Feb 2011 07:00:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Feb 2011 07:00:20 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=5.0 tests=ALL_TRUSTED,T_RP_MATCHES_RCVD X-Spam-Check-By: apache.org Received: from [140.211.11.116] (HELO hel.zones.apache.org) (140.211.11.116) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Feb 2011 07:00:18 +0000 Received: from hel.zones.apache.org (hel.zones.apache.org [140.211.11.116]) by hel.zones.apache.org (Postfix) with ESMTP id 6E47C19ABB0 for ; Wed, 9 Feb 2011 06:59:57 +0000 (UTC) Date: Wed, 9 Feb 2011 06:59:57 +0000 (UTC) From: "Tsz Wo (Nicholas), SZE (JIRA)" To: hdfs-issues@hadoop.apache.org Message-ID: <152029748.4450.1297234797448.JavaMail.tomcat@hel.zones.apache.org> In-Reply-To: <437159414.979.1296517289091.JavaMail.tomcat@hel.zones.apache.org> Subject: [jira] Commented: (HDFS-1606) Provide a stronger data guarantee in the write pipeline MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HDFS-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992347#comment-12992347 ] Tsz Wo (Nicholas), SZE commented on HDFS-1606: ---------------------------------------------- h5. When to add a datanode? Since adding a datanode to an existing pipeline is an expensive operation (see [the previoius comment|https://issues.apache.org/jira/browse/HDFS-1606?focusedCommentId=12991839&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-12991839]), it should not be performed for every pipeline failure. Suppose the number of replications of the file is greater than or equal to 3. When a pipeline fails, the operation will be invoked if * the number of datanodes in the pipeline drops from 2 to 1; or * the block is reopened for append; or * it is specified by the user. Note that when the number of replications is specified to less than 3, the operation should not be invoked by default because performance is preferred over data guarantee. > Provide a stronger data guarantee in the write pipeline > ------------------------------------------------------- > > Key: HDFS-1606 > URL: https://issues.apache.org/jira/browse/HDFS-1606 > Project: Hadoop HDFS > Issue Type: New Feature > Components: data-node, hdfs client > Reporter: Tsz Wo (Nicholas), SZE > Assignee: Tsz Wo (Nicholas), SZE > > In the current design, if there is a datanode/network failure in the write pipeline, DFSClient will try to remove the failed datanode from the pipeline and then continue writing with the remaining datanodes. As a result, the number of datanodes in the pipeline is decreased. Unfortunately, it is possible that DFSClient may incorrectly remove a healthy datanode but leave the failed datanode in the pipeline because failure detection may be inaccurate under erroneous conditions. > We propose to have a new mechanism for adding new datanodes to the pipeline in order to provide a stronger data guarantee. -- This message is automatically generated by JIRA. - For more information on JIRA, see: http://www.atlassian.com/software/jira