Date: Wed, 2 Nov 2016 17:32:58 +0000 (UTC)
From: "Mingliang Liu (JIRA)"
To: hdfs-issues@hadoop.apache.org
Subject: [jira] [Commented] (HDFS-11090) Leave safemode immediately if all blocks have reported in

[ https://issues.apache.org/jira/browse/HDFS-11090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15629742#comment-15629742 ]

Mingliang Liu commented on HDFS-11090:
--------------------------------------

{quote}
Does this mean we should leave safemode when all the blocks have been reported?
I think sometimes users intentionally set the threshold > 1 so as not to leave safemode.
{quote}

I think what Andrew suggests is not to leave safemode if the threshold is > 1. I agree with this.

The failing tests are related. Please hold off on committing. I need more time to review the idea and the patch; we don't want to leave safemode too early.

> Leave safemode immediately if all blocks have reported in
> ---------------------------------------------------------
>
>                 Key: HDFS-11090
>                 URL: https://issues.apache.org/jira/browse/HDFS-11090
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: namenode
>    Affects Versions: 2.7.3
>            Reporter: Andrew Wang
>            Assignee: Yiqun Lin
>         Attachments: HDFS-11090.001.patch
>
>
> Startup safemode is triggered by two thresholds: % blocks reported in, and min # datanodes. It's extended by an interval (default 30s) until these two thresholds are met.
> Safemode extension is helpful when the cluster has data, and the default % blocks threshold (0.99) is used. It gives DNs a little extra time to report in and thus avoid unnecessary replication work.
> However, we can leave startup safemode early if 100% of blocks have reported in.
> Note that operators sometimes change the % blocks threshold to > 1 to never automatically leave safemode. We should maintain this behavior.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
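
For reviewers, the behavior under discussion in the issue description above can be sketched roughly as follows. This is only an illustration of the proposed rule, not the NameNode's actual code or the attached patch: the class name `SafeModeSketch`, the `Decision` enum, the `decide` method, and the way the block threshold is rounded are all assumptions made for this example.

```java
// Hypothetical sketch of the startup-safemode decision discussed in HDFS-11090.
// None of these names come from the Hadoop code base.
class SafeModeSketch {
    enum Decision { STAY, EXTEND, LEAVE }

    static Decision decide(long blocksReported, long blocksTotal,
                           double blockThresholdPct,
                           int liveDatanodes, int minDatanodes) {
        // A threshold > 1 can never be met; operators set it to stay in
        // safemode until they leave manually. That behavior is preserved.
        if (blockThresholdPct > 1.0) {
            return Decision.STAY;
        }
        // Assumed rounding: round the required block count up.
        long blocksNeeded = (long) Math.ceil(blocksTotal * blockThresholdPct);
        boolean blocksOk = blocksReported >= blocksNeeded;
        boolean nodesOk = liveDatanodes >= minDatanodes;
        if (!blocksOk || !nodesOk) {
            return Decision.STAY;
        }
        // Proposed change: once 100% of blocks are in, the extension has
        // nothing left to wait for, so leave immediately.
        if (blocksReported >= blocksTotal) {
            return Decision.LEAVE;
        }
        // Thresholds met but some blocks still missing: run the extension
        // interval (default 30s) to give DNs extra time to report in.
        return Decision.EXTEND;
    }

    public static void main(String[] args) {
        System.out.println(decide(1000, 1000, 0.99, 3, 1));
        System.out.println(decide(995, 1000, 0.99, 3, 1));
        System.out.println(decide(1000, 1000, 1.5, 3, 1));
    }
}
```

The `threshold > 1` check comes first on purpose, so that a cluster with every block reported still stays in safemode when the operator has asked for that.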