Date: Thu, 27 Feb 2014 22:38:59 -0800
Subject: Re: Question about dead datanode
From: Jack Levin
To: "user@hbase.apache.org"

Is this related to JIRA HDFS-378?

On Wed, Feb 26, 2014 at 11:54 AM, Jack Levin wrote:
> Submitted JIRA patch: https://issues.apache.org/jira/browse/HDFS-6022
> (with test)
>
> On Mon, Feb 24, 2014 at 12:16 PM, Jack Levin wrote:
>> I will do that.
>>
>> -Jack
>>
>> On Mon, Feb 24, 2014 at 6:23 AM, Steve Loughran wrote:
>>> that's a very old version of cloudera's branch you are working with there;
>>> patching that is not a good way to go, as you are on the slippery slope of
>>> having your own private branch and all the costs of it.
>>>
>>> It looks like dead node logic has -> DFSInputStream, where it is still
>>> instance-specific:
>>>
>>>   /* XXX Use of CocurrentHashMap is temp fix. Need to fix
>>>    * parallel accesses to DFSInputStream (through ptreads) properly */
>>>   private final ConcurrentHashMap deadNodes =
>>>       new ConcurrentHashMap();
>>>
>>> This implies the problem still exists -and the opportunity to fix it -but
>>> you will need to modify your patch to apply to hadoop trunk, ideally think
>>> of a test, then submit a patch to the HDFS project on JIRA.
>>>
>>>
>>> On 19 February 2014 04:48, Stack wrote:
>>>
>>>> On Sat, Feb 15, 2014 at 8:01 PM, Jack Levin wrote:
>>>>
>>>> > Looks like I patched it in DFSClient.java, here is the patch:
>>>> > https://gist.github.com/anonymous/9028934
>>>> >
>>>> > ....
>>>>
>>>>
>>>> > I moved 'deadNodes' list outside as global field that is accessible by
>>>> > all running threads, so at any point datanode does go down, each
>>>> > thread is basically informed that the datanode _is_ down.
>>>> >
>>>>
>>>> We need to add something like this to current versions of DFSClient, a
>>>> global status, so each stream does not have to discover bad DNs for itself.
>>>> St.Ack
>>>>
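
For anyone following the thread, here is a rough sketch of the "global status" idea discussed above: one process-wide set of dead datanodes that every reader consults, instead of a per-stream map. This is not the HDFS-6022 patch and not real DFSClient code. The class SharedDeadNodes, its methods, and the "host:port" string IDs are all hypothetical names made up for illustration, and the sketch assumes entries are never expired.

```java
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical illustration only; none of these names exist in HDFS.
final class SharedDeadNodes {
    // One set per JVM, shared by all reader threads (thread-safe key set).
    private static final Set<String> DEAD = ConcurrentHashMap.newKeySet();

    private SharedDeadNodes() {}

    // Called by whichever reader first fails against a datanode.
    static void markDead(String datanode) {
        DEAD.add(datanode);
    }

    // Any other reader can check before it tries to connect.
    static boolean isDead(String datanode) {
        return DEAD.contains(datanode);
    }

    // Forget all recorded failures (assumed here to be a manual reset).
    static void clear() {
        DEAD.clear();
    }

    // Returns the first replica not known to be dead, or null if none is left.
    static String firstLive(List<String> replicas) {
        for (String dn : replicas) {
            if (!DEAD.contains(dn)) {
                return dn;
            }
        }
        return null;
    }
}
```

With something like this, a thread that hits a timeout on one datanode calls markDead once, and every other thread's firstLive call skips that node right away rather than waiting out its own timeout. A real fix would presumably also need a way to bring nodes back once they recover, which this sketch leaves out.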