hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ravi Prakash (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-5094) Show dead nodes information in dfsadmin -report
Date Tue, 15 Feb 2011 20:33:57 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-5094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12994988#comment-12994988
] 

Ravi Prakash commented on HADOOP-5094:
--------------------------------------

Hi Gary,

I've only just joined the hadoop team and am going to be writing automated tests for the test
cases you've listed. Please excuse my naivete if I'm way off. Can you please clarify what
you mean by "stop a node" . Is that using the hadoop-daemon.sh script? Won't the the namenode
mark the datanode as dead only after 10 mins? Should my test be that long-lived?

Cheers
Ravi.


> Show dead nodes information in dfsadmin -report
> -----------------------------------------------
>
>                 Key: HADOOP-5094
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5094
>             Project: Hadoop Common
>          Issue Type: Improvement
>    Affects Versions: 0.18.2
>            Reporter: Jim Huang
>            Assignee: Jakob Homan
>            Priority: Minor
>             Fix For: 0.21.0
>
>         Attachments: DfsAdminDeadNode_testCases.html, DfsAdminDeadNode_testCases.html,
HADOOP-5094.patch, HADOOP-5094.patch, HADOOP-5094.patch
>
>
> As part of operations responsibility to bring back dead nodes, it will be good to have
a quick way to obtain a list of dead data nodes.  
> The current way is to scrape the namenode web UI page and parse that information, but
this creates load on the namenode.   
> In search of a less costly way, I noticed dfsadmin -report only reports data nodes with
State: "In Service" and "Decommission in progress" get listed.
> Asking for a cheap way to obtain a list of dead nodes.  
> In addition, can the following requests be reviewed for additional enhancement and changes
to dfsadmin -report.
> - Consistent formatting output in "Remaining raw bytes:" for the data nodes should have
a space between the exact value and the parenthesized value.
> Sample:
> Total raw bytes: 3842232975360 (3.49 TB)
> Remaining raw bytes: 146090593065(136.06 GB)
> Used raw bytes: 3240864964620 (2.95 TB)
> - Include the running version of Hadoop.  
> - What is the meaning of "Total effective bytes"?
> - Display the hostname instead of the IP address for the data node (toggle option?)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message