hadoop-common-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ravi Prakash (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-5094) Show dead nodes information in dfsadmin -report
Date Mon, 07 Mar 2011 16:53:59 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-5094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13003427#comment-13003427
] 

Ravi Prakash commented on HADOOP-5094:
--------------------------------------

I'm not sure what the expected behavior is when a node is specified in dfs.exclude and the
cluster is started. Maybe this should never be done. But if it IS done. dfsadmin -report shows
this

Live datanodes:
....
....
....

Dead datanodes:
report: String index out of range: -1

Is this fine?

The output is as expected when the cluster is started without any dfs.exclude entries, and
then one added. (it shows it as a dead node with Decommission Status: Decommissioned) So that
is good.

> Show dead nodes information in dfsadmin -report
> -----------------------------------------------
>
>                 Key: HADOOP-5094
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5094
>             Project: Hadoop Common
>          Issue Type: Improvement
>    Affects Versions: 0.18.2
>            Reporter: Jim Huang
>            Assignee: Jakob Homan
>            Priority: Minor
>             Fix For: 0.21.0
>
>         Attachments: DfsAdminDeadNode_testCases.html, DfsAdminDeadNode_testCases.html,
HADOOP-5094.patch, HADOOP-5094.patch, HADOOP-5094.patch
>
>
> As part of operations responsibility to bring back dead nodes, it will be good to have
a quick way to obtain a list of dead data nodes.  
> The current way is to scrape the namenode web UI page and parse that information, but
this creates load on the namenode.   
> In search of a less costly way, I noticed dfsadmin -report only reports data nodes with
State: "In Service" and "Decommission in progress" get listed.
> Asking for a cheap way to obtain a list of dead nodes.  
> In addition, can the following requests be reviewed for additional enhancement and changes
to dfsadmin -report.
> - Consistent formatting output in "Remaining raw bytes:" for the data nodes should have
a space between the exact value and the parenthesized value.
> Sample:
> Total raw bytes: 3842232975360 (3.49 TB)
> Remaining raw bytes: 146090593065(136.06 GB)
> Used raw bytes: 3240864964620 (2.95 TB)
> - Include the running version of Hadoop.  
> - What is the meaning of "Total effective bytes"?
> - Display the hostname instead of the IP address for the data node (toggle option?)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message