ambari-dev mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (AMBARI-13194) Alert definition when DataNode data dirs become unmounted
Date Wed, 23 Sep 2015 23:52:04 GMT

    [ https://issues.apache.org/jira/browse/AMBARI-13194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14905541#comment-14905541 ]

Hudson commented on AMBARI-13194:
---------------------------------

SUCCESS: Integrated in Ambari-branch-2.1 #589 (See [https://builds.apache.org/job/Ambari-branch-2.1/589/])
AMBARI-13194. Alert definition when DataNode data dirs become unmounted (alejandro) (afernandez:
http://git-wip-us.apache.org/repos/asf?p=ambari.git&a=commit&h=705f0d22abd74cae2151b5629b54803aa600e491)
* ambari-server/src/test/python/stacks/2.0.6/HDFS/test_alert_datanode_unmounted_data_dir.py
* ambari-server/src/main/resources/common-services/HDFS/2.1.0.2.0/alerts.json
* ambari-agent/src/test/python/resource_management/TestDatanodeHelper.py
* ambari-server/src/main/resources/common-services/HDFS/2.1.0.2.0/package/alerts/alert_datanode_unmounted_data_dir.py
* ambari-common/src/main/python/resource_management/libraries/functions/dfs_datanode_helper.py
* ambari-common/src/main/python/resource_management/core/providers/system.py


> Alert definition when DataNode data dirs become unmounted
> ---------------------------------------------------------
>
>                 Key: AMBARI-13194
>                 URL: https://issues.apache.org/jira/browse/AMBARI-13194
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-agent, ambari-server
>    Affects Versions: 2.1.2
>            Reporter: Alejandro Fernandez
>            Assignee: Alejandro Fernandez
>             Fix For: 2.2.0, 2.1.2
>
>         Attachments: AMBARI-13194.patch
>
>
> Ambari uses the dfs.datanode.data.dir.mount.file property in HDFS, whose value is typically
> /etc/hadoop/conf/dfs_data_dir_mount.hist, to track the mount point of each data dir.
> E.g.,
> {code}
> /hadoop01/data,/device1
> /hadoop02/data,/device2
> /hadoop03/data,/     # this one is on root, the others are all on mount points.
> {code}
> Whenever a drive becomes unmounted, Ambari detects that the data dir was previously on a mount
> and will not create that data dir; HDFS can still tolerate the failure if
> dfs.datanode.failed.volumes.tolerated is greater than 0.
> However, if the /etc/hadoop/conf/dfs_data_dir_mount.hist file is deleted, Ambari no longer
> has this knowledge and will create the data dir (even if it's on the root partition).
> To improve tracking, create an alert definition that checks the following:
> * warning status if the /etc/hadoop/conf/dfs_data_dir_mount.hist file is deleted
> * critical status if at least one of the data dirs is mounted on the root partition
>   and at least one data dir is on a mount
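
For illustration only, the two checks described above could be sketched in Python roughly as follows. This is not the code from the committed alert_datanode_unmounted_data_dir.py; the function names (parse_mount_history, evaluate_data_dirs), the status labels, and the current_mounts argument are all assumptions made for this sketch.

```python
import os

# Hypothetical status labels; the names are assumptions for this sketch.
OK, WARNING, CRITICAL = "OK", "WARNING", "CRITICAL"


def parse_mount_history(text):
    """Parse 'data_dir,mount_point' lines into a dict, ignoring '#' comments."""
    mounts = {}
    for raw_line in text.splitlines():
        # Drop any trailing comment, then surrounding whitespace.
        line = raw_line.split("#", 1)[0].strip()
        if not line or "," not in line:
            continue
        data_dir, mount_point = (part.strip() for part in line.split(",", 1))
        mounts[data_dir] = mount_point
    return mounts


def evaluate_data_dirs(history_path, current_mounts):
    """Return (status, message) following the two rules in the description.

    current_mounts maps each data dir to the mount point it is on right now;
    how that mapping is obtained is outside the scope of this sketch.
    """
    # Rule 1: warning if the mount history file has been deleted.
    if not os.path.isfile(history_path):
        return WARNING, "%s is missing" % history_path
    # Rule 2: critical if data dirs are split between "/" and real mounts.
    on_root = sorted(d for d, m in current_mounts.items() if m == "/")
    on_mount = [d for d, m in current_mounts.items() if m != "/"]
    if on_root and on_mount:
        return CRITICAL, "data dirs on root partition: %s" % ", ".join(on_root)
    return OK, "data dir mounts look consistent"
```

With the sample file contents shown in the description, parse_mount_history would map /hadoop03/data to "/" and the other two dirs to their devices; passing that mapping as current_mounts alongside an existing history file would yield the critical status.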



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
