ambari-dev mailing list archives

From "Alejandro Fernandez" <afernan...@hortonworks.com>
Subject Review Request 34920: Restarting HistoryServer fails during RU because NameNode is in safemode
Date Tue, 02 Jun 2015 03:20:53 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/34920/
-----------------------------------------------------------

Review request for Ambari, Andrew Onischuk, Dmitro Lisnichenko, Jonathan Hurley, and Nate
Cole.


Bugs: AMBARI-11605
    https://issues.apache.org/jira/browse/AMBARI-11605


Repository: ambari


Description
-------

When restarting HistoryServer for the first time during the Core Masters rolling upgrade,
the restart fails because one of the NameNodes is still in safemode.

It turns out that, now that the HDFS commands run faster, the standby NameNode can still be
in safemode by the time the HistoryServer is restarted.
For this reason, we must wait for both NameNodes to come out of safemode before proceeding
to any other services or Service Checks.
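
A minimal sketch of the wait described above, not the code in this patch. It assumes Python 3
and that each NameNode can be queried with `hdfs dfsadmin -fs hdfs://<address> -safemode get`,
which reports "Safe mode is OFF" once the NameNode has left safemode. The function names, the
retry count, and the delay are hypothetical and chosen only for illustration.

```python
import subprocess
import time


def is_safemode_off(namenode_address):
    """Ask a single NameNode for its safemode state through the hdfs CLI.

    namenode_address is assumed to be a "host:port" RPC address.
    """
    cmd = ["hdfs", "dfsadmin", "-fs", "hdfs://" + namenode_address,
           "-safemode", "get"]
    proc = subprocess.run(cmd, stdout=subprocess.PIPE,
                          stderr=subprocess.STDOUT, universal_newlines=True)
    return proc.returncode == 0 and "Safe mode is OFF" in proc.stdout


def wait_for_safemode_off(namenodes, check=is_safemode_off,
                          tries=40, delay=10, sleep=time.sleep):
    """Block until every NameNode in `namenodes` reports safemode OFF.

    Raises RuntimeError if any NameNode is still in safemode after `tries`
    polling rounds. `check` and `sleep` are injectable so the loop can be
    exercised without a cluster.
    """
    pending = list(namenodes)
    for attempt in range(tries):
        # Re-check only the NameNodes that have not yet left safemode.
        pending = [nn for nn in pending if not check(nn)]
        if not pending:
            return
        if attempt < tries - 1:
            sleep(delay)
    raise RuntimeError("NameNodes still in safemode: " + ", ".join(pending))
```

Polling both the active and the standby NameNode in one loop means that later restarts and
Service Checks begin only after the whole HA pair has left safemode.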


Diffs
-----

  ambari-server/src/main/resources/common-services/HDFS/2.1.0.2.0/package/scripts/hdfs_namenode.py 5e824d0 
  ambari-server/src/main/resources/common-services/HDFS/2.1.0.2.0/package/scripts/namenode.py 864961e 
  ambari-server/src/main/resources/common-services/HDFS/2.1.0.2.0/package/scripts/utils.py 38270e8 
  ambari-server/src/main/resources/common-services/HIVE/0.12.0.2.0/package/scripts/params_linux.py 6e12dd0 
  ambari-server/src/test/python/stacks/2.0.6/HDFS/test_namenode.py b7126fd 

Diff: https://reviews.apache.org/r/34920/diff/


Testing
-------

Deployed a cluster and copied the patched files, then enabled NameNode HA, and performed a
successful RU.

----------------------------------------------------------------------
Total run:744
Total errors:0
Total failures:0
OK


Thanks,

Alejandro Fernandez

