ambari-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jayush Luniya (JIRA)" <j...@apache.org>
Subject [jira] [Created] (AMBARI-13065) RU: Core Slaves restart schedule is extremely slow on very large cluster
Date Thu, 10 Sep 2015 19:34:46 GMT
Jayush Luniya created AMBARI-13065:
--------------------------------------

             Summary: RU: Core Slaves restart schedule is extremely slow on very large cluster
                 Key: AMBARI-13065
                 URL: https://issues.apache.org/jira/browse/AMBARI-13065
             Project: Ambari
          Issue Type: Bug
          Components: ambari-server
    Affects Versions: 2.1.2
            Reporter: Jayush Luniya
            Assignee: Jayush Luniya
            Priority: Blocker
             Fix For: 2.1.2


Performed RU on 1200 node cluster and the progress of 'Core Slaves' restarts is extremely
slow - In 3 hours it restarted only 22 components (screenshot attached). At this rate it will
take weeks for RU to complete.

It we look into the agent log where RU core-slaves finished, we see that sequential commands
are sent 8 minutes apart - which is very slow. The commands themselves execute in under a
minute.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message