hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Suresh Srinivas (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HDFS-5140) Too many safemode monitor threads being created in the standby namenode causing it to fail with out of memory error
Date Fri, 30 Aug 2013 06:07:53 GMT

    [ https://issues.apache.org/jira/browse/HDFS-5140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13754414#comment-13754414
] 

Suresh Srinivas commented on HDFS-5140:
---------------------------------------

+1 for the patch.

The current code in SafeModeInfo#canLeave() checks needEnter() again. This is bothersome,
since in case of secondary we could flip flop about leaving and entering safemode. The whole
safemode seems to have become complicated in case of secondary. We should perhaps create a
jira about at least not checking needEnter() again.
                
> Too many safemode monitor threads being created in the standby namenode causing it to
fail with out of memory error
> -------------------------------------------------------------------------------------------------------------------
>
>                 Key: HDFS-5140
>                 URL: https://issues.apache.org/jira/browse/HDFS-5140
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: ha
>    Affects Versions: 2.1.0-beta
>            Reporter: Arpit Gupta
>            Assignee: Jing Zhao
>            Priority: Blocker
>         Attachments: HDFS-5140.001.patch, HDFS-5140.002.patch
>
>
> While running namenode load generator with 100 threads for 10 mins namenode was being
failed over ever 2 mins.
> The standby namenode shut itself down as it ran out of memory and was not able to create
another thread.
> When we searched for 'Safe mode extension entered' in the standby log it was present
55000+ times

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message