hadoop-hdfs-dev mailing list archives

From "Wei-Chiu Chuang (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HDFS-13393) Improve OOM logging
Date Tue, 03 Apr 2018 21:36:00 GMT
Wei-Chiu Chuang created HDFS-13393:
--------------------------------------

             Summary: Improve OOM logging
                 Key: HDFS-13393
                 URL: https://issues.apache.org/jira/browse/HDFS-13393
             Project: Hadoop HDFS
          Issue Type: Improvement
          Components: balancer & mover, datanode
            Reporter: Wei-Chiu Chuang


It is not uncommon to find a "java.lang.OutOfMemoryError: unable to create new native thread"
error in an HDFS cluster. Most often this happens when a DataNode is creating DataXceiver threads,
or when the balancer creates threads for moving blocks around.

In most cases, the "OOM" is a symptom of the number of threads reaching a system limit, rather
than of the process actually running out of memory.

How about catching the OOM and, if it is due to "unable to create new native thread", printing
a more helpful message such as "bump your ulimit" or "take a jstack of the process"?
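
A rough sketch of what this could look like, not a patch against the actual DataNode or
balancer code. The class and method names (OomLoggingSketch, isThreadExhaustion, describe,
startWorker) are hypothetical, and the check assumes the error message contains the string
quoted above; the exact text may differ between JVM versions.

```java
class OomLoggingSketch {
    // Message fragment quoted in this issue; assumed, may vary by JVM version.
    static final String NATIVE_THREAD_MSG = "unable to create new native thread";

    // True if the OOM looks like a thread-limit problem rather than heap exhaustion.
    static boolean isThreadExhaustion(OutOfMemoryError e) {
        String msg = e.getMessage();
        return msg != null && msg.contains(NATIVE_THREAD_MSG);
    }

    // Builds the more helpful log line suggested in this issue.
    static String describe(OutOfMemoryError e) {
        if (isThreadExhaustion(e)) {
            return "Failed to create a new thread (" + e.getMessage() + "). "
                + "This usually means a thread limit was reached, not that memory ran out. "
                + "Consider bumping the process ulimit and taking a jstack of the process.";
        }
        return "OutOfMemoryError: " + e.getMessage();
    }

    // Hypothetical helper: starts a thread and logs a hint if start() fails with an OOM.
    static Thread startWorker(Runnable task, String name) {
        Thread t = new Thread(task, name);
        try {
            t.start();
        } catch (OutOfMemoryError e) {
            System.err.println(describe(e));
            throw e; // rethrow so existing error handling still sees the failure
        }
        return t;
    }

    public static void main(String[] args) throws InterruptedException {
        Thread t = startWorker(() -> System.out.println("worker ran"), "sketch-worker");
        t.join();
        System.out.println(describe(new OutOfMemoryError(NATIVE_THREAD_MSG)));
        System.out.println(describe(new OutOfMemoryError("Java heap space")));
    }
}
```

The error is rethrown after logging so callers behave as before; only the log output changes.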



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-help@hadoop.apache.org