Mailing-List: contact hdfs-issues-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: hdfs-issues@hadoop.apache.org
Date: Thu, 8 Jan 2015 07:22:34 +0000 (UTC)
From: "JichengSong (JIRA)" <jira@apache.org>
To: hdfs-issues@hadoop.apache.org
Message-ID: <JIRA.12765553.1420697461000.35857.1420701754403@Atlassian.JIRA>
In-Reply-To: <JIRA.12765553.1420697461000@Atlassian.JIRA>
References: <JIRA.12765553.1420697461000@Atlassian.JIRA>
 <JIRA.12765553.1420697461253@arcas>
Subject: [jira] [Updated] (HDFS-7592) A bug in BlocksMap that  cause
 NameNode  memory leak.
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable


     [ https://issues.apache.org/jira/browse/HDFS-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

JichengSong updated HDFS-7592:
------------------------------
    Labels: BlocksMap leak memory  (was: BlocksMap MemoryLeak)

> A bug in BlocksMap that causes a NameNode memory leak.
> ------------------------------------------------------
>
>                 Key: HDFS-7592
>                 URL: https://issues.apache.org/jira/browse/HDFS-7592
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: namenode
>    Affects Versions: 0.21.0
>         Environment: HDFS-0.21.0
>            Reporter: JichengSong
>            Assignee: JichengSong
>              Labels: BlocksMap, leak, memory
>
> In our HDFS production environment, the NameNode runs full GCs (FGC) frequently after running for 2 months, and we have to restart it manually.
> We dumped the NameNode's heap to collect per-class object statistics.
> Before restarting NameNode:
>     num   #instances      #bytes  class name
>     ----------------------------------------------
>       1:    59262275  3613989480  [Ljava.lang.Object;
>     ...
>      10:     8549361   615553992  org.apache.hadoop.hdfs.server.namenode.BlockInfoUnderConstruction
>      11:     5941511   427788792  org.apache.hadoop.hdfs.server.namenode.INodeFileUnderConstruction
> After restarting NameNode:
>     num   #instances      #bytes  class name
>     ----------------------------------------------
>       1:    44188391  2934099616  [Ljava.lang.Object;
>     ...
>      23:      721763    51966936  org.apache.hadoop.hdfs.server.namenode.BlockInfoUnderConstruction
>      24:      620028    44642016  org.apache.hadoop.hdfs.server.namenode.INodeFileUnderConstruction
> We found that the number of BlockInfoUnderConstruction instances was abnormally large before restarting the NameNode.
> As we know, BlockInfoUnderConstruction keeps block state only while a file is being written. But the write load on
> our cluster is far less than a million per second, which is too low to explain this many instances. We think there is a memory leak in the NameNode.
> We fixed the bug with the following patch.
> --- src/java/org/apache/hadoop/hdfs/server/namenode/BlocksMap.java      (revision 1640066)
> +++ src/java/org/apache/hadoop/hdfs/server/namenode/BlocksMap.java      (working copy)
> @@ -205,6 +205,8 @@
>        DatanodeDescriptor dn = currentBlock.getDatanode(idx);
>        dn.replaceBlock(currentBlock, newBlock);
>      }
> +    // change to fix bug about memory leak of NameNode
> +    map.remove(newBlock);
>      // replace block in the map itself
>      map.put(newBlock, newBlock);
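
One plausible reading of why the extra map.remove(newBlock) helps, sketched under assumptions: the report does not show how the map is declared, so this assumes it is a java.util.HashMap whose keys compare equal by block identity rather than by object identity. With java.util.HashMap, put() on a key that is already present replaces only the value and keeps the original key object, so the old under-construction object would stay reachable as the key. The Block class, its fields, and replace() below are hypothetical stand-ins, not the HDFS classes.

```java
import java.util.HashMap;
import java.util.Map;

class StaleKeyDemo {
    /** Hypothetical stand-in for a block: equality depends only on the id. */
    static final class Block {
        final long id;
        final String state;

        Block(long id, String state) {
            this.id = id;
            this.state = state;
        }

        @Override
        public boolean equals(Object o) {
            return o instanceof Block && ((Block) o).id == id;
        }

        @Override
        public int hashCode() {
            return Long.hashCode(id);
        }
    }

    /**
     * Replaces an "under-construction" block with an equal "complete" block
     * and reports which objects the map ends up holding as key and as value.
     */
    static String replace(boolean removeFirst) {
        Map<Block, Block> map = new HashMap<>();
        Block oldBlock = new Block(1L, "under-construction");
        Block newBlock = new Block(1L, "complete");
        map.put(oldBlock, oldBlock);

        if (removeFirst) {
            // Drops the whole entry, including the old key object.
            map.remove(newBlock);
        }
        // Without the remove above, only the value is swapped here.
        map.put(newBlock, newBlock);

        Map.Entry<Block, Block> entry = map.entrySet().iterator().next();
        return "key=" + entry.getKey().state + ", value=" + entry.getValue().state;
    }

    public static void main(String[] args) {
        System.out.println("put only:        " + replace(false));
        System.out.println("remove then put: " + replace(true));
    }
}
```

With put alone, the entry's key is still the old object while the value is the new one; with remove followed by put, both key and value are the new object. Whether this matches the actual leak depends on the assumption about the map type stated above.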


--
This message was sent by Atlassian JIRA
(v6.3.4#6332)