Subject: Re: Is there any way to partially process HDFS edits?
From: Jens Scheidtmann <jens.scheidtmann@gmail.com>
To: user@hadoop.apache.org
Date: Sun, 29 Sep 2013 11:09:12 +0200

Tom,

I would file a JIRA, if I were you and my Hadoop version were recent
enough. Should be pretty easy to reproduce.

Jens

On Thursday, 26 September 2013, Tom Brown wrote:

> They were created and deleted in quick succession. I thought that meant
> the edits for both the create and delete would be logically next to each
> other in the file, allowing it to release the memory almost as soon as it
> had been allocated.
>
> In any case, after finding a VM host that could give me more RAM, I was
> able to get the namenode started. The process used 25GB at its peak.
>
> Thanks for your help!
>
>
> On Thu, Sep 26, 2013 at 11:07 AM, Harsh J <harsh@cloudera.com> wrote:
>
> Tom,
>
> That is valuable info. When we "replay" edits, we would be creating
> and then deleting those files - so memory would grow in between until
> the delete events begin appearing in the edit log segment.
>
> On Thu, Sep 26, 2013 at 10:07 PM, Tom Brown <tombrown52@gmail.com> wrote:
> > A simple estimate puts the total number of blocks somewhere around
> > 500,000. Due to an HBase bug (HBASE-9648), there were approximately
> > 50,000,000 files that were created and quickly deleted (about 10/sec
> > for 6 weeks) in the cluster, and that activity is what is contained
> > in the edits.
> >
> > Since those files don't exist (quickly created and deleted), shouldn't
> > they be inconsequential to the memory requirements of the namenode as
> > it starts up?
> >
> > --Tom
> >
> >
> > On Thu, Sep 26, 2013 at 10:25 AM, Nitin Pawar <nitinpawar432@gmail.com>
> > wrote:
> >>
> >> Can you share how many blocks your cluster has? How many directories?
> >> How many files?
> >>
> >> There is a JIRA, https://issues.apache.org/jira/browse/HADOOP-1687,
> >> which explains how much RAM will be used for your namenode. It's
> >> pretty old by Hadoop version, but it's a good starting point.
> >>
> >> According to Cloudera's blog, "A good rule of thumb is to assume 1GB
> >> of NameNode memory for every 1 million blocks stored in the
> >> distributed file system":
> >> http://blog.cloudera.com/blog/2013/08/how-to-select-the-right-hardware-for-your-new-hadoop-cluster/
> >>
> >>
> >> On Thu, Sep 26, 2013 at 9:26 PM, Tom Brown <tombrown52@gmail.com> wrote:
> >>>
> >>> It ran again for about 15 hours before dying again. I'm seeing what
> >>> extra RAM resources we can throw at this VM (maybe up to 32GB), but
> >>> until then I'm trying to figure out if I'm hitting some strange bug.
> >>>
> >>> When the edits were originally made (over the course of 6 weeks), the
> >>> namenode only had 512MB and was able to contain the filesystem
> >>> completely in memory. I don't understand why it's running out of
> >>> memory. If 512MB was enough while the edits were first made,
> >>> shouldn't it be enough to process them again?
> >>>
> >>> --Tom
> >>>
> >>>
> >>> On Thu, Sep 26, 2013 at 6:05 AM, Harsh J <harsh@cloudera.com> wrote:
> >>>>
> >>>> Hi Tom,
> >>>>
> >>>> The edits are processed sequentially, and aren't all held in memory.
> >>>> Right now there's no mid-way checkpoint while they are loaded that
> >>>> would let it resume with only the remaining work if interrupted.
> >>>> Normally this is not a problem in deployments, given that an SNN or
> >>>> SBN periodically checkpoints the image and keeps the edits
> >>>> collection small.
> >>>>
> >>>> If your NameNode is running out of memory _applying_ the edits, then
> >>>> the cause is not the edits but a growing namespace. You most likely
> >>>> have more files now than before, and that's going to take up
> >>>> permanent memory from the NameNode heap.
> >>>>
> >>>> On Thu, Sep 26, 2013 at 3:00 AM, Tom Brown <tombrown52@gmail.com> wrote:
> >>>> > Unfortunately, I cannot give it that much RAM. The machine has 4GB
> >>>> > total (though it could be expanded somewhat-- it's a VM).
> >>>> >
> >>>> > Though if each edit is processed sequentially (in a

I=A0would file a jira, if I were you a= nd my Hadoop Version was=A0recent enough.=A0=A0Should be pretty easy to rep= roduce.

Jens

Am Donnerstag, 26. September 2013 sc= hrieb Tom Brown :
They were created and delet= ed in quick succession. I thought that meant the edits for both the create = and delete would be logically next to each other in the file allowing it to= release the memory almost as soon as it had been allocated.

In any case, after finding a VM host that could give me more= RAM, I was able to get the namenode started. The process used 25GB at it&#= 39;s peak.

Thanks for your help!


On Thu, Sep 26, 2013 at 11:07 AM, Harsh J &l= t;harsh@cloudera.com> wrote:
Tom,

That is valuable info. When we "replay" edits, we would be creati= ng
and then deleting those files - so memory would grow in between until
the delete events begin appearing in the edit log segment.

On Thu, Sep 26, 2013 at 10:07 PM, Tom Brown <tombrown52@gmail.com= > wrote:
> A simple estimate puts the total number of blocks somewhere around 500= ,000.
> Due to an HBase bug (HBASE-9648), there were approximately 50,000,000 = files
> that were created and quickly deleted (about 10/sec for 6 weeks) in th= e
> cluster, and that activity is what is contained in the edits.
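On the subject line's question of working with the edits directly: the
offline edits viewer that ships with Hadoop (hdfs oev) can dump an edits
segment to XML without starting the NameNode, which makes it possible to
confirm the create/delete churn described above before attempting a
replay. The sketch below is only illustrative: the file names are
placeholders, and the RECORD/OPCODE element names assume the stock XML
output of "hdfs oev -p xml" on a 2.x release.

    # Rough sketch: count operation types in an edits segment dumped with
    # the offline edits viewer, e.g.
    #   hdfs oev -i <edits segment> -o edits.xml -p xml
    # The RECORD/OPCODE element names assume the stock XML output; the
    # path "edits.xml" is a placeholder.
    import xml.etree.ElementTree as ET
    from collections import Counter

    def count_opcodes(xml_path):
        counts = Counter()
        # iterparse keeps memory flat even for very large dumps
        for _, elem in ET.iterparse(xml_path, events=("end",)):
            if elem.tag == "RECORD":
                op = elem.findtext("OPCODE")
                if op:
                    counts[op] += 1
                elem.clear()  # release the subtree once counted
        return counts

    if __name__ == "__main__":
        for op, n in count_opcodes("edits.xml").most_common():
            print(op, n)
        # A create/delete storm like the one above should show roughly
        # matching OP_ADD/OP_CLOSE and OP_DELETE counts.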
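Putting that rule of thumb together with Harsh's description of replay
roughly accounts for the 25GB peak reported above. A back-of-the-envelope
sketch, using only figures quoted in this thread (the 1GB-per-million-objects
ratio is the rule of thumb, not a measured per-object cost):

    # Back-of-the-envelope NameNode heap estimate during edit replay.
    # All inputs are figures quoted in this thread; the GB-per-million
    # ratio is the Cloudera rule of thumb, not a measured object size.
    GB_PER_MILLION_OBJECTS = 1.0

    steady_state_blocks = 500_000           # Tom's block estimate
    churn_rate_per_sec = 10                 # HBASE-9648: ~10 creates/sec
    churn_duration_sec = 6 * 7 * 24 * 3600  # ~6 weeks

    transient_files = churn_rate_per_sec * churn_duration_sec
    print("files created then deleted: ~%.0f million" % (transient_files / 1e6))
    # ~36 million, the same order of magnitude as the ~50 million above

    # During replay, a created file stays on the heap until its delete
    # record is reached, so in the worst case most of them are live at once.
    peak_objects = steady_state_blocks + transient_files
    print("worst-case peak heap: ~%.0f GB"
          % (peak_objects / 1e6 * GB_PER_MILLION_OBJECTS))
    # A few tens of GB, consistent with the ~25GB peak reported above.

If more RAM does get added to the VM, it typically has to be handed to the
NameNode JVM as well, for example via a larger -Xmx in HADOOP_NAMENODE_OPTS
in hadoop-env.sh, not just to the host.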
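Once the NameNode does come up, Harsh's point about checkpointing can also
be acted on manually: forcing a checkpoint folds the accumulated edits into
a fresh fsimage so they never have to be replayed in bulk again. A minimal
sketch of that procedure, assuming the standard hdfs dfsadmin
safemode/saveNamespace subcommands (hadoop dfsadmin on older releases) and
HDFS superuser privileges; it simply wraps the CLI:

    # Minimal sketch: force a checkpoint so accumulated edits are folded
    # into a new fsimage. Assumes the standard hdfs dfsadmin subcommands
    # (safemode enter/leave, saveNamespace); run as the HDFS superuser.
    import subprocess

    def run(*args):
        print("+", " ".join(args))
        subprocess.run(args, check=True)  # raise if the command fails

    def force_checkpoint():
        run("hdfs", "dfsadmin", "-safemode", "enter")  # saveNamespace requires safemode
        try:
            run("hdfs", "dfsadmin", "-saveNamespace")  # write a fresh fsimage
        finally:
            run("hdfs", "dfsadmin", "-safemode", "leave")

    if __name__ == "__main__":
        force_checkpoint()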