Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (nike.apache.org: domain of michael_segel@hotmail.com
 designates 65.55.111.105 as permitted sender)
Message-ID: <BLU0-SMTP58FA806967070D4CBA9FBA8F6B0@phx.gbl>
From: Michael Segel <michael_segel@hotmail.com>
Content-Type: multipart/alternative;
	boundary="Apple-Mail=_3A928F0F-C7A9-4660-BAE4-A3B2C624194F"
MIME-Version: 1.0 (Mac OS X Mail 6.2 \(1499\))
Subject: Re: One mapper/reducer runs on a single JVM
Date: Tue, 6 Nov 2012 10:27:45 -0600
References: 
 <CAK_MoSs96iMKYND4jnKS90KwaMjHiYKMbu5zV3CX-A5NjWT_mQ@mail.gmail.com>
 <BLU0-SMTP340F9366F083B892EFF449C8F6B0@phx.gbl>
 <CAK_MoSunTGKFz8vtjjsO4tY2ia=bSX1fZZZ2jTmYgOF5cvWODA@mail.gmail.com>
To: user@hadoop.apache.org
In-Reply-To: 
 <CAK_MoSunTGKFz8vtjjsO4tY2ia=bSX1fZZZ2jTmYgOF5cvWODA@mail.gmail.com>

--Apple-Mail=_3A928F0F-C7A9-4660-BAE4-A3B2C624194F
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain; charset="iso-8859-1"

If you exceed the amount of physical memory available, memory pages will =
be written to disk in a temp space. The act of 'swapping' the memory =
pages from memory to disk and back again is known as 'swap'.=20

HBase is highly sensitive to the latency of swapping memory in and out =
of physical memory to disk. You need to avoid swap when running HBase.  =
It will crash a region server and ultimately you can end up with a =
cascading failure and HBase will go down.=20

HTH

-Mike

On Nov 5, 2012, at 11:06 PM, Lin Ma <linlma@gmail.com> wrote:

> Thanks Michael,
>=20
> "If you are running just Hadoop, you could have a little swap. Running =
HBase, fuggit about it." -- could you give a bit more information about =
what do you mean swap and why forget for HBase?
>=20
> regards,
> Lin
>=20
>=20
> On Tue, Nov 6, 2012 at 12:46 PM, Michael Segel =
<michael_segel@hotmail.com> wrote:
> Mappers and Reducers are separate JVM processes.
> And yes you need to take in to account the amount of memory the =
machine(s) when you configure the number of slots.
>=20
> If you are running just Hadoop, you could have a little swap. Running =
HBase, fuggit about it.
>=20
>=20
> On Nov 5, 2012, at 7:12 PM, Lin Ma <linlma@gmail.com> wrote:
>=20
> > Hello Hadoop experts,
> >
> > I have a question in my mind for a long time. Supposing I am =
developing M-R program, and it is Java based (Java UDF, implements =
mapper or reducer interface). My question is, in this scenario, whether =
a mapper or a reducer is a separate JVM process? E.g. supposing on a =
machine, there are 4 mappers, they are 4 individual processes? I am also =
wondering whether the processes on a single machine will impact each =
other when each JVM wants to get more memory to run faster?
> >
> > thanks in advance,
> > Lin
> >
> >
>=20
>=20


--Apple-Mail=_3A928F0F-C7A9-4660-BAE4-A3B2C624194F
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html; charset="iso-8859-1"

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html =
charset=3Diso-8859-1"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">If =
you exceed the amount of physical memory available, memory pages will be =
written to disk in a temp space. The act of 'swapping' the memory pages =
from memory to disk and back again is known as =
'swap'.&nbsp;<div><br></div><div>HBase is highly sensitive to the =
latency of swapping memory in and out of physical memory to disk. You =
need to avoid swap when running HBase. &nbsp;It will crash a region =
server and ultimately you can end up with a cascading failure and HBase =
will go =
down.&nbsp;</div><div><br></div><div>HTH</div><div><br></div><div>-Mike</d=
iv><div><br><div><div>On Nov 5, 2012, at 11:06 PM, Lin Ma &lt;<a =
href=3D"mailto:linlma@gmail.com">linlma@gmail.com</a>&gt; =
wrote:</div><br class=3D"Apple-interchange-newline"><blockquote =
type=3D"cite">Thanks Michael,<br><br>"If you are running just Hadoop, =
you could have a little swap. Running HBase, fuggit about it." -- could =
you give a bit more information about what do you mean swap and why =
forget for HBase?<br>
<br>regards,<br>Lin<br><div class=3D"gmail_extra"><br><br><div =
class=3D"gmail_quote">On Tue, Nov 6, 2012 at 12:46 PM, Michael Segel =
<span dir=3D"ltr">&lt;<a href=3D"mailto:michael_segel@hotmail.com" =
target=3D"_blank">michael_segel@hotmail.com</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex">Mappers and Reducers =
are separate JVM processes.<br>
And yes you need to take in to account the amount of memory the =
machine(s) when you configure the number of slots.<br>
<br>
If you are running just Hadoop, you could have a little swap. Running =
HBase, fuggit about it.<br>
<div class=3D"HOEnZb"><div class=3D"h5"><br>
<br>
On Nov 5, 2012, at 7:12 PM, Lin Ma &lt;<a =
href=3D"mailto:linlma@gmail.com">linlma@gmail.com</a>&gt; wrote:<br>
<br>
&gt; Hello Hadoop experts,<br>
&gt;<br>
&gt; I have a question in my mind for a long time. Supposing I am =
developing M-R program, and it is Java based (Java UDF, implements =
mapper or reducer interface). My question is, in this scenario, whether =
a mapper or a reducer is a separate JVM process? E.g. supposing on a =
machine, there are 4 mappers, they are 4 individual processes? I am also =
wondering whether the processes on a single machine will impact each =
other when each JVM wants to get more memory to run faster?<br>

&gt;<br>
&gt; thanks in advance,<br>
&gt; Lin<br>
&gt;<br>
&gt;<br>
<br>
</div></div></blockquote></div><br></div>
</blockquote></div><br></div></body></html>=

--Apple-Mail=_3A928F0F-C7A9-4660-BAE4-A3B2C624194F--