Subject: Re: distributed cache
From: Lin Ma <linlma@gmail.com>
To: Harsh J <harsh@cloudera.com>, user@hadoop.apache.org
Date: Wed, 26 Dec 2012 18:43:46 +0800

Thanks Harsh - so multiple concurrent reads are generally faster?

regards,
Lin

On Wed, Dec 26, 2012 at 6:21 PM, Harsh J wrote:
> There is no limitation in HDFS that limits reads of a block to a
> single client at a time (there is no reason to do so), so downloads
> can be as concurrent as possible.
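To make the point concrete, here is a minimal sketch of several clients
streaming the same HDFS file in parallel; nothing in HDFS serializes
them. It assumes a cluster configuration on the classpath, and
/cache/lookup.dat is a hypothetical path used only for illustration:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class ConcurrentRead {
    public static void main(String[] args) throws Exception {
        final Configuration conf = new Configuration();
        final Path file = new Path("/cache/lookup.dat"); // hypothetical
        Thread[] readers = new Thread[4];
        for (int i = 0; i < readers.length; i++) {
            readers[i] = new Thread(new Runnable() {
                public void run() {
                    try {
                        // Each thread opens and streams the full file;
                        // HDFS serves all of them at once.
                        FileSystem fs = FileSystem.get(conf);
                        FSDataInputStream in = fs.open(file);
                        byte[] buf = new byte[64 * 1024];
                        long total = 0;
                        int n;
                        while ((n = in.read(buf)) != -1) {
                            total += n;
                        }
                        in.close();
                        System.out.println("read " + total + " bytes");
                    } catch (Exception e) {
                        e.printStackTrace();
                    }
                }
            });
            readers[i].start();
        }
        for (Thread t : readers) {
            t.join();
        }
    }
}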
> On Wed, Dec 26, 2012 at 3:41 PM, Lin Ma wrote:
> > Thanks Harsh,
> >
> > Supposing the DistributedCache is uploaded by the client: in the
> > Hadoop design, can each replica only serve one download session at a
> > time (a download from a mapper or reducer that requires the
> > DistributedCache) until that download completes, or can it serve
> > multiple concurrent download sessions (downloads from multiple
> > mappers or reducers that require the DistributedCache)?
> >
> > regards,
> > Lin
> >
> > On Wed, Dec 26, 2012 at 4:51 PM, Harsh J wrote:
> >>
> >> Hi Lin,
> >>
> >> DistributedCache files are stored onto HDFS by the client first.
> >> The TaskTrackers then download and localize them. Therefore, as
> >> with any other file on HDFS, "downloads" can be efficiently
> >> parallel with higher replication.
> >>
> >> The point of having higher replication for these files is also
> >> tied to the concept of racks in a cluster - you want more replicas
> >> spread across racks so that at task boot-up the downloads happen
> >> with rack locality.
> >>
> >> On Sat, Dec 22, 2012 at 6:54 PM, Lin Ma wrote:
> >> > Hi Kai,
> >> >
> >> > Smart answer! :-)
> >> >
> >> > Your assumption is that one distributed cache replica can only
> >> > serve one download session per TaskTracker node (which is why
> >> > you get concurrency n/r). The question is: why can't one
> >> > distributed cache replica serve multiple concurrent download
> >> > sessions? For example, supposing a TaskTracker takes elapsed
> >> > time t to download a file from a specific distributed cache
> >> > replica, couldn't two TaskTrackers download from that replica in
> >> > parallel in elapsed time t as well, or 1.5t, which is faster
> >> > than the sequential download time 2t you mentioned? "In total,
> >> > r+n/r concurrent operations. If you optimize r depending on n,
> >> > SQRT(n) is the optimal replication level." - how do you get
> >> > SQRT(n) when minimizing r+n/r? I'd appreciate it if you could
> >> > point me to more details.
> >> >
> >> > regards,
> >> > Lin
> >> >
> >> > On Sat, Dec 22, 2012 at 8:51 PM, Kai Voigt wrote:
> >> >> Hi,
> >> >>
> >> >> Simple math. Assume you have n TaskTrackers in your cluster
> >> >> that will need to access the files in the distributed cache,
> >> >> and r is the replication level of those files.
> >> >>
> >> >> Copying the files into HDFS requires r copy operations over the
> >> >> network. The n TaskTrackers then need to get their local copies
> >> >> from HDFS, so the n TaskTrackers copy from r DataNodes: n/r
> >> >> concurrent operations. In total, r+n/r operations. If you
> >> >> optimize r depending on n, SQRT(n) is the optimal replication
> >> >> level. So 10 is a reasonable default setting for most clusters
> >> >> that are not 500+ nodes big.
> >> >>
> >> >> Kai
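The optimization step Lin asks about is a one-line calculus
minimization, assuming Kai's cost model of r upload copies plus n/r
fetch waves:

    f(r)   = r + n/r
    f'(r)  = 1 - n/r^2 = 0   =>   r^2 = n   =>   r = SQRT(n)
    f''(r) = 2n/r^3 > 0, so this is a minimum, with f(SQRT(n)) = 2*SQRT(n)

For n = 100 TaskTrackers this gives r = 10, matching the default Kai
cites as reasonable for clusters below roughly 500 nodes.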
> >> >> On 22.12.2012 at 13:46, Lin Ma wrote:
> >> >>
> >> >> Thanks Kai - using a higher replication count for the purpose
> >> >> of?
> >> >>
> >> >> regards,
> >> >> Lin
> >> >>
> >> >> On Sat, Dec 22, 2012 at 8:44 PM, Kai Voigt wrote:
> >> >>> Hi,
> >> >>>
> >> >>> On 22.12.2012 at 13:03, Lin Ma wrote:
> >> >>>
> >> >>> > I want to confirm that when a mapper or reducer on a task
> >> >>> > node accesses a distributed cache file, the file resides on
> >> >>> > disk, not in memory. I just want to make sure the
> >> >>> > distributed cache file is not fully loaded into memory,
> >> >>> > where it would compete for memory with the mapper/reducer
> >> >>> > tasks. Is that correct?
> >> >>>
> >> >>> Yes, you are correct. The JobTracker will put files for the
> >> >>> distributed cache into HDFS with a higher replication count
> >> >>> (10 by default). Whenever a TaskTracker needs those files for
> >> >>> a task it is launching locally, it will fetch a copy to its
> >> >>> local disk, so it won't need to do this again for future tasks
> >> >>> on this node. After a job is done, all local copies and the
> >> >>> HDFS copies of files in the distributed cache are cleaned up.
> >> >>>
> >> >>> Kai
> >> >>>
> >> >>> --
> >> >>> Kai Voigt
> >> >>> k@123.org
> >> >>
> >> >> --
> >> >> Kai Voigt
> >> >> k@123.org
> >>
> >> --
> >> Harsh J
>
> --
> Harsh J
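For reference, here is a minimal MR1-style sketch of the lifecycle Kai
describes: the client registers an HDFS file with the distributed cache
at submit time, and each task reads the localized on-disk copy in
setup(). CacheDemo and /cache/lookup.dat are hypothetical names; the
10x replication mentioned above corresponds, in MR1, to the
mapred.submit.replication setting applied to files the client ships
with the job.

import java.io.BufferedReader;
import java.io.FileReader;
import java.io.IOException;
import java.net.URI;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.filecache.DistributedCache;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class CacheDemo {

    public static class CacheMapper
            extends Mapper<LongWritable, Text, LongWritable, Text> {

        @Override
        protected void setup(Context context) throws IOException {
            // The TaskTracker has already localized the cache file to
            // this node's disk; this is local file I/O, not an HDFS read.
            Path[] local = DistributedCache.getLocalCacheFiles(
                    context.getConfiguration());
            BufferedReader in = new BufferedReader(
                    new FileReader(local[0].toString()));
            // ... load lookup data from 'in' here ...
            in.close();
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = new Job(conf, "cache-demo");
        job.setJarByClass(CacheDemo.class);
        job.setMapperClass(CacheMapper.class);

        // Register an HDFS file with the distributed cache at submit
        // time; TaskTrackers will localize it before tasks launch.
        DistributedCache.addCacheFile(new URI("/cache/lookup.dat"),
                job.getConfiguration());

        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}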