Subject: Re: Bulk Ingest
From: Roshan Punnoose
Date: Fri, 17 Jun 2016 11:04:40 +0000
To: user@accumulo.apache.org
References: <57635F59.9090505@gmail.com>

Thanks guys! Awesome stuff.

On Thu, Jun 16, 2016, 11:41 PM Russ Weeks <rweeks@newbrightidea.com> wrote:
> Whoops, forgot the link to GroupedKeyPartitioner from the excellent
> accumulo-recipes project:
> https://github.com/calrissian/accumulo-recipes/blob/master/thirdparty/spark/src/main/scala/org/calrissian/accumulorecipes/spark/support/GroupedKeyPartitioner.scala
>
> On Thu, Jun 16, 2016 at 8:40 PM Russ Weeks <rweeks@newbrightidea.com> wrote:
>
>> > 1) Avoid lots of small files. Target as large of files as you can,
>> > relative to your ingest latency requirements and your max file size
>> > (set on your instance or table)
>>
>> If you're using Spark to produce the RFiles, one trick for this is to
>> call coalesce() on your RDD to reduce the number of RFiles that are
>> written to HDFS.
>>
>> > 2) Avoid having to import one file to multiple tablets.
>>
>> This is huge. Again, if you're using Spark you must not use the
>> HashPartitioner to create RDDs or you'll wind up in a situation where
>> every tablet owns a piece of every RFile. Ideally you would use
>> something like the GroupedKeyPartitioner [1] to align the RDD
>> partitions with the tablet splits, but even the built-in
>> RangePartitioner will be much better than the HashPartitioner.
>>
>> -Russ
>>
>> On Thu, Jun 16, 2016 at 7:24 PM Josh Elser <josh.elser@gmail.com> wrote:
>>
>>> There are two big things that are required to really scale up bulk
>>> loading. Sadly (I guess) they are both things you would need to
>>> implement on your own:
>>>
>>> 1) Avoid lots of small files. Target as large of files as you can,
>>> relative to your ingest latency requirements and your max file size
>>> (set on your instance or table)
>>>
>>> 2) Avoid having to import one file to multiple tablets. Remember that
>>> the majority of the metadata update for Accumulo is updating the
>>> tablet row with the new file. When you have one file which spans many
>>> tablets, you now create N metadata updates instead of just one. When
>>> you create the files, take into account the split points of your
>>> table, and use that to try to target one file per tablet.
>>>
>>> Roshan Punnoose wrote:
>>> > We are trying to perform bulk ingest at scale and wanted to get some
>>> > quick thoughts on how to increase performance and stability. One of
>>> > the problems we have is that we sometimes import thousands of small
>>> > files, and I don't believe there is a good way around this in the
>>> > architecture as of yet. Already I have run into an RPC timeout issue
>>> > because the import process is taking longer than 5m, and another
>>> > issue where we have so many files after a bulk import that we have
>>> > had to bump the tserver.scan.files.open.max to 1K.
>>> >
>>> > Here are some other configs that we have been toying with:
>>> > - master.fate.threadpool.size: 20
>>> > - master.bulk.threadpool.size: 20
>>> > - master.bulk.timeout: 20m
>>> > - tserver.bulk.process.threads: 20
>>> > - tserver.bulk.assign.threads: 20
>>> > - tserver.bulk.timeout: 20m
>>> > - tserver.compaction.major.concurrent.max: 20
>>> > - tserver.scan.files.open.max: 1200
>>> > - tserver.server.threads.minimum: 64
>>> > - table.file.max: 64
>>> > - table.compaction.major.ratio: 20
>>> >
>>> > (HDFS)
>>> > - dfs.namenode.handler.count: 100
>>> > - dfs.datanode.handler.count: 50
>>> >
>>> > Just want to get any quick ideas for performing bulk ingest at scale.
>>> > Thanks guys
>>> >
>>> > p.s. This is on Accumulo 1.6.5
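
To make point 1 concrete, here is a minimal sketch of writing the bulk
files from Spark with coalesce(), assuming Spark 1.x and the Accumulo 1.6
client on the classpath. The names (RFileWriter, outputDir, numFiles) are
illustrative, and the input RDD is assumed to already be range-partitioned
and sorted by Key, since RFiles require sorted data.

import org.apache.accumulo.core.client.mapreduce.AccumuloFileOutputFormat
import org.apache.accumulo.core.data.{Key, Value}
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.mapreduce.Job
import org.apache.spark.rdd.RDD

object RFileWriter {
  // Write an already-sorted RDD of Accumulo (Key, Value) pairs as RFiles,
  // one file per partition. coalesce() without a shuffle merges adjacent
  // partitions, so a range-partitioned, sorted RDD stays sorted and the
  // bulk import sees fewer, larger files.
  def writeRFiles(sortedKVs: RDD[(Key, Value)], outputDir: String, numFiles: Int): Unit = {
    val job = Job.getInstance(new Configuration())
    sortedKVs
      .coalesce(numFiles)
      .saveAsNewAPIHadoopFile(
        outputDir,
        classOf[Key],
        classOf[Value],
        classOf[AccumuloFileOutputFormat],
        job.getConfiguration)
  }
}

Each coalesced partition becomes one file under outputDir, so numFiles is
effectively the number of RFiles handed to the bulk import.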

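
For point 2, a rough sketch of aligning partitions with the table's current
split points, in the spirit of (but much simpler than) the
GroupedKeyPartitioner linked above. The class and method names are made up;
it assumes row IDs are UTF-8 strings and that Key/Value have a registered
serializer (e.g. Kryo), since the repartition shuffles them.

import scala.collection.JavaConverters._
import org.apache.accumulo.core.client.Connector
import org.apache.accumulo.core.data.{Key, Value}
import org.apache.spark.Partitioner
import org.apache.spark.rdd.RDD

// Partition i holds rows <= splitPoints(i); the last partition holds rows
// after the final split point, mirroring how tablets divide the row space.
// Split points are kept as Strings so the partitioner itself is
// serializable; binary row IDs would need a byte-wise comparison instead.
class TabletSplitPartitioner(splitPoints: Array[String]) extends Partitioner {
  override def numPartitions: Int = splitPoints.length + 1
  override def getPartition(key: Any): Int = {
    val row = key.asInstanceOf[Key].getRow.toString
    // Linear scan for clarity; a real implementation would binary-search.
    val idx = splitPoints.indexWhere(split => row <= split)
    if (idx < 0) splitPoints.length else idx
  }
}

object TabletAlignedRDD {
  // Accumulo's Key is Comparable, so a total order for the sort is easy to supply.
  implicit val keyOrdering: Ordering[Key] = new Ordering[Key] {
    override def compare(a: Key, b: Key): Int = a.compareTo(b)
  }

  // Repartition so each partition maps to exactly one tablet and is sorted,
  // ready to be written out as one RFile per tablet.
  def alignToTablets(conn: Connector, table: String,
                     kvs: RDD[(Key, Value)]): RDD[(Key, Value)] = {
    val splitPoints = conn.tableOperations().listSplits(table)
      .asScala.map(_.toString).toArray.sorted
    kvs.repartitionAndSortWithinPartitions(new TabletSplitPartitioner(splitPoints))
  }
}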
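
For completeness, the import call itself, which is the step that was
running into the RPC timeout in the original message. The instance name,
ZooKeeper hosts, credentials, table, and paths below are placeholders.

import org.apache.accumulo.core.client.ZooKeeperInstance
import org.apache.accumulo.core.client.security.tokens.PasswordToken

object BulkImport {
  def main(args: Array[String]): Unit = {
    // Placeholder connection details.
    val instance = new ZooKeeperInstance("myInstance", "zk1:2181,zk2:2181,zk3:2181")
    val conn = instance.getConnector("bulkUser", new PasswordToken("secret"))

    // The failure directory must already exist and be empty; any files the
    // tablet servers cannot load are moved there.
    conn.tableOperations().importDirectory(
      "myTable",        // destination table
      "/bulk/files",    // directory of RFiles produced by the Spark job
      "/bulk/failures", // failure directory
      false)            // setTime = false: keep the timestamps already in the files
  }
}

The master.bulk.timeout and tserver.bulk.timeout values in the config list
above are the server-side timeouts that were already raised to 20m to
accommodate this step.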
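
Finally, the table-level settings from that list can be applied through the
client API as well as the shell; a small sketch, with illustrative names:

import org.apache.accumulo.core.client.Connector

object TuneBulkIngest {
  // Per-table properties take effect at runtime. System-wide properties can
  // only be changed this way if they are ZooKeeper-mutable; otherwise they
  // belong in accumulo-site.xml and require a restart.
  def applySettings(conn: Connector, table: String): Unit = {
    conn.tableOperations().setProperty(table, "table.file.max", "64")
    conn.tableOperations().setProperty(table, "table.compaction.major.ratio", "20")
    // Only effective if this tserver property can be changed via ZooKeeper.
    conn.instanceOperations().setProperty("tserver.scan.files.open.max", "1200")
  }
}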