Subject: Re: WAL - rate limiting factor x4.67
From: Keith Turner
To: user@accumulo.apache.org, Peter Tillotson
Date: Wed, 4 Dec 2013 11:09:12 -0500

How many concurrent writers do you have?  I made some other comments below
inline.

On Wed, Dec 4, 2013 at 10:53 AM, Peter Tillotson wrote:

> Keith
>
> I tried tserver.mutation.queue.max=4M and it improved, but the difference
> was nowhere near significant. In my app, records get turned into multiple
> Accumulo rows.
>
> So in terms of my record write rate:
>
> wal=true  & mutation.queue.max = 256K   |   ~8K records/s
> wal=true  & mutation.queue.max = 4M     |   ~14K records/s

Do you know if it's plateaued?  If you increase this further (like 8M), is
the rate the same?
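(For readers following the thread: the client-side concurrency Keith is
asking about is configured through BatchWriterConfig. Below is a minimal,
illustrative sketch of a throughput-oriented writer; the instance name,
ZooKeeper host, credentials, table name, and buffer sizes are placeholders,
not values taken from this thread.)

import java.util.concurrent.TimeUnit;

import org.apache.accumulo.core.client.BatchWriter;
import org.apache.accumulo.core.client.BatchWriterConfig;
import org.apache.accumulo.core.client.Connector;
import org.apache.accumulo.core.client.ZooKeeperInstance;
import org.apache.accumulo.core.client.security.tokens.PasswordToken;
import org.apache.accumulo.core.data.Mutation;
import org.apache.accumulo.core.data.Value;
import org.apache.hadoop.io.Text;

public class IngestSketch {
  public static void main(String[] args) throws Exception {
    // Placeholder connection details -- substitute your own.
    Connector conn = new ZooKeeperInstance("myInstance", "zkhost:2181")
        .getConnector("ingestUser", new PasswordToken("secret"));

    // A larger client-side buffer and more send threads generally help
    // ingest throughput; these numbers are starting points, not tuned values.
    BatchWriterConfig cfg = new BatchWriterConfig()
        .setMaxMemory(64 * 1024 * 1024)        // 64 MB mutation buffer
        .setMaxLatency(2, TimeUnit.MINUTES)    // flush at least this often
        .setMaxWriteThreads(8);                // concurrent sends to tservers

    BatchWriter bw = conn.createBatchWriter("myTable", cfg);
    Mutation m = new Mutation(new Text("row_0001"));
    m.put(new Text("cf"), new Text("cq"), new Value("v".getBytes()));
    bw.addMutation(m);
    bw.close();  // flushes any remaining buffered mutations
  }
}

Running several such writers in separate threads or processes (or raising
setMaxWriteThreads on a single writer) is usually what "concurrent writers"
amounts to on the client side.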
> wal=false                              |   ~25K records/s
>
> Adam,
>
> It's one box so replication is off, good thought tnx.
>
> BTW - I've been playing around with ZFS compression vs Accumulo Snappy.
> What I've found was quite interesting. The idea was that with ZFS dedup
> and being in charge of compression I'd get a boost later on when blocks
> merge. What I've found is that after a while with ZFS LZ4 the CPU and
> disk all tail off, as though timeouts are elapsing somewhere, whereas
> Snappy maintains an average of ~20K+ records/s.

With this strategy the data will not be compressed when going between the
tserver and the datanode, or between the datanode and the OS.

> Anyway tnx, and if I get a chance I may try the 1.7 branch for the fix.

Nothing has been done in 1.7 for this issue yet.


> On Wednesday, 4 December 2013, 14:56, Adam Fuchs wrote:
>
> One thing you can do is reduce the replication factor for the WAL. We
> have found that makes a pretty significant difference in write
> performance. That can be modified with the tserver.wal.replication
> property. Setting it to 2 instead of the default (probably 3) should
> give you some performance improvement, of course at some cost to
> durability.
>
> Adam
>
>
> On Wed, Dec 4, 2013 at 5:14 AM, Peter Tillotson wrote:
>
> I've been trying to get the most out of streaming data into Accumulo 1.5
> (Hadoop Cloudera CDH4). Having tried a number of settings, rewriting
> client code, etc., I finally switched off the write-ahead log
> (table.walog.enabled=false) and saw a huge leap in ingest performance.
>
> Ingest with table.walog.enabled=true:   ~6 MB/s
> Ingest with table.walog.enabled=false:  ~28 MB/s
>
> That is a speed improvement of roughly x4.67.
>
> Now my use case could probably live without a WAL, or work around not
> having one, but I wondered if this was a known issue (I didn't see
> anything in JIRA). The WAL seems to be a significant rate limiter; this
> is either endemic to Accumulo or an HDFS / setup issue. Given that
> everything is in HDFS these days and IO otherwise flies, Accumulo's WAL
> looks like the most likely culprit.
>
> I don't believe this to be an IO issue on the box: with wal off there is
> significantly more IO (up to 80 MB/s reported by dstat); with wal on, up
> to 12 MB/s reported by dstat. Testing the box with FIO, sequential write
> is 160 MB/s.
>
> Further info:
> Hadoop 2.0.0 (Cloudera CDH4)
> Accumulo 1.5.0
> ZooKeeper (with Netty, minor improvement of <1 MB/s)
> Filesystem (HDFS on ZFS, compression=on, dedup=on; otherwise ext4)
>
> With large imports from scratch I now start off CPU bound, and as more
> shuffling is needed this becomes disk bound later in the import, as
> expected. So I know pre-splitting would probably sort it.
>
> Tnx
>
> P
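(The server-side settings mentioned in this thread can also be applied
programmatically. A hedged sketch follows, assuming an existing Connector
named conn and a table named "myTable", both placeholders; some tserver.*
properties may not be picked up at runtime and would instead go in
accumulo-site.xml followed by a tablet server restart, so treat the
instanceOperations() calls as illustrative only.)

import java.util.SortedSet;
import java.util.TreeSet;

import org.apache.accumulo.core.client.Connector;
import org.apache.hadoop.io.Text;

public class TuningSketch {
  static void applyTuning(Connector conn) throws Exception {
    String table = "myTable";  // placeholder table name

    // Disable the write-ahead log for this table: faster ingest, but data
    // buffered in memory is lost if a tablet server dies before a flush.
    conn.tableOperations().setProperty(table, "table.walog.enabled", "false");

    // Keep Snappy compression inside Accumulo even when the filesystem
    // compresses, so blocks stay compressed between the tserver and datanode.
    conn.tableOperations().setProperty(table, "table.file.compress.type", "snappy");

    // System-wide properties discussed above. If these are not picked up
    // dynamically in your version, set them in accumulo-site.xml and restart
    // the tablet servers instead.
    conn.instanceOperations().setProperty("tserver.wal.replication", "2");
    conn.instanceOperations().setProperty("tserver.mutation.queue.max", "4M");

    // Pre-split the table so ingest is spread across tablets from the start.
    SortedSet<Text> splits = new TreeSet<Text>();
    for (char c = 'a'; c <= 'z'; c++) {
      splits.add(new Text(String.valueOf(c)));
    }
    conn.tableOperations().addSplits(table, splits);
  }
}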