From: Hari Shreedharan <hshreedharan@cloudera.com>
Date: Wed, 2 Jul 2014 20:47:46 -0700
Subject: Re: File Channel Backup Checkpoints are I/O Intensive
To: user@flume.apache.org

Thanks a lot. I will take a look at this tomorrow or early next week.

On Wed, Jul 2, 2014 at 5:29 PM, Abraham Fine wrote:
> Hari-
>
> I added the new tests and created a new revision to my patch.
>
> https://issues.apache.org/jira/secure/attachment/12653728/compress_backup_checkpoint_new_tests.patch
>
> Thanks,
> Abe
>
> --
> Abraham Fine | Software Engineer
> (516) 567-2535
> BrightRoll, Inc. | Smart Video Advertising | www.brightroll.com
>
>
> On Wed, Jul 2, 2014 at 4:32 PM, Hari Shreedharan <hshreedharan@cloudera.com> wrote:
>
>> Hi Abraham,
>>
>> In general, the patch looks good. Can you add a couple of tests:
>> * Original checkpoint is uncompressed, config changes to compress the
>> checkpoint - does the file channel restart from the original checkpoint? Are
>> new checkpoints compressed?
>> * Compressed checkpoint, config changes to not compress the checkpoint - does
>> the channel start up? Are new checkpoints uncompressed?
>>
>>
>> Hari
>>
>>
>> On Wed, Jul 2, 2014 at 3:06 PM, Abraham Fine wrote:
>>
>>> Hi Brock and Hari-
>>>
>>> I was just wondering if either of you had a chance to take a look at the
>>> patch and if there is anything I can do to improve it.
>>>
>>> Thanks,
>>> Abe
>>>
>>>
>>> On Wed, Jun 11, 2014 at 6:48 PM, Brock Noland wrote:
>>>
>>>> This is a great suggestion Abraham!
>>>>
>>>>
>>>> On Wed, Jun 11, 2014 at 5:39 PM, Hari Shreedharan wrote:
>>>>
>>>>> Thanks. I will review it :)
>>>>>
>>>>> Thanks,
>>>>> Hari
>>>>>
>>>>> On Wednesday, June 11, 2014 at 5:00 PM, Abraham Fine wrote:
>>>>>
>>>>> I went ahead and created a JIRA and patch:
>>>>> https://issues.apache.org/jira/browse/FLUME-2401
>>>>>
>>>>> The option is configurable with:
>>>>> agentX.channels.ch1.compressBackupCheckpoint = true
>>>>>
>>>>> As per your recommendation, I used snappy-java. I also considered the
>>>>> snappy and lz4 implementations in Hadoop IO but noticed that the
>>>>> Hadoop IO dependency was removed in
>>>>> https://issues.apache.org/jira/browse/FLUME-1285
>>>>>
>>>>> Thanks,
>>>>> Abe
>>>>>
>>>>>
>>>>> On Mon, Jun 9, 2014 at 4:01 PM, Hari Shreedharan wrote:
>>>>>
>>>>> Hi Abraham,
>>>>>
>>>>> Compressing the backup checkpoint is very possible. The backup is
>>>>> rarely read: it is used only if the original checkpoint is corrupt on
>>>>> restart. So compressing it with something like Snappy would make sense
>>>>> (GZIP might hurt performance). Can you try snappy-java and see if it
>>>>> gives good performance and reasonable compression?
>>>>>
>>>>> Patches are always welcome. I'd be glad to review and commit it.
>>>>> I would
>>>>> suggest making the compression optional via configuration, so that
>>>>> anyone with a smaller channel doesn't end up spending CPU for little gain.
>>>>>
>>>>> Thanks,
>>>>> Hari
>>>>>
>>>>> On Monday, June 9, 2014 at 3:56 PM, Abraham Fine wrote:
>>>>>
>>>>> Hello-
>>>>>
>>>>> We are using Flume 1.4 with the File Channel configured with a very
>>>>> large capacity. We keep the checkpoint and the backup checkpoint on
>>>>> separate disks.
>>>>>
>>>>> Normally the file channel is mostly empty (<<1% of capacity). For the
>>>>> checkpoint, disk I/O is very reasonable because it is written through
>>>>> a MappedByteBuffer.
>>>>>
>>>>> On the other hand, the backup checkpoint seems to be written to disk
>>>>> in its entirety over and over again, resulting in very high disk
>>>>> utilization.
>>>>>
>>>>> I noticed that, because the checkpoint file is mostly empty, it is
>>>>> very compressible: I was able to GZIP our checkpoint from 381M down to
>>>>> 386K. I was wondering if it would be possible to always compress the
>>>>> backup checkpoint before writing it to disk.
>>>>>
>>>>> I would be happy to work on a patch to implement this functionality if
>>>>> there is interest.
>>>>>
>>>>> Thanks in advance,
>>>>>
>>>>> --
>>>>> Abraham Fine | Software Engineer
>>>>> (516) 567-2535
>>>>> BrightRoll, Inc. | Smart Video Advertising | www.brightroll.com
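[Editor's note] Abe's observation that a mostly-empty checkpoint compresses from 381M down to 386K is easy to reproduce: the checkpoint is dominated by zeroed slots, which any LZ-style codec collapses almost entirely. A minimal sketch using the JDK's built-in Deflater/Inflater as a stand-in (the actual FLUME-2401 patch uses snappy-java, which is not assumed here); the 1 MiB zero buffer models a mostly-empty checkpoint file:

```java
import java.util.Arrays;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

public class CheckpointCompressionDemo {

    // Compress a byte buffer in one shot; BEST_SPEED approximates the
    // speed/ratio trade-off Hari suggests (fast codec over GZIP-default).
    static byte[] compress(byte[] input) {
        Deflater deflater = new Deflater(Deflater.BEST_SPEED);
        deflater.setInput(input);
        deflater.finish();
        byte[] buf = new byte[input.length + 64]; // worst case: incompressible input plus header overhead
        int n = deflater.deflate(buf);
        deflater.end();
        return Arrays.copyOf(buf, n);
    }

    // Round-trip back to the original bytes: a backup checkpoint is only
    // useful if decompression is exactly lossless.
    static byte[] decompress(byte[] compressed, int originalLength) throws DataFormatException {
        Inflater inflater = new Inflater();
        inflater.setInput(compressed);
        byte[] out = new byte[originalLength];
        int off = 0;
        while (off < out.length && !inflater.finished()) {
            off += inflater.inflate(out, off, out.length - off);
        }
        inflater.end();
        return out;
    }

    public static void main(String[] args) throws DataFormatException {
        byte[] checkpoint = new byte[1 << 20]; // 1 MiB of zeros: a mostly-empty checkpoint
        byte[] compressed = compress(checkpoint);
        byte[] restored = decompress(compressed, checkpoint.length);
        System.out.println("raw=" + checkpoint.length
                + " compressed=" + compressed.length
                + " lossless=" + Arrays.equals(restored, checkpoint));
    }
}
```

The compressed size lands in the low kilobytes, mirroring the ~1000x ratio Abe measured with GZIP; the trade-off the thread settles on is that Snappy gives a somewhat lower ratio but much lower CPU cost per backup write.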
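[Editor's note] For context, the option discussed in the thread is a per-channel setting in the agent's properties file, next to the existing file channel checkpoint settings. A sketch with hypothetical agent/channel names and example paths; `compressBackupCheckpoint` is the new key proposed in FLUME-2401, the other keys are standard file channel configuration:

```properties
# Hypothetical agent "agentX" with file channel "ch1"; paths are examples.
agentX.channels.ch1.type = file
agentX.channels.ch1.checkpointDir = /data1/flume/checkpoint
# Dual checkpoints enable the backup copy this thread is about;
# keeping it on a separate disk matches Abe's setup.
agentX.channels.ch1.useDualCheckpoints = true
agentX.channels.ch1.backupCheckpointDir = /data2/flume/checkpoint-backup
# New, optional knob from the FLUME-2401 patch: compress the backup
# checkpoint before writing it, trading some CPU for far less disk I/O.
agentX.channels.ch1.compressBackupCheckpoint = true
```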