From: Mark Fuini
To: user@flume.apache.org
Date: Fri, 27 Sep 2013 11:05:31 -0400
Subject: Re: Unable to put batch on required channel

unsubscribe

On Fri, Sep 27, 2013 at 10:55 AM, Cameron Wellock <cameron.wellock@nrelate.com> wrote:

> Final update, in case anyone ever has a similar problem: increasing the
> transactionCapacity to a low multiple of the batch size (say 5x batch size)
> seems to have fixed the problem, at least for the moment.
>
> Cameron
>
> On Thu, Sep 26, 2013 at 12:22 PM, Cameron Wellock <cameron.wellock@nrelate.com> wrote:
>
>> Hi Paul, thanks for your thoughts. The sink does not complain--at
>> all--and there are no relevant errors in the logs on the datanodes. I
>> haven't waited to see if flume recovers after the other write stops, as I
>> took the error messages at face value and restarted flume. I will try that
>> today, time permitting, and I'll let you know what happens.
>>
>> Thanks again,
>> Cameron
>>
>> On Thu, Sep 26, 2013 at 12:07 PM, Paul Chavez <pchavez@verticalsearchworks.com> wrote:
>>
>>> Is the HDFS sink reporting any issues writing to the cluster? If you
>>> leave it alone, or wait until the other application stops writing, will
>>> flume recover?
>>>
>>> SpoolDir is a good source if the write performance to HDFS is variable,
>>> as the files in the spool directory will just sit and wait until the flume
>>> channel has space again. Another option may be to add another HDFS sink or
>>> two pulling from the same channel, but from what you are saying this may
>>> not increase performance.
>>>
>>> Hope that helps,
>>> Paul Chavez
>>>
>>> From: Cameron Wellock [mailto:cameron.wellock@nrelate.com]
>>> Sent: Thursday, September 26, 2013 8:37 AM
>>> To: user@flume.apache.org
>>> Subject: Unable to put batch on required channel
>>>
>>> Hello world,
>>>
>>> I've been trying to set up a test instance of flume and have been
>>> stymied by recurring failures. I'm trying to use a single flume agent
>>> moving about 200G of data from a spooldir into a very small hadoop cluster
>>> (3 nodes). If flume is the only thing writing to HDFS, everything works
>>> fine, but as soon as another application starts writing data into the
>>> cluster, HDFS slows down and flume barfs with an "unable to put batch on
>>> required channel" exception.
>>>
>>> I have tried all kinds of configuration changes, to no avail. I have
>>> tried memory channels, file channels, small batch sizes (down to 50), large
>>> batch sizes (up to 20000), increasing timeouts, increasing channel capacity
>>> (up to 150 million), you name it. Sooner or later (usually 5-10 minutes
>>> after restart) flume comes to a halt. This is especially vexing considering
>>> that it's copying from a file to a file--there are no realtime requirements
>>> that might reasonably lead to a full channel in other circumstances.
>>> Anybody have any advice? Insights? Wild guesses? Outright lies?
>>>
>>> Below are two exceptions from the log, one from a memory channel
>>> configuration, one from a file channel configuration, and below that is the
>>> most recent configuration file used. Absolutely any suggestions would be
>>> appreciated.
>>>
>>> Thanks,
>>> Cameron
>>>
>>> 25 Sep 2013 21:05:12,262 ERROR [pool-5-thread-1]
>>> (org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run:195)
>>> - FATAL: Spool Directory source r1: { spoolDir: /var/nrelate/flume-spool }:
>>> Uncaught exception in SpoolDirectorySource thread. Restart or reconfigure
>>> Flume to continue processing.
>>> org.apache.flume.ChannelException: Unable to put batch on required
>>> channel: org.apache.flume.channel.MemoryChannel{name: c1}
>>>         at org.apache.flume.channel.ChannelProcessor.processEventBatch(ChannelProcessor.java:200)
>>>         at org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run(SpoolDirectorySource.java:189)
>>>         at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
>>>         at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:351)
>>>         at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:178)
>>>         at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:165)
>>>         at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:267)
>>>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
>>>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
>>>         at java.lang.Thread.run(Thread.java:679)
>>> Caused by: org.apache.flume.ChannelException: Space for commit to queue
>>> couldn't be acquired Sinks are likely not keeping up with sources, or the
>>> buffer size is too tight
>>>         at org.apache.flume.channel.MemoryChannel$MemoryTransaction.doCommit(MemoryChannel.java:128)
>>>         at org.apache.flume.channel.BasicTransactionSemantics.commit(BasicTransactionSemantics.java:151)
>>>         at org.apache.flume.channel.ChannelProcessor.processEventBatch(ChannelProcessor.java:192)
>>>         ... 9 more
>>>
>>> 25 Sep 2013 22:18:37,672 ERROR [pool-5-thread-1]
>>> (org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run:195)
>>> - FATAL: Spool Directory source r1: { spoolDir: /var/nrelate/flume-spool }:
>>> Uncaught exception in SpoolDirectorySource thread. Restart or reconfigure
>>> Flume to continue processing.
>>> org.apache.flume.ChannelException: Unable to put batch on required
>>> channel: FileChannel c1 { dataDirs: [/var/lib/flume-ng/.flume/file-channel/data] }
>>>         at org.apache.flume.channel.ChannelProcessor.processEventBatch(ChannelProcessor.java:200)
>>>         at org.apache.flume.source.SpoolDirectorySource$SpoolDirectoryRunnable.run(SpoolDirectorySource.java:189)
>>>         at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
>>>         at java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:351)
>>>         at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:178)
>>>         at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:165)
>>>         at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:267)
>>>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
>>>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
>>>         at java.lang.Thread.run(Thread.java:679)
>>> Caused by: org.apache.flume.ChannelException: The channel has reached
>>> it's capacity. This might be the result of a sink on the channel having too
>>> low of batch size, a downstream system running slower than normal, or that
>>> the channel capacity is just too low. [channel=c1]
>>>         at org.apache.flume.channel.file.FileChannel$FileBackedTransaction.doPut(FileChannel.java:468)
>>>         at org.apache.flume.channel.BasicTransactionSemantics.put(BasicTransactionSemantics.java:93)
>>>         at org.apache.flume.channel.BasicChannelSemantics.put(BasicChannelSemantics.java:80)
>>>         at org.apache.flume.channel.ChannelProcessor.processEventBatch(ChannelProcessor.java:189)
>>>         ... 9 more
>>>
>>> # define the pipeline parts --------------------------------------------------------------
>>>
>>> agent.sources = r1
>>> agent.sinks = k1
>>> agent.channels = c1
>>>
>>> agent.sources.r1.channels = c1
>>> agent.sinks.k1.channel = c1
>>>
>>> # the main source, a spooldir ------------------------------------------------------------
>>>
>>> agent.sources.r1.type = spooldir
>>> agent.sources.r1.spoolDir = /var/intheworld/flume-spool
>>> agent.sources.r1.batchSize = 10000
>>> agent.sources.r1.deserializer.maxLineLength = 10000
>>> agent.sources.r1.interceptors = i1 i2
>>>
>>> # parse out the timestamp and add to header
>>> agent.sources.r1.interceptors.i1.type = regex_extractor
>>> agent.sources.r1.interceptors.i1.regex = ^.*\\"ts\\":(\\d+).*$
>>> agent.sources.r1.interceptors.i1.serializers = s1
>>> agent.sources.r1.interceptors.i1.serializers.s1.name = timestamp
>>>
>>> # also set host (hostname doesn't work properly, so set explicitly)
>>> agent.sources.r1.interceptors.i2.type = static
>>> agent.sources.r1.interceptors.i2.key = host
>>> agent.sources.r1.interceptors.i2.value = Ess003726
>>>
>>> # the sink, HDFS -------------------------------------------------------------------------
>>>
>>> agent.sinks.k1.type = hdfs
>>> agent.sinks.k1.hdfs.path = hdfs://a.host.in.the.world.com/events/raw/%Y-%m-%d
>>> agent.sinks.k1.hdfs.filePrefix = %{host}
>>> agent.sinks.k1.hdfs.rollInterval = 0
>>> agent.sinks.k1.hdfs.rollSize = 0
>>> agent.sinks.k1.hdfs.rollCount = 0
>>> agent.sinks.k1.hdfs.batchSize = 10000
>>> agent.sinks.k1.hdfs.txnEventMax = 10000
>>> agent.sinks.k1.hdfs.idleTimeout = 900
>>> agent.sinks.k1.hdfs.callTimeout = 300000
>>> agent.sinks.k1.hdfs.fileType = DataStream
>>> agent.sinks.k1.hdfs.writeFormat = Text
>>>
>>> # the channel ----------------------------------------------------------------------------
>>>
>>> agent.channels.c1.type = file
>>> agent.channels.c1.capacity = 150000000
>>> agent.channels.c1.transactionCapacity = 10000
>>> agent.channels.c1.write-timeout = 360
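[Editor's sketch] Cameron's fix above (transactionCapacity at a low multiple of the batch size) could look like the fragment below against the posted config. This is a sketch, not a tested configuration: it assumes the 10000-event batch size shown above and relies on Flume's rule that a channel transaction must be able to hold a full source put or sink take, so transactionCapacity must be at least the largest batchSize in play.

```properties
# channel sized so one transaction holds a full 10000-event batch, with headroom
# (5x the batch size, per the fix reported in this thread)
agent.channels.c1.type = file
agent.channels.c1.capacity = 150000000
agent.channels.c1.transactionCapacity = 50000

# batch sizes stay well below transactionCapacity
agent.sources.r1.batchSize = 10000
agent.sinks.k1.hdfs.batchSize = 10000
```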
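[Editor's sketch] Paul's other suggestion, an extra HDFS sink draining the same channel, could be sketched as below. The sink name k2 and its settings are illustrative, not from the thread; the second sink needs a distinct filePrefix so the two sinks never attempt to write the same file.

```properties
# second HDFS sink pulling from the same channel to increase drain rate
agent.sinks = k1 k2
agent.sinks.k1.channel = c1
agent.sinks.k2.channel = c1

agent.sinks.k2.type = hdfs
agent.sinks.k2.hdfs.path = hdfs://a.host.in.the.world.com/events/raw/%Y-%m-%d
# distinct prefix so k1 and k2 do not collide on file names
agent.sinks.k2.hdfs.filePrefix = %{host}-k2
agent.sinks.k2.hdfs.batchSize = 10000
agent.sinks.k2.hdfs.fileType = DataStream
agent.sinks.k2.hdfs.writeFormat = Text
```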