Mailing-List: contact user-help@flume.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@flume.apache.org
Received-SPF: pass (nike.apache.org: domain of Matt.Kenison@disney.com
 designates 204.128.192.17 as permitted sender)
From: "Kenison, Matt" <Matt.Kenison@disney.com>
To: "user@flume.apache.org" <user@flume.apache.org>
Date: Thu, 6 Feb 2014 13:11:19 -0800
Subject: Re: Channel management: messages that will never be delivered
Thread-Topic: Channel management: messages that will never be delivered
Thread-Index: Ac8jf/0QH3bxBkopQYiUzZHu+CpkEw==
Message-ID: <CF19380F.233A%matt.kenison@disney.com>
In-Reply-To: 
 <4A7D5110AA4DCF4F98905065947956E315C5B39B@BGB01XUD1008.national.core.bbc.co.uk>
Accept-Language: en-US
Content-Language: en-US
user-agent: Microsoft-MacOutlook/14.2.4.120824
acceptlanguage: en-US
Content-Type: multipart/alternative;
	boundary="_000_CF19380F233Amattkenisondisneycom_"
MIME-Version: 1.0

--_000_CF19380F233Amattkenisondisneycom_
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable

It's possible, but not easy to do. In our application, the individual
custom sinks know which exceptions can trigger a rollback and which are
unrecoverable, but this doesn't work for the built-in sinks, and it
doesn't take into account unexpected failures. So, we control it
manually with a JMX flag and by subclassing BasicTransactionSemantics.
When the flag is set and the transaction tries to roll back, it performs
a commit instead (and directs the messages to a failure channel). When a
transaction is successful, the flag is reset.

It's not the prettiest solution, but it isn't a hack. It requires
subclassing the channel to provide a custom transaction, and overriding
the default transaction behavior.  Flume really doesn't make it easy to
extend the behavior of any of the standard components.
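
A minimal, self-contained sketch of the rollback-to-commit idea described
above, in plain Java. It does not use Flume's classes and is not the
implementation referred to in this message: SketchChannel,
DivertingTransaction and the AtomicBoolean flag are hypothetical
stand-ins. In Flume itself the equivalent logic would presumably sit in
the doRollback() and doCommit() hooks of a BasicTransactionSemantics
subclass returned by a custom channel, with the flag exposed over JMX.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical stand-in for a channel: an in-memory queue of event bodies.
class SketchChannel {
    final Deque<String> events = new ArrayDeque<>();
}

// Models a take-transaction whose rollback can be turned into a commit.
class DivertingTransaction {
    private final SketchChannel source;
    private final SketchChannel failure;
    // The flag an operator would flip (over JMX in the setup described above).
    private final AtomicBoolean divertOnRollback;
    private final List<String> taken = new ArrayList<>();

    DivertingTransaction(SketchChannel source, SketchChannel failure,
                         AtomicBoolean divertOnRollback) {
        this.source = source;
        this.failure = failure;
        this.divertOnRollback = divertOnRollback;
    }

    // Take the next event and remember it until commit or rollback.
    String take() {
        String event = source.events.pollFirst();
        if (event != null) {
            taken.add(event);
        }
        return event;
    }

    // A successful delivery drops the taken events and resets the flag.
    void commit() {
        taken.clear();
        divertOnRollback.set(false);
    }

    void rollback() {
        if (divertOnRollback.get()) {
            // Commit instead: the events leave the source channel for good
            // and are handed to the failure channel.
            failure.events.addAll(taken);
        } else {
            // Normal rollback: put the events back at the head, in order.
            for (int i = taken.size() - 1; i >= 0; i--) {
                source.events.addFirst(taken.get(i));
            }
        }
        taken.clear();
    }
}

class DivertDemo {
    public static void main(String[] args) {
        SketchChannel primary = new SketchChannel();
        SketchChannel failed = new SketchChannel();
        AtomicBoolean flag = new AtomicBoolean(true); // operator set the flag
        primary.events.add("bad-event");

        DivertingTransaction tx = new DivertingTransaction(primary, failed, flag);
        tx.take();
        tx.rollback(); // the sink could not deliver the event
        System.out.println("primary=" + primary.events + " failed=" + failed.events);
    }
}
```

The flag stays set until a transaction commits, so every failing batch is
diverted while it is on; that matches the reset-on-success behavior
described above, but the exact policy is an assumption of this sketch.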


From: Paul Merry <paul.merry@bbc.co.uk>
Reply-To: "user@flume.apache.org" <user@flume.apache.org>
Date: Thursday, February 6, 2014 12:28 AM
To: "user@flume.apache.org" <user@flume.apache.org>
Subject: RE: Channel management: messages that will never be delivered


Thanks for the suggestion, Ed; it's definitely something we could look at.

I did find this ticket https://issues.apache.org/jira/browse/FLUME-2140
and the linked discussion thread
http://flume.markmail.org/thread/y3cks6hdgof3kxu6#query:+page:1+mid:rx3zm53t4dhmqskk+state:results

There are some suggestions for workarounds there. The failover sink is
probably the most relevant, but I'd be concerned about what might happen
to legitimate messages when there is downtime or a connection issue with
the endpoint. It seems we'd lose the correct channel retry logic (in
that scenario) and end up with messages that would need replaying.

If there isn't much to add on the handling of 'bad messages', can anyone
explain what happens to the other messages in a batch that contains one
or more of these undeliverable messages? Will they also fail to reach
their destination, or will they get rebatched?

I'm also keen to hear from anyone with an idea for how to clear these
messages from a channel once they are stuck, as deleting the directories
can take good messages down too.


- Paul


________________________________
From: ed [edorsey@gmail.com]
Sent: 05 February 2014 23:12
To: user@flume.apache.org
Subject: Re: Channel management: messages that will never be delivered

Hi Paul,

Not sure if this would work for you, but if you can error check prior to
the events reaching Elasticsearch, you can handle this by writing a
custom Interceptor that validates your events.  You can do more robust
error checking here than you can by just relying on the existing event
header fields, as you'll have full access to the event header and body.
Within the interceptor, if the event is not compatible with
Elasticsearch, you can add a boolean flag such as "hasError" to the
event header.  Then you can route any events that have an error to a
different "error" channel using a multiplexing selector that checks for
the hasError flag.  The error channel can be connected either to a
NullSink or, if you want to preserve the improperly formatted events, to
a FileRoll sink.
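
A rough sketch of the validation step such an interceptor might perform,
written as plain Java over a header map so that it runs without Flume on
the classpath. The header name "indexName" and the class
IndexNameValidator are assumptions made for illustration; only the
"hasError" flag and the lowercase rule (from the
InvalidIndexNameException quoted further down this thread) come from the
discussion. In a real interceptor this logic would go in
intercept(Event) and operate on event.getHeaders().

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

class IndexNameValidator {
    // Hypothetical header carrying the target Elasticsearch index name.
    static final String INDEX_HEADER = "indexName";
    // Flag header that a multiplexing selector could route on.
    static final String ERROR_FLAG = "hasError";

    // Returns a copy of the headers with hasError set to "true" or "false".
    static Map<String, String> flag(Map<String, String> headers) {
        Map<String, String> out = new HashMap<>(headers);
        String index = headers.get(INDEX_HEADER);
        // Assumption: a missing or empty index name is also treated as an error.
        boolean bad = index == null
                || index.isEmpty()
                || !index.equals(index.toLowerCase(Locale.ROOT));
        out.put(ERROR_FLAG, Boolean.toString(bad));
        return out;
    }

    public static void main(String[] args) {
        Map<String, String> headers = new HashMap<>();
        headers.put(INDEX_HEADER, "UpperCase-2014-01-23");
        System.out.println(flag(headers).get(ERROR_FLAG));
    }
}
```

Only the lowercase check is shown; any other rule Elasticsearch enforces
on index names would need its own check in the same place.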

We've only used the memory channel so far, so I'm afraid I can't comment
on the file channel specific questions you have.  Hopefully someone on
the list with some more experience there can chime in.

Best,

Ed


On Wed, Feb 5, 2014 at 5:34 PM, Paul Merry <paul.merry@bbc.co.uk> wrote:

Hi,

We are using an Elasticsearch sink and have seen a file channel filling
with messages that will never be delivered, as the format of the message
is incompatible with Elasticsearch itself.

Example message from Flume logs:


24 Jan 2014 08:14:55,173 ERROR [SinkRunner-PollingRunner-DefaultSinkProcessor]
(org.apache.flume.SinkRunner$PollingRunner.run:160)  - Unable to deliver event.
Exception follows.
org.elasticsearch.indices.InvalidIndexNameException: [UpperCase-2014-01-23]
Invalid index name [UpperCase-2014-01-23], must be lowercase

In this case the index name comes from a header, so we have a workaround
using a multiplexing channel selector to detect and re-route messages
based on headers of this format.

To clean up the channel this time we removed the data and checkpoint
directories, which is not ideal as we probably lost other messages in
doing this.

We are wary of similar situations occurring in future for messages that
we can't detect and divert in advance, and so have a few questions:

- What would be the recommended handling of this situation?

- Is it possible to clear just these messages from the channel, or does
the whole channel have to be deleted?

- Is there a way that we can divert these messages to another channel
(dead letter / invalid message style)? Note that they are not known to
be problematic until the sink attempts to deliver them.

- What happens to other messages in a batch with a bad message? Will
they also be stuck forever, or will they be taken in another batch?


Thanks,

Paul.


----------------------------

http://www.bbc.co.uk
This e-mail (and any attachments) is confidential and may contain
personal views which are not the views of the BBC unless specifically
stated.
If you have received it in error, please delete it from your system.
Do not use, copy or disclose the information in any way nor act in
reliance on it and notify the sender immediately.
Please note that the BBC monitors e-mails sent or received.
Further communication will signify your consent to this.

---------------------




--_000_CF19380F233Amattkenisondisneycom_--