From: Maximilian Michels
Date: Thu, 5 Nov 2015 14:56:59 +0100
Subject: Re: Running continuously on yarn with kerberos
To: "user@flink.apache.org"

Thank you for looking into the problem, Niels.
Let us know if you need anything. We would be happy to merge a pull
request once you have verified the fix.

On Thu, Nov 5, 2015 at 1:38 PM, Niels Basjes wrote:
> I created https://issues.apache.org/jira/browse/FLINK-2977
>
> On Thu, Nov 5, 2015 at 12:25 PM, Robert Metzger wrote:
>
>> Hi Niels,
>> thank you for analyzing the issue so thoroughly. I agree with you. It seems
>> that HDFS and HBase are using their own tokens, which we need to transfer
>> from the client to the YARN containers. We should be able to port the fix
>> from Spark (which they got from Storm) into our YARN client.
>> I think we would add this in org.apache.flink.yarn.Utils#setTokensFor().
>>
>> Do you want to implement and verify the fix yourself? If you are too busy
>> at the moment, we can also discuss how we share the work (I'm implementing
>> it, you test the fix).
>>
>> Robert
>>
>> On Tue, Nov 3, 2015 at 5:26 PM, Niels Basjes wrote:
>>
>>> Update on the status so far... I suspect I found a problem in a secure
>>> setup.
>>>
>>> I have created a very simple Flink topology consisting of a streaming
>>> Source (that outputs the timestamp a few times per second) and a Sink
>>> (that puts that timestamp into a single record in HBase).
>>> Running this on a non-secure Yarn cluster works fine.
>>>
>>> To run it on a secured Yarn cluster my main routine now looks like this:
>>>
>>> public static void main(String[] args) throws Exception {
>>>     System.setProperty("java.security.krb5.conf", "/etc/krb5.conf");
>>>     UserGroupInformation.loginUserFromKeytab("nbasjes@xxxxxx.NET", "/home/nbasjes/.krb/nbasjes.keytab");
>>>
>>>     final StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
>>>     env.setParallelism(1);
>>>
>>>     DataStream<String> stream = env.addSource(new TimerTicksSource());
>>>     stream.addSink(new SetHBaseRowSink());
>>>     env.execute("Long running Flink application");
>>> }
>>>
>>> When I run this
>>>     flink run -m yarn-cluster -yn 1 -yjm 1024 -ytm 4096 ./kerberos-1.0-SNAPSHOT.jar
>>>
>>> I see after the startup messages:
>>>
>>> 17:13:24,466 INFO  org.apache.hadoop.security.UserGroupInformation - Login successful for user nbasjes@xxxxxx.NET using keytab file /home/nbasjes/.krb/nbasjes.keytab
>>> 11/03/2015 17:13:25 Job execution switched to status RUNNING.
>>> 11/03/2015 17:13:25 Custom Source -> Stream Sink(1/1) switched to SCHEDULED
>>> 11/03/2015 17:13:25 Custom Source -> Stream Sink(1/1) switched to DEPLOYING
>>> 11/03/2015 17:13:25 Custom Source -> Stream Sink(1/1) switched to RUNNING
>>>
>>> Which looks good.
>>>
>>> However ... no data goes into HBase.
>>> After some digging I found this error in the task manager's log:
>>>
>>> 17:13:42,677 WARN  org.apache.hadoop.hbase.ipc.RpcClient - Exception encountered while connecting to the server : javax.security.sasl.SaslException: GSS initiate failed [Caused by GSSException: No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt)]
>>> 17:13:42,677 FATAL org.apache.hadoop.hbase.ipc.RpcClient - SASL authentication failed. The most likely cause is missing or invalid credentials. Consider 'kinit'.
>>> javax.security.sasl.SaslException: GSS initiate failed [Caused by GSSException: No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt)]
>>>     at com.sun.security.sasl.gsskerb.GssKrb5Client.evaluateChallenge(GssKrb5Client.java:212)
>>>     at org.apache.hadoop.hbase.security.HBaseSaslRpcClient.saslConnect(HBaseSaslRpcClient.java:177)
>>>     at org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupSaslConnection(RpcClient.java:815)
>>>     at org.apache.hadoop.hbase.ipc.RpcClient$Connection.access$800(RpcClient.java:349)
>>>
>>> First starting a yarn-session and then loading my job gives the same error.
>>>
>>> My best guess at this point is that Flink needs the same fix as described here:
>>>
>>> https://issues.apache.org/jira/browse/SPARK-6918
>>> (https://github.com/apache/spark/pull/5586)
>>>
>>> What do you guys think?
>>>
>>> Niels Basjes
>>>
>>> On Tue, Oct 27, 2015 at 6:12 PM, Maximilian Michels wrote:
>>>
>>>> Hi Niels,
>>>>
>>>> You're welcome. Some more information on how this would be configured:
>>>>
>>>> In the kdc.conf, there are two variables:
>>>>
>>>>     max_life = 2h 0m 0s
>>>>     max_renewable_life = 7d 0h 0m 0s
>>>>
>>>> max_life is the maximum lifetime of the current ticket. However, the ticket
>>>> may be renewed up to a time span of max_renewable_life from the first ticket
>>>> issue on. This means that from the first ticket issue, new tickets may be
>>>> requested for one week. Each renewed ticket has a lifetime of max_life
>>>> (2 hours in this case).
>>>>
>>>> Please let us know about any difficulties with long-running streaming
>>>> applications and Kerberos.
>>>>
>>>> Best regards,
>>>> Max
>>>>
>>>> On Tue, Oct 27, 2015 at 2:46 PM, Niels Basjes wrote:
>>>>
>>>>> Hi,
>>>>>
>>>>> Thanks for your feedback.
>>>>> So I guess I'll have to talk to the security guys about having special
>>>>> kerberos ticket expiry times for these types of jobs.
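[Editor's note: the kdc.conf lifetime arithmetic described above can be sketched as follows; the 2h and 7d values are simply the example defaults quoted from the thread, not required settings.]

```java
import java.time.Duration;

public class TicketLifetime {
    public static void main(String[] args) {
        // Example values from the kdc.conf snippet in the thread above.
        Duration maxLife = Duration.ofHours(2);          // max_life = 2h 0m 0s
        Duration maxRenewableLife = Duration.ofDays(7);  // max_renewable_life = 7d 0h 0m 0s

        // Each ticket lives at most max_life; renewals are allowed until
        // max_renewable_life has elapsed since the first ticket was issued.
        long renewals = maxRenewableLife.toHours() / maxLife.toHours();
        System.out.println("Renew every " + maxLife.toHours() + "h, at most "
                + renewals + " times over " + maxRenewableLife.toDays() + " days.");
    }
}
```

So with these settings a job can keep obtaining fresh tickets for one week at most, which is why the thread suggests raising max_renewable_life for never-ending streaming jobs.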
>>>>>
>>>>> Niels Basjes
>>>>>
>>>>> On Fri, Oct 23, 2015 at 11:45 AM, Maximilian Michels wrote:
>>>>>
>>>>>> Hi Niels,
>>>>>>
>>>>>> Thank you for your question. Flink relies entirely on the Kerberos
>>>>>> support of Hadoop. So your question could also be rephrased as "Does
>>>>>> Hadoop support long-term authentication using Kerberos?". And the
>>>>>> answer is: Yes!
>>>>>>
>>>>>> While Hadoop uses Kerberos tickets to authenticate users with services
>>>>>> initially, the authentication process continues differently
>>>>>> afterwards. Instead of saving the ticket to authenticate on a later
>>>>>> access, Hadoop creates its own security tokens (DelegationToken) that
>>>>>> it passes around. These are authenticated against Kerberos periodically. To
>>>>>> my knowledge, the tokens have a life span identical to the Kerberos
>>>>>> ticket maximum life span. So be sure to set the maximum life span very
>>>>>> high for long streaming jobs. The renewal time, on the other hand, is
>>>>>> not important because Hadoop abstracts this away using its own
>>>>>> security tokens.
>>>>>>
>>>>>> I'm afraid there is no Kerberos how-to yet. If you are on Yarn, then
>>>>>> it is sufficient to authenticate the client with Kerberos. On a Flink
>>>>>> standalone cluster you need to ensure that, initially, all nodes are
>>>>>> authenticated with Kerberos using the kinit tool.
>>>>>>
>>>>>> Feel free to ask if you have more questions and let us know about any
>>>>>> difficulties.
>>>>>>
>>>>>> Best regards,
>>>>>> Max
>>>>>>
>>>>>> On Thu, Oct 22, 2015 at 2:06 PM, Niels Basjes wrote:
>>>>>> > Hi,
>>>>>> >
>>>>>> > I want to write a long-running (i.e. never stop it) streaming flink
>>>>>> > application on a kerberos-secured Hadoop/Yarn cluster. My application needs
>>>>>> > to do things with files on HDFS and HBase tables on that cluster, so having
>>>>>> > the correct kerberos tickets is very important.
>>>>>> > The stream is to be ingested
>>>>>> > from Kafka.
>>>>>> >
>>>>>> > One of the things with Kerberos is that the tickets expire after a
>>>>>> > predetermined time. My knowledge about kerberos is very limited, so I hope
>>>>>> > you guys can help me.
>>>>>> >
>>>>>> > My question is actually quite simple: Is there a how-to somewhere on how to
>>>>>> > correctly run a long-running flink application with kerberos that includes a
>>>>>> > solution for the kerberos ticket timeout?
>>>>>> >
>>>>>> > Thanks
>>>>>> >
>>>>>> > Niels Basjes
>>>>>
>>>>> --
>>>>> Best regards / Met vriendelijke groeten,
>>>>>
>>>>> Niels Basjes
>>>
>>> --
>>> Best regards / Met vriendelijke groeten,
>>>
>>> Niels Basjes
>
> --
> Best regards / Met vriendelijke groeten,
>
> Niels Basjes
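[Editor's note: the Spark fix referenced in the thread (SPARK-6918) obtains an HBase delegation token on the client via reflection, so the submission client needs no hard HBase dependency, and then ships the token with the YARN application. A minimal sketch of that idea follows; the class and method names (org.apache.hadoop.hbase.security.token.TokenUtil#obtainToken(Configuration)) are those of HBase 0.98/1.x and should be treated as assumptions, not as what Flink eventually implemented.]

```java
import java.lang.reflect.Method;

public class ObtainHBaseToken {
    /**
     * Try to obtain an HBase delegation token for the current Kerberos user.
     * Uses reflection so this compiles and runs without HBase on the classpath;
     * in that case it simply returns null.
     *
     * @param hbaseConf an org.apache.hadoop.conf.Configuration for the cluster
     * @return a Token<AuthenticationTokenIdentifier>, or null if HBase is absent
     */
    public static Object obtainTokenIfHBasePresent(Object hbaseConf) {
        try {
            Class<?> tokenUtil =
                    Class.forName("org.apache.hadoop.hbase.security.token.TokenUtil");
            Method obtain = tokenUtil.getMethod("obtainToken",
                    Class.forName("org.apache.hadoop.conf.Configuration"));
            // The returned token would then be added to the container launch
            // context's credentials, next to the HDFS delegation tokens.
            return obtain.invoke(null, hbaseConf);
        } catch (ClassNotFoundException e) {
            // HBase (or Hadoop) not on the classpath: nothing to do.
            return null;
        } catch (ReflectiveOperationException e) {
            throw new RuntimeException("Failed to obtain HBase delegation token", e);
        }
    }

    public static void main(String[] args) {
        Object token = obtainTokenIfHBasePresent(null);
        System.out.println(token == null
                ? "HBase not on the classpath; no token obtained"
                : "HBase delegation token obtained");
    }
}
```

In the thread's proposal, a call along these lines would sit in org.apache.flink.yarn.Utils#setTokensFor() so that the HBase token travels to the task managers together with the HDFS tokens.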