From: "Torok, David"
To: "user@flink.apache.org"
Subject: Reference configs for HA / RocksDB / YARN / Zookeeper / HDFS
Date: Fri, 10 Mar 2017 14:38:13 +0000

Hi,

 

Forgive me if parts of this question have been answered before, but I'd like help resolving some confusion from the documentation; I also haven't been able to find a good example anywhere of an enterprise-style setup. If anyone has a sample HA / YARN / ZK / RocksDB configuration, could you share it?

 

We are currently using Flink 1.2.0 and Hortonworks (an older version, HDP 2.2.9, based on Hadoop 2.6.0). We're trying out a small sample cluster with 9 YARN client nodes.

 

1.  We have large state and large time windows, and therefore want to use RocksDB as our state backend. Is it typical or best practice for RocksDB to write its working state to local disk for speed, with checkpoints going to HDFS for recovery / HA? Or does everything live in HDFS? From my understanding of the docs, "The RocksDBStateBackend holds in-flight data in a RocksDB database that is (per default) stored in the TaskManager data directories"... (is this set automatically via YARN?). The checkpoint directory is then set via "state.backend.fs.checkpointdir: hdfs://namenode:40010/flink/checkpoints" or dynamically, e.g. new RocksDBStateBackend(statepath). A sketch of what I have in mind follows this question.

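To make question 1 concrete, here is roughly what I was planning to try. This is only a sketch of my current understanding, not a tested setup: the HDFS URI and the local RocksDB path are placeholders for our cluster.

    import org.apache.flink.contrib.streaming.state.RocksDBStateBackend;
    import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

    public class StateBackendSketch {
        public static void main(String[] args) throws Exception {
            StreamExecutionEnvironment env =
                    StreamExecutionEnvironment.getExecutionEnvironment();

            // Checkpoint data (the recovery / HA copy) goes to HDFS.
            RocksDBStateBackend backend =
                    new RocksDBStateBackend("hdfs://namenode:40010/flink/checkpoints");

            // Working state stays on fast local disk. My understanding is that
            // if this is left unset, RocksDB falls back to the TaskManager data
            // directories, which YARN points at its local dirs.
            backend.setDbStoragePath("/data/flink/rocksdb");

            env.setStateBackend(backend);
            env.enableCheckpointing(60000); // checkpoint every 60 seconds

            // ... job definition and env.execute(...) would go here ...
        }
    }

If that split (local working state, HDFS checkpoints) is the intended usage, then my question reduces to whether the local path needs to be set at all under YARN.
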
2.  It's unclear to me whether YARN automatically provides Flink with the ZooKeeper information, or whether I also need to set the ZooKeeper details in flink-conf.yaml... the examples seem to imply that the ZK settings are only used if you start your own ZooKeeper rather than point at an existing one. Do I need to set this up explicitly for HA on YARN? My current guess at the config is sketched below.

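For question 2, my guess from reading the 1.2 HA documentation is that flink-conf.yaml would need something like the following (hosts and paths are made up, and I'm not sure yarn.application-attempts is strictly required):

    high-availability: zookeeper
    high-availability.zookeeper.quorum: zk1:2181,zk2:2181,zk3:2181
    high-availability.zookeeper.storageDir: hdfs:///flink/recovery
    high-availability.zookeeper.path.root: /flink
    yarn.application-attempts: 10

What I can't tell is whether any of this gets filled in automatically when deploying on YARN.
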
3.  I've seen some conflicting information about including HADOOP_CLASSPATH: some say it causes many conflicts with Flink's own libraries, whereas others say it's important for resolving various deserialization errors at runtime. The approach I've seen suggested is shown below.

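For question 3, the pattern I've seen suggested (and would try first) is to let Hadoop report its own classpath rather than hand-listing jars:

    export HADOOP_CLASSPATH=`hadoop classpath`

but I don't know whether that avoids or aggravates the library-conflict problem people describe.
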
4.  Someone suggested that we build Flink from source ourselves against the Hortonworks distribution; I'm really hoping that's not necessary (a guess at what that would involve is below).

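If building from source does turn out to be necessary, my reading of the build documentation is that it would look roughly like this, with our actual HDP Hadoop version string substituted in (the version below is a placeholder):

    mvn clean install -DskipTests -Pvendor-repos -Dhadoop.version=2.6.0.<hdp-build-version>

I'd still much prefer to avoid this if the stock binaries work.
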
 

Appreciate any info as we learn how to productionize our Flink clusters!

 

Best Regards

Dave
