From: Jörn Franke
Subject: Re: Hive on Spark - Hadoop 2 - Installation - Ubuntu
Date: Fri, 20 Nov 2015 12:52:33 +0100
To: user@hive.apache.org
I recommend using a Hadoop distribution that already contains these technologies. I think you would also get other tools that are useful for your scenario, such as auditing with Sentry or Ranger.

> On 20 Nov 2015, at 10:48, Mich Talebzadeh <mich@peridale.co.uk> wrote:
>
> Well,
>
> “I'm planning to deploy Hive on Spark but I can't find the installation steps. I tried to read the official '[Hive on Spark][1]' guide, but it has problems. For example, under 'Configuring Yarn' it says `yarn.resourcemanager.scheduler.class=org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler` but does not say where I should set it. Also, as per the guide, configurations are set in the Hive runtime shell, which is not permanent as far as I know.”
>
> You can do that in the yarn-site.xml file, which is normally under $HADOOP_HOME/etc/hadoop.
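> For example, a minimal yarn-site.xml entry would look like this (a sketch only; the property name and value are exactly the ones quoted from the guide, everything else about your cluster is assumed to stay as it is):
>
>   <!-- from the 'Configuring Yarn' step of the Hive on Spark guide -->
>   <property>
>     <name>yarn.resourcemanager.scheduler.class</name>
>     <value>org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler</value>
>   </property>
>
> The same idea covers your point about the Hive shell: something like `set hive.execution.engine=spark;` lasts only for that session, whereas the equivalent property placed in hive-site.xml (normally under $HIVE_HOME/conf) is permanent:
>
>   <!-- persistent equivalent of typing "set hive.execution.engine=spark;" in the Hive shell -->
>   <property>
>     <name>hive.execution.engine</name>
>     <value>spark</value>
>   </property>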
> HTH
>
> Mich Talebzadeh
>
> Sybase ASE 15 Gold Medal Award 2008
> A Winning Strategy: Running the most Critical Financial Data on ASE 15
> http://login.sybase.com/files/Product_Overviews/ASE-Winning-Strategy-091908.pdf
> Author of "A Practitioner's Guide to Upgrading to Sybase ASE 15", ISBN 978-0-9563693-0-7.
> Co-author of "Sybase Transact SQL Guidelines Best Practices", ISBN 978-0-9759693-0-4.
> Publications due shortly:
> Complex Event Processing in Heterogeneous Environments, ISBN 978-0-9563693-3-8
> Oracle and Sybase, Concepts and Contrasts, ISBN 978-0-9563693-1-4, volume one out shortly
>
> http://talebzadehmich.wordpress.com
>
> NOTE: The information in this email is proprietary and confidential. This message is for the designated recipient only; if you are not the intended recipient, you should destroy it immediately. Any information in this message shall not be understood as given or endorsed by Peridale Technology Ltd, its subsidiaries or their employees, unless expressly so stated. It is the responsibility of the recipient to ensure that this email is virus free; neither Peridale Ltd, its subsidiaries nor their employees accept any responsibility.
>
> From: Dasun Hegoda [mailto:dasunhegoda@gmail.com]
> Sent: 20 November 2015 09:36
> To: user@hive.apache.org
> Subject: Hive on Spark - Hadoop 2 - Installation - Ubuntu
>
> Hi,
>
> What I'm planning to do is develop a reporting platform using existing data. I have an existing RDBMS with a large number of records, so I'm using the following stack (http://stackoverflow.com/questions/33635234/hadoop-2-7-spark-hive-jasperreports-scoop-architecuture):
>
> - Sqoop - extract data from the RDBMS into Hadoop
> - Hadoop - storage platform -> *Deployment Completed*
> - Hive - data warehouse
> - Spark - real-time processing -> *Deployment Completed*
>
> I'm planning to deploy Hive on Spark but I can't find the installation steps. I tried to read the official '[Hive on Spark][1]' guide, but it has problems. For example, under 'Configuring Yarn' it says `yarn.resourcemanager.scheduler.class=org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler` but does not say where I should set it. Also, as per the guide, configurations are set in the Hive runtime shell, which is not permanent as far as I know.
>
> I also read [this][2], but it does not have any steps.
>
> Could you please provide the steps to run Hive on Spark on Ubuntu as a production system?
>
> [1]: https://cwiki.apache.org/confluence/display/Hive/Hive+on+Spark%3A+Getting+Started
> [2]: http://stackoverflow.com/questions/26018306/how-to-configure-hive-to-use-spark
>
> --
> Regards,
> Dasun Hegoda, Software Engineer
> www.dasunhegoda.com | dasunhegoda@gmail.com