Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@hadoop.apache.org
Received-SPF: pass (athena.apache.org: domain of unmeshabiju@gmail.com
 designates 209.85.213.182 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CADtHtMzzQM5y_FNuJ-ybLMHpwKo7L3kZ-YsfDNS0JeSs1V2zhQ@mail.gmail.com>
References: 
 <CACp0qUEPHjO30DuMfTjd=qDx4kumU658prpCabAn9Da=rq3aJQ@mail.gmail.com>
 <CACp0qUGFK3ET2WQAuRo8CVWVtPVPm7HL6t_4+=4J=3vWw70WUw@mail.gmail.com>
 <CADtHtMzzQM5y_FNuJ-ybLMHpwKo7L3kZ-YsfDNS0JeSs1V2zhQ@mail.gmail.com>
From: unmesha sreeveni <unmeshabiju@gmail.com>
Date: Mon, 17 Nov 2014 13:01:14 +0530
Message-ID: 
 <CACp0qUF1-nqSQk0Jgkm4uwmgd2E3DimRwDNxEKpntvphHruSgA@mail.gmail.com>
Subject: Fwd: Values getting duplicated in Hive table(Partitioned)
To: User Hadoop <user@hadoop.apache.org>
Content-Type: multipart/alternative; boundary=20cf305644230955ad050808f941

--20cf305644230955ad050808f941
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

In non partitioned table I am getting the correct values.

Is my update query wrong?

INSERT OVERWRITE TABLE Unm_Parti_Trail PARTITION (Department =3D 'A') SELEC=
T
employeeid,firstname,designation, CASE WHEN employeeid=3D19 THEN '50000 ELS=
E
salary END AS salary FROM Unm_Parti_Trail;


What I tried to include in the query is , In partion with department =3D A,
update employeeid =3D19 's salary with 50000

Is that query statement wrong? and the replication is not affected to dept
B and C


---------- Forwarded message ----------
From: hadoop hive <hadoophive@gmail.com>
Date: Mon, Nov 17, 2014 at 10:08 AM
Subject: Re: Values getting duplicated in Hive table(Partitioned)
To: user@hive.apache.org


Can you check your select query to run on non partitioned tables. Check if
it's giving correct values.

Same as for dept. B
 On Nov 17, 2014 10:03 AM, "unmesha sreeveni" <unmeshabiju@gmail.com> wrote=
:

> ***I created a Hive table with *non*- *partitioned* and using select
> query I inserted data into *Partioned* Hive table.
>
> On Mon, Nov 17, 2014 at 10:00 AM, unmesha sreeveni <unmeshabiju@gmail.com=
>
> wrote:
>
>> I created a Hive table with *partition* and inserted data into Partioned
>> Hive table.
>>
>> Refered site
>> <https://blog.safaribooksonline.com/2012/12/03/tip-partitioning-data-in-=
hive/>
>>
>>    1.
>>
>>    *Initially created one Non -partioned table and then using select
>>    query and loaded data into partioned table. Is there an alternate way=
?*
>>    2.
>>
>>    *By following above link my partioned table contains duplicate
>>    values. Below are the setps*
>>
>> This is my Sample employee dataset:link1 <http://pastebin.com/tVh16Yxt>
>>
>> I tried the following queries: link2 <http://pastebin.com/U2yykWpy>
>>
>> But after updating a value in Hive table,the values are getting
>> duplicated.
>>
>> 7       Nirmal  Tech    12000   A
>> 7       Nirmal  Tech    12000   B
>>
>> Nirmal is placed in Department *A* only
>> =E2=80=8B,=E2=80=8B
>> but it is duplicated to department *B*.
>>
>> And Once I update a column value in middle I am getting NULL values
>> displayed,while updating last column it is fine.
>>
>> Am I doing any thing wrong.
>> Please suggest.--
>>
>
> --
*Thanks & Regards *


*Unmesha Sreeveni U.B*
*Hadoop, Bigdata Developer*
*Centre for Cyber Security | Amrita Vishwa Vidyapeetham*
http://www.unmeshasreeveni.blogspot.in/

--20cf305644230955ad050808f941
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div class=3D"gmail_default" style><div class=3D"gmail_def=
ault" style=3D"font-family:verdana,sans-serif;font-size:13.333333969116211p=
x"><br></div><div class=3D"gmail_default" style><div style>In non partition=
ed table I am getting the correct values.<br><br>Is my update query wrong?<=
font color=3D"#000000" face=3D"Consolas, Menlo, Monaco, Lucida Console, Lib=
eration Mono, DejaVu Sans Mono, Bitstream Vera Sans Mono, monospace, serif"=
><div style><span style=3D"font-size:12.222222328186035px;line-height:21px"=
><br></span></div><div style><span style=3D"font-size:12.222222328186035px;=
line-height:21px">INSERT OVERWRITE TABLE Unm_Parti_Trail PARTITION (Departm=
ent =3D &#39;A&#39;) SELECT employeeid,firstname,designation, CASE WHEN emp=
loyeeid=3D19 THEN &#39;50000 ELSE salary END AS salary FROM Unm_Parti_Trail=
;</span></div><div style><span style=3D"font-size:12.222222328186035px;line=
-height:21px"><br></span></div><div style><span style=3D"font-size:12.22222=
2328186035px;line-height:21px"><br></span></div></font>What I tried to incl=
ude in the query is , In partion with department =3D A, update employeeid =
=3D19 &#39;s salary with 50000<br><br>Is that query statement wrong? and th=
e replication is not affected to dept B and C<font color=3D"#000000" face=
=3D"Consolas, Menlo, Monaco, Lucida Console, Liberation Mono, DejaVu Sans M=
ono, Bitstream Vera Sans Mono, monospace, serif"><div style=3D"font-family:=
arial,sans-serif;font-size:12.222222328186035px;line-height:21px"><br></div=
></font></div><div style=3D"font-family:arial,sans-serif;font-size:13.33333=
3969116211px"><font color=3D"#000000" face=3D"Consolas, Menlo, Monaco, Luci=
da Console, Liberation Mono, DejaVu Sans Mono, Bitstream Vera Sans Mono, mo=
nospace, serif"><span style=3D"font-size:12.222222328186035px;line-height:2=
1px"><br></span></font></div></div></div><div class=3D"gmail_quote">-------=
--- Forwarded message ----------<br>From: <b class=3D"gmail_sendername">had=
oop hive</b> <span dir=3D"ltr">&lt;<a href=3D"mailto:hadoophive@gmail.com" =
target=3D"_blank">hadoophive@gmail.com</a>&gt;</span><br>Date: Mon, Nov 17,=
 2014 at 10:08 AM<br>Subject: Re: Values getting duplicated in Hive table(P=
artitioned)<br>To: <a href=3D"mailto:user@hive.apache.org" target=3D"_blank=
">user@hive.apache.org</a><br><br><br><p dir=3D"ltr">Can you check your sel=
ect query to run on non partitioned tables. Check if it&#39;s giving correc=
t values.</p>
<p dir=3D"ltr">Same as for dept. B<br>
</p><div><div>
<div class=3D"gmail_quote">On Nov 17, 2014 10:03 AM, &quot;unmesha sreeveni=
&quot; &lt;<a href=3D"mailto:unmeshabiju@gmail.com" target=3D"_blank">unmes=
habiju@gmail.com</a>&gt; wrote:<br type=3D"attribution"><blockquote class=
=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;bo=
rder-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">=
<div dir=3D"ltr"><div style=3D"font-family:verdana,sans-serif">***<span sty=
le=3D"font-size:13.333333969116211px;color:rgb(0,0,0);font-family:Arial,=
9;Liberation Sans&#39;,&#39;DejaVu Sans&#39;,sans-serif;line-height:17.8047=
98126220703px">I created a Hive table with <b>non</b>-=C2=A0</span><strong =
style=3D"font-size:13.63636302947998px;color:rgb(0,0,0);font-family:Arial,&=
#39;Liberation Sans&#39;,&#39;DejaVu Sans&#39;,sans-serif;line-height:17.80=
4798126220703px;margin:0px;padding:0px;border:0px;vertical-align:baseline;b=
ackground:transparent">partitioned</strong><span style=3D"font-size:13.3333=
33969116211px;color:rgb(0,0,0);font-family:Arial,&#39;Liberation Sans&#39;,=
&#39;DejaVu Sans&#39;,sans-serif;line-height:17.804798126220703px">=C2=A0an=
d using select query I inserted data into <b>Partioned</b> Hive table.</spa=
n></div></div><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">On =
Mon, Nov 17, 2014 at 10:00 AM, unmesha sreeveni <span dir=3D"ltr">&lt;<a hr=
ef=3D"mailto:unmeshabiju@gmail.com" target=3D"_blank">unmeshabiju@gmail.com=
</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin=
:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204)=
;border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div style=3D"f=
ont-family:verdana,sans-serif"><p style=3D"margin:0px 0px 1em;padding:0px;b=
order:0px;font-size:13.63636302947998px;vertical-align:baseline;clear:both;=
color:rgb(0,0,0);font-family:Arial,&#39;Liberation Sans&#39;,&#39;DejaVu Sa=
ns&#39;,sans-serif;line-height:17.804800033569336px;background-image:initia=
l;background-repeat:initial">I created a Hive table with=C2=A0<strong style=
=3D"margin:0px;padding:0px;border:0px;font-size:13.63636302947998px;vertica=
l-align:baseline;background:transparent">partition</strong>=C2=A0and insert=
ed data into Partioned Hive table.</p><p style=3D"margin:0px 0px 1em;paddin=
g:0px;border:0px;font-size:13.63636302947998px;vertical-align:baseline;clea=
r:both;color:rgb(0,0,0);font-family:Arial,&#39;Liberation Sans&#39;,&#39;De=
jaVu Sans&#39;,sans-serif;line-height:17.804800033569336px;background-image=
:initial;background-repeat:initial"><a href=3D"https://blog.safaribooksonli=
ne.com/2012/12/03/tip-partitioning-data-in-hive/" rel=3D"nofollow" style=3D=
"margin:0px;padding:0px;border:0px;font-size:13.63636302947998px;vertical-a=
lign:baseline;color:rgb(74,107,130);text-decoration:none;background:transpa=
rent" target=3D"_blank">Refered site</a></p><ol style=3D"margin:0px 0px 1em=
 30px;padding:0px;border:0px;font-size:13.63636302947998px;vertical-align:b=
aseline;list-style-position:initial;color:rgb(0,0,0);font-family:Arial,&#39=
;Liberation Sans&#39;,&#39;DejaVu Sans&#39;,sans-serif;line-height:17.80480=
0033569336px;background-image:initial;background-repeat:initial"><li style=
=3D"margin:0px;padding:0px;border:0px;font-size:13.63636302947998px;vertica=
l-align:baseline;background:transparent"><p style=3D"margin:0px 0px 1em;pad=
ding:0px;border:0px;font-size:13.63636302947998px;vertical-align:baseline;c=
lear:both;background:transparent"><strong style=3D"margin:0px;padding:0px;b=
order:0px;font-size:13.63636302947998px;vertical-align:baseline;background:=
transparent">Initially created one Non -partioned table and then using sele=
ct query and loaded data into partioned table. Is there an alternate way?</=
strong></p></li><li style=3D"margin:0px;padding:0px;border:0px;font-size:13=
.63636302947998px;vertical-align:baseline;background:transparent"><p style=
=3D"margin:0px 0px 1em;padding:0px;border:0px;font-size:13.63636302947998px=
;vertical-align:baseline;clear:both;background:transparent"><strong style=
=3D"margin:0px;padding:0px;border:0px;font-size:13.63636302947998px;vertica=
l-align:baseline;background:transparent">By following above link my partion=
ed table contains duplicate values. Below are the setps</strong></p></li></=
ol><p style=3D"margin:0px 0px 1em;padding:0px;border:0px;font-size:13.63636=
302947998px;vertical-align:baseline;clear:both;color:rgb(0,0,0);font-family=
:Arial,&#39;Liberation Sans&#39;,&#39;DejaVu Sans&#39;,sans-serif;line-heig=
ht:17.804800033569336px;background-image:initial;background-repeat:initial"=
>This is my Sample employee dataset:<a href=3D"http://pastebin.com/tVh16Yxt=
" rel=3D"nofollow" style=3D"margin:0px;padding:0px;border:0px;font-size:13.=
63636302947998px;vertical-align:baseline;color:rgb(74,107,130);text-decorat=
ion:none;background:transparent" target=3D"_blank">link1</a></p><p style=3D=
"margin:0px 0px 1em;padding:0px;border:0px;font-size:13.63636302947998px;ve=
rtical-align:baseline;clear:both;color:rgb(0,0,0);font-family:Arial,&#39;Li=
beration Sans&#39;,&#39;DejaVu Sans&#39;,sans-serif;line-height:17.80480003=
3569336px;background-image:initial;background-repeat:initial">I tried the f=
ollowing queries:=C2=A0<a href=3D"http://pastebin.com/U2yykWpy" rel=3D"nofo=
llow" style=3D"margin:0px;padding:0px;border:0px;font-size:13.6363630294799=
8px;vertical-align:baseline;color:rgb(74,107,130);text-decoration:none;back=
ground:transparent" target=3D"_blank">link2</a></p><p style=3D"margin:0px 0=
px 1em;padding:0px;border:0px;font-size:13.63636302947998px;vertical-align:=
baseline;clear:both;color:rgb(0,0,0);font-family:Arial,&#39;Liberation Sans=
&#39;,&#39;DejaVu Sans&#39;,sans-serif;line-height:17.804800033569336px;bac=
kground-image:initial;background-repeat:initial">But after updating a value=
 in Hive table,the values are getting duplicated.</p><pre style=3D"margin-t=
op:0px;margin-bottom:10px;padding:5px;border:0px;font-size:13.6363630294799=
8px;vertical-align:baseline;font-family:Consolas,Menlo,Monaco,&#39;Lucida C=
onsole&#39;,&#39;Liberation Mono&#39;,&#39;DejaVu Sans Mono&#39;,&#39;Bitst=
ream Vera Sans Mono&#39;,&#39;Courier New&#39;,monospace,serif;overflow:aut=
o;width:auto;max-height:600px;word-wrap:normal;color:rgb(0,0,0);line-height=
:17.804800033569336px;background:rgb(238,238,238)"><code style=3D"margin:0p=
x;padding:0px;border:0px;font-size:13.63636302947998px;vertical-align:basel=
ine;font-family:Consolas,Menlo,Monaco,&#39;Lucida Console&#39;,&#39;Liberat=
ion Mono&#39;,&#39;DejaVu Sans Mono&#39;,&#39;Bitstream Vera Sans Mono&#39;=
,&#39;Courier New&#39;,monospace,serif;white-space:inherit;background-image=
:initial;background-repeat:initial">7       Nirmal  Tech    12000   A
7       Nirmal  Tech    12000   B</code></pre></div><p style=3D"margin:0px =
0px 1em;padding:0px;border:0px;font-size:13.63636302947998px;vertical-align=
:baseline;clear:both;color:rgb(0,0,0);font-family:Arial,&#39;Liberation San=
s&#39;,&#39;DejaVu Sans&#39;,sans-serif;line-height:17.804800033569336px;ba=
ckground-image:initial;background-repeat:initial">Nirmal is placed in Depar=
tment=C2=A0<strong style=3D"margin:0px;padding:0px;border:0px;font-size:13.=
63636302947998px;vertical-align:baseline;background:transparent">A</strong>=
=C2=A0only</p><div style=3D"font-family:verdana,sans-serif;display:inline">=
=E2=80=8B,=E2=80=8B</div> but it is duplicated to department=C2=A0<strong s=
tyle=3D"margin:0px;padding:0px;border:0px;font-size:13.63636302947998px;ver=
tical-align:baseline;background:transparent">B</strong>.<p></p><p style=3D"=
margin:0px 0px 1em;padding:0px;border:0px;font-size:13.63636302947998px;ver=
tical-align:baseline;clear:both;color:rgb(0,0,0);font-family:Arial,&#39;Lib=
eration Sans&#39;,&#39;DejaVu Sans&#39;,sans-serif;line-height:17.804800033=
569336px;background-image:initial;background-repeat:initial">And Once I upd=
ate a column value in middle I am getting NULL values displayed,while updat=
ing last column it is fine.</p><p style=3D"margin:0px 0px 1em;padding:0px;b=
order:0px;font-size:13.63636302947998px;vertical-align:baseline;clear:both;=
color:rgb(0,0,0);font-family:Arial,&#39;Liberation Sans&#39;,&#39;DejaVu Sa=
ns&#39;,sans-serif;line-height:17.804800033569336px;background-image:initia=
l;background-repeat:initial">Am I doing any thing wrong.</p><div><span styl=
e=3D"color:rgb(0,0,0);font-family:Arial,&#39;Liberation Sans&#39;,&#39;Deja=
Vu Sans&#39;,sans-serif;font-size:13.63636302947998px;line-height:17.804800=
033569336px">Please suggest.</span>--=C2=A0</div></div></blockquote></div><=
div><br></div></div></blockquote></div></div></div></div>-- <br><div><div d=
ir=3D"ltr"><div><div dir=3D"ltr"><b><font color=3D"#3d85c6"><i>Thanks &amp;=
 Regards</i>
</font></b><div><i><b><font color=3D"#3d85c6"><br></font></b></i></div><div=
><b><font color=3D"#3d85c6">Unmesha Sreeveni U.B<i><br></i></font></b></div=
><div><b><font color=3D"#3d85c6">Hadoop, Bigdata Developer</font></b></div>=
<div><b><font color=3D"#3d85c6">Centre for Cyber Security | Amrita Vishwa V=
idyapeetham</font></b><br></div><div style=3D"color:rgb(102,0,0)"><a href=
=3D"http://www.unmeshasreeveni.blogspot.in/" target=3D"_blank">http://www.u=
nmeshasreeveni.blogspot.in/</a><br></div><div style=3D"color:rgb(102,0,0)">=
<br></div><i><span><br></span></i></div></div></div></div>
</div>

--20cf305644230955ad050808f941--