hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From unmesha sreeveni <unmeshab...@gmail.com>
Subject Values getting duplicated in Hive table(Partitioned)
Date Mon, 17 Nov 2014 04:30:10 GMT
I created a Hive table with *partition* and inserted data into Partioned
Hive table.

Refered site
<https://blog.safaribooksonline.com/2012/12/03/tip-partitioning-data-in-hive/>

   1.

   *Initially created one Non -partioned table and then using select query
   and loaded data into partioned table. Is there an alternate way?*
   2.

   *By following above link my partioned table contains duplicate values.
   Below are the setps*

This is my Sample employee dataset:link1 <http://pastebin.com/tVh16Yxt>

I tried the following queries: link2 <http://pastebin.com/U2yykWpy>

But after updating a value in Hive table,the values are getting duplicated.

7       Nirmal  Tech    12000   A
7       Nirmal  Tech    12000   B

Nirmal is placed in Department *A* only
​,​
but it is duplicated to department *B*.

And Once I update a column value in middle I am getting NULL values
displayed,while updating last column it is fine.

Am I doing any thing wrong.
Please suggest.--
*Thanks & Regards *


*Unmesha Sreeveni U.B*
*Hadoop, Bigdata Developer*
*Centre for Cyber Security | Amrita Vishwa Vidyapeetham*
http://www.unmeshasreeveni.blogspot.in/

Mime
View raw message