From: hadoop hive
To: user@hive.apache.org
Date: Sun, 16 Nov 2014 20:38:12 -0800
Subject: Re: Values getting duplicated in Hive table(Partitioned)

Can you run your select query against the non-partitioned table and check
whether it returns the correct values? Check the same for dept. B.

On Nov 17, 2014 10:03 AM, "unmesha sreeveni" <unmeshabiju@gmail.com> wrote:
> *** I created a *non-partitioned* Hive table and used a select query to
> insert the data into a *partitioned* Hive table.
>
> On Mon, Nov 17, 2014 at 10:00 AM, unmesha sreeveni <unmeshabiju@gmail.com>
> wrote:
>
>> I created a Hive table with *partition* and inserted data into the
>> partitioned Hive table.
>>
>> Referred site
>>
>> 1. Initially I created one non-partitioned table and then loaded the
>>    data into the partitioned table using a select query. Is there an
>>    alternate way?
>> 2. After following the above link, my partitioned table contains
>>    duplicate values. Below are the steps.
>>
>> This is my sample employee dataset: link1
>>
>> I tried the following queries: link2
>>
>> But after updating a value in the Hive table, the values are getting
>> duplicated.
>>
>> 7       Nirmal  Tech    12000   A
>> 7       Nirmal  Tech    12000   B
>>
>> Nirmal is placed in department *A* only, but the row is duplicated in
>> department *B*.
>>
>> Also, when I update a column value in the middle I get NULL values
>> displayed, while updating the last column works fine.
>>
>> Am I doing anything wrong? Please suggest.
>>
>> --
>> *Thanks & Regards*
>>
>> *Unmesha Sreeveni U.B*
>> *Hadoop, Bigdata Developer*
>> *Centre for Cyber Security | Amrita Vishwa Vidyapeetham*
>> http://www.unmeshasreeveni.blogspot.in/
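
P.S. The check suggested at the top of this reply (run the select against the non-partitioned table and see whether it already returns the same employee under both departments) could look roughly like the sketch below. It is only an illustration under stated assumptions: Python's built-in sqlite3 stands in for Hive, the table name employee_src and the column names id, name, role, salary, dept are hypothetical, and only the two "Nirmal" rows come from the thread. The GROUP BY / HAVING query is plain SQL and should carry over to HiveQL largely unchanged, though that has not been verified here.

```python
import sqlite3


def find_ids_in_multiple_depts(conn):
    """Return (id, dept_count) for every id listed under more than one dept."""
    query = """
        SELECT id, COUNT(DISTINCT dept) AS dept_count
        FROM employee_src
        GROUP BY id
        HAVING COUNT(DISTINCT dept) > 1
        ORDER BY id
    """
    return conn.execute(query).fetchall()


# In-memory database standing in for the non-partitioned source table.
# Table name and column names are assumptions made for this sketch.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee_src "
    "(id INTEGER, name TEXT, role TEXT, salary INTEGER, dept TEXT)"
)
conn.executemany(
    "INSERT INTO employee_src VALUES (?, ?, ?, ?, ?)",
    [
        (7, "Nirmal", "Tech", 12000, "A"),   # row quoted in the thread
        (7, "Nirmal", "Tech", 12000, "B"),   # row quoted in the thread
        (8, "Example", "Tech", 10000, "A"),  # made-up row for contrast
    ],
)

duplicates = find_ids_in_multiple_depts(conn)
print(duplicates)
```

If a query like this returns rows when run on the source table, the duplicate is probably already present in the source data or produced by the select itself, rather than introduced when the rows are written into the partitions. If it returns nothing, the insert into the partitioned table is the next place to look.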