Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 837589B34 for ; Tue, 26 Jun 2012 07:42:59 +0000 (UTC) Received: (qmail 76340 invoked by uid 500); 26 Jun 2012 07:42:58 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 76269 invoked by uid 500); 26 Jun 2012 07:42:58 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 76250 invoked by uid 99); 26 Jun 2012 07:42:57 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Jun 2012 07:42:57 +0000 X-ASF-Spam-Status: No, hits=1.8 required=5.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FSL_RCVD_USER,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of shaik5943@gmail.com designates 209.85.214.176 as permitted sender) Received: from [209.85.214.176] (HELO mail-ob0-f176.google.com) (209.85.214.176) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Jun 2012 07:42:49 +0000 Received: by obbwd18 with SMTP id wd18so9460611obb.35 for ; Tue, 26 Jun 2012 00:42:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=ID8qADCa9K+jKA+HyGhgDLNknjb+yhRJYgA9Y/lz6bQ=; b=R0ATc1AmmZyQTvrCVkeZYCabzCesSp6vDoTx8Ybaf2CNwkN8cTux0M4zroTjOF2Hue tSBhRfWhV5WfEflsv/mqiSCVFAOVoz/KrA9BfwmQBNidjxY7tWCXFcshPfluVXniTdCy Qv0hdyFGRyQgnf4FbIKgNJBe3PRuNDLFochqsnIk+hoZEXUkXEVCaNhgXcDRhXK7CXKX pKrBSbbva4Xf3772q7zSfmBOIIM63BhkuhAerwrtgXs1sK09rTIqs7wpMAvX8LSB0Qr1 0+Ga4kokDYMtJTkfFDxEz2fMaFjmEqT9S9PJaEBmVwVEVtLSq00DyBcnO40tUVbj/P3x sS5w== MIME-Version: 1.0 Received: by 10.60.168.230 with SMTP id zz6mr8955897oeb.11.1340696548887; Tue, 26 Jun 2012 00:42:28 -0700 (PDT) Received: by 10.76.170.132 with HTTP; Tue, 26 Jun 2012 00:42:28 -0700 (PDT) Date: Tue, 26 Jun 2012 13:12:28 +0530 Message-ID: Subject: hi all From: shaik ahamed To: user@hive.apache.org Content-Type: multipart/alternative; boundary=bcaec54ee8cc8c4c1f04c35b3dab --bcaec54ee8cc8c4c1f04c35b3dab Content-Type: text/plain; charset=ISO-8859-1 Hi Users, As i created an hive table with the below syntax CREATE EXTERNAL TABLE vender_part(vender string, supplier string,quantity int ) PARTITIONED BY (order_date string) row format delimited fields terminated by ',' stored as textfile; And inserted the 100GB of data with the below command INSERT OVERWRITE TABLE vender_part PARTITION (order_date) SELECT vender,supplier,order_date,quantity FROM vender; then im getting the below output Vendor_1 Supplier_111 2012-03-07 4240 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_112 2012-03-07 1237 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_113 2012-03-07 2970 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_114 2012-03-07 4652 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_115 2012-03-07 7414 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_116 2012-03-07 2334 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_117 2012-03-07 10522 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_118 2012-03-07 1776 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_119 2012-03-07 8344 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_120 2012-03-07 10362 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_121 2012-03-07 4579 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_122 2012-03-07 8020 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_123 2012-03-07 3520 NULL NULL __HIVE_DEFAULT_PARTITION__ Vendor_1 Supplier_124 2012-03-07 9124 NULL NULL __HIVE_DEFAULT_PARTITION__ please tell me that the above output is correct or not and why the 2 columns are null and there is a column with __HIVE_DEFAULT_PARTITION__ And if i select the partition table then the time taken to retrieve the data should be less ,when compare to before partition right that not happening for me. Time taken for 100GB of data is : 2192.416 seconds 3.If i select the partition table order_date im not getting the data. select * from vender_part where order_date='2012-03-07'; hive> select * from vender_part where order_date='2012-03-07'; OK Time taken: 2.801 seconds Please reply back to my above questions and help me out in going further with the clear output who it will come when we do the hive table partitioning And why im not getting the data for the partitoned table if i select the order_date. Thanks in advance shaik. --bcaec54ee8cc8c4c1f04c35b3dab Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi Users,


=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0=A0 As i created an hive= table with the below syntax

CREATE EXTERNAL TABLE vender_part(vende= r string, supplier string,quantity int ) PARTITIONED BY (order_date string)= row format delimited fields terminated by ',' stored as textfile;<= br>

And inserted the 100GB of data with the below command

INSERT= OVERWRITE TABLE vender_part PARTITION (order_date) SELECT vender,supplier,= order_date,quantity=A0 FROM vender;
=A0

then im getting the below= output

Vendor_1 Supplier_111 2012-03-07 4240=A0=A0 NULL=A0=A0=A0 NULL=A0=A0=A0= __HIVE_DEFAULT_PARTITION__
Vendor_1 Supplier_112 2012-03-07 1237=A0=A0 = NULL=A0=A0=A0 NULL=A0=A0=A0 __HIVE_DEFAULT_PARTITION__
Vendor_1 Supplier= _113 2012-03-07 2970=A0=A0 NULL=A0=A0=A0 NULL=A0=A0=A0 __HIVE_DEFAULT_PARTI= TION__
Vendor_1 Supplier_114 2012-03-07 4652=A0=A0 NULL=A0=A0=A0 NULL=A0=A0=A0 __H= IVE_DEFAULT_PARTITION__
Vendor_1 Supplier_115 2012-03-07 7414=A0=A0 NULL= =A0=A0=A0 NULL=A0=A0=A0 __HIVE_DEFAULT_PARTITION__
Vendor_1 Supplier_116= 2012-03-07 2334=A0=A0 NULL=A0=A0=A0 NULL=A0=A0=A0 __HIVE_DEFAULT_PARTITION= __
Vendor_1 Supplier_117 2012-03-07 10522=A0 NULL=A0=A0=A0 NULL=A0=A0=A0 __HIV= E_DEFAULT_PARTITION__
Vendor_1 Supplier_118 2012-03-07 1776=A0=A0 NULL= =A0=A0=A0 NULL=A0=A0=A0 __HIVE_DEFAULT_PARTITION__
Vendor_1 Supplier_119= 2012-03-07 8344=A0=A0 NULL=A0=A0=A0 NULL=A0=A0=A0 __HIVE_DEFAULT_PARTITION= __
Vendor_1 Supplier_120 2012-03-07 10362=A0 NULL=A0=A0=A0 NULL=A0=A0=A0 __HIV= E_DEFAULT_PARTITION__
Vendor_1 Supplier_121 2012-03-07 4579=A0=A0 NULL= =A0=A0=A0 NULL=A0=A0=A0 __HIVE_DEFAULT_PARTITION__
Vendor_1 Supplier_122= 2012-03-07 8020=A0=A0 NULL=A0=A0=A0 NULL=A0=A0=A0 __HIVE_DEFAULT_PARTITION= __
Vendor_1 Supplier_123 2012-03-07 3520=A0=A0 NULL=A0=A0=A0 NULL=A0=A0=A0 __H= IVE_DEFAULT_PARTITION__
Vendor_1 Supplier_124 2012-03-07 9124=A0=A0 NULL= =A0=A0=A0 NULL=A0=A0=A0 __HIVE_DEFAULT_PARTITION__

please tell me th= at the above output is correct or not and why the 2 columns are null and th= ere is a column with __HIVE_DEFAULT_PARTITION__

And if i select the partition table then the time taken to retrieve the= data should be less ,when compare to before partition right that not happe= ning for me.

Time taken for 100GB of data is : 2192.416 seconds

3.If i select the partition table order_date im not getting the data.
select * from vender_part where order_date=3D'2012-03-07';
hive> select * from vender_part=A0 where order_date=3D'2012-03-= 07';
OK
Time taken: 2.801 seconds

Please reply back to my above questi= ons and help me out in going further with the clear output who it will come= when we do the hive table partitioning
=A0And why im not getting the d= ata for the partitoned table if i select the order_date.


Thanks in advance

shaik.

--bcaec54ee8cc8c4c1f04c35b3dab--