Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id C75D32009F2 for ; Thu, 5 May 2016 22:19:09 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id C5EE9160A04; Thu, 5 May 2016 20:19:09 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id E7A331609F9 for ; Thu, 5 May 2016 22:19:08 +0200 (CEST) Received: (qmail 93122 invoked by uid 500); 5 May 2016 20:19:07 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 93103 invoked by uid 99); 5 May 2016 20:19:07 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 05 May 2016 20:19:07 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 22B9C1A0630; Thu, 5 May 2016 20:19:07 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.629 X-Spam-Level: ** X-Spam-Status: No, score=2.629 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, KAM_INFOUSMEBIZ=0.75, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 8ZZs7HYsBZ1i; Thu, 5 May 2016 20:19:05 +0000 (UTC) Received: from mail-yw0-f174.google.com (mail-yw0-f174.google.com [209.85.161.174]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id CF2325FAEA; Thu, 5 May 2016 20:19:04 +0000 (UTC) Received: by mail-yw0-f174.google.com with SMTP id o66so161150427ywc.3; Thu, 05 May 2016 13:19:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to; bh=0m5ye9TtFKT/qFwGj6saBx/iOrTZYMTT7gA+Du9nXPk=; b=anOyhNYXa8quYqMM33THNfZidFyMQ77Mp++ojsAjXnFr762ERhdid1XpasdkGSMAWz 5DI5M015baRYMg/dEXZHy3xRjcdJ90SToanFIVgeDSdoF2zLASgRltleoqEDCzN5aUXz 0c8yWeQrzb8kFBc4Evb4B5PjGCOC865LXQ3cvUFU19l5b3Sg62LRledoRRcqoDiW2KXC 6VL9fG06tutQoPwbq1Lo2z4tC/Xkjy6KmBu/mAbu0IjuSOc6WtUzSryFnPDFS/h5Jylh TT0cmrkRkkILYAXsecK9JD68ocjm03tbdynwlvZt/RLICX9zH01+H7xrDzTSpj979ek7 v6aQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:date:message-id:subject:from:to; bh=0m5ye9TtFKT/qFwGj6saBx/iOrTZYMTT7gA+Du9nXPk=; b=G3QHS+rIn9nu5B9YDXDaSO60A2eH9f8r1eiu7HlvUrTCY1bJbZPOQGSGshEWOK+6Tk FANNIj3MeJbNwFmbXI4vzIjrDTLUW7TL0G9VS0vmchLN7VxapeZnSfUeBPy6sp3KyWdU zpjPbpAPopg39lFSQV2A3EHXTaVLUKr1N0sMRbQR2lKmVF52Koaly+BYtVjXxR4moFdL +F8ZU4r3R0zWgYBXemNQKRqH6k2V9NBU8JJ5NIIhtKliAEtNlfjSNy8ybG9Wn8ayB6zi gGaYAp4c2E28NDlYVblwgQsbB9tGIpjuhbB1pBtvgth94nhpKgirdEyrczbwez4XSt5m dsxA== X-Gm-Message-State: AOPr4FUHGZmzVNzoszgJejjIPHlHXaipSvGLvGtzfnFmEnR2TiBlcGgfQ39+/Fg30zwoVi2QSduXDaGxzaQacg== MIME-Version: 1.0 X-Received: by 10.159.33.181 with SMTP id 50mr10047511uac.114.1462479543793; Thu, 05 May 2016 13:19:03 -0700 (PDT) Received: by 10.103.87.142 with HTTP; Thu, 5 May 2016 13:19:03 -0700 (PDT) Date: Thu, 5 May 2016 13:19:03 -0700 Message-ID: Subject: ListBucketing feature does not support uppercase string. From: Jim Green To: dev@hive.apache.org, user@hive.apache.org Content-Type: multipart/alternative; boundary=001a113ac280b304af05321e0df5 archived-at: Thu, 05 May 2016 20:19:10 -0000 --001a113ac280b304af05321e0df5 Content-Type: text/plain; charset=UTF-8 Hi Team, I found when there is uppercase string as the skew value, ListBucketing is not working. https://issues.apache.org/jira/browse/HIVE-13697 is filed: For example: 1. This is good: CREATE TABLE testskew (id INT, a STRING) SKEWED BY (a) ON ('abc', 'xyz') STORED AS DIRECTORIES; set hive.mapred.supports.subdirectories=true; set mapred.input.dir.recursive=true; INSERT OVERWRITE TABLE testskew SELECT 123,'abc' FROM dual union all SELECT 123,'xyz' FROM dual union all SELECT 123,'others' FROM dual; # hadoop fs -ls /user/hive/warehouse/testskew Found 3 items drwxrwxrwx - mapr mapr 1 2016-05-05 14:56 /user/hive/warehouse/testskew/HIVE_DEFAULT_LIST_BUCKETING_DIR_NAME drwxrwxrwx - mapr mapr 1 2016-05-05 14:56 /user/hive/warehouse/testskew/a=abc drwxrwxrwx - mapr mapr 1 2016-05-05 14:56 /user/hive/warehouse/testskew/a=xyz This is good, because both "abc" and "xyz" directories got created. 2. This is bad: Drop table testskew2; CREATE TABLE testskew2 (id INT, a STRING) SKEWED BY (a) ON ('aus', 'US') STORED AS DIRECTORIES; set hive.mapred.supports.subdirectories=true; set mapred.input.dir.recursive=true; INSERT OVERWRITE TABLE testskew2 SELECT 123, 'aus' FROM dual union all SELECT 123, 'US' FROM dual union all SELECT 123, 'others' FROM dual; # hadoop fs -ls /user/hive/warehouse/testskew2 Found 2 items drwxrwxrwx - mapr mapr 1 2016-05-05 15:11 /user/hive/warehouse/testskew2/HIVE_DEFAULT_LIST_BUCKETING_DIR_NAME drwxrwxrwx - mapr mapr 1 2016-05-05 15:11 /user/hive/warehouse/testskew2/a=aus You can see, only "aus" directory got created... -- Thanks, www.openkb.info (Open KnowledgeBase for Hadoop/Database/OS/Network/Tool) --001a113ac280b304af05321e0df5 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Hi Team,

I found when there is uppercas= e string as the skew value,=C2=A0ListBucketing is not working.
https://issues.a= pache.org/jira/browse/HIVE-13697 is filed:

For example:
1. This is good:
CREATE TABLE testskew (id INT, a STRING)
SKEWED BY (a) ON ('abc', 'xyz') STORED AS DIRECTORIES;

set hive.mapred.supports.subdirectories=3Dtrue;
set mapred.input.dir.recursive=3Dtrue;

 INSERT OVERWRITE TABLE testskew=20
 SELECT 123,'abc' FROM dual
 union all
 SELECT 123,'xyz' FROM dual
 union all
 SELECT 123,'others' FROM dual;

# hadoop fs -ls /user/hive/warehouse/testskew
Found 3 items
drwxrwxrwx   - mapr mapr          1 2016-05-05 14:56
/user/hive/warehouse/testskew/HIVE_DEFAULT_LIST_BUCKETING_DIR_NAME
drwxrwxrwx   - mapr mapr          1 2016-05-05 14:56
/user/hive/warehouse/testskew/a=3Dabc
drwxrwxrwx   - mapr mapr          1 2016-05-05 14:56
/user/hive/warehouse/testskew/a=3Dxyz

This is good, because both "abc" and "xyz" directories =
got created.

2. This is bad:
Drop table testskew2;
CREATE TABLE testskew2 (id INT, a STRING)
SKEWED BY (a) ON ('aus', 'US') STORED AS DIRECTORIES;

set hive.mapred.supports.subdirectories=3Dtrue;
set mapred.input.dir.recursive=3Dtrue;

 INSERT OVERWRITE TABLE testskew2=20
 SELECT 123, 'aus' FROM dual
 union all
 SELECT 123, 'US' FROM dual
 union all
 SELECT 123, 'others' FROM dual;

# hadoop fs -ls /user/hive/warehouse/testskew2
Found 2 items
drwxrwxrwx   - mapr mapr          1 2016-05-05 15:11
/user/hive/warehouse/testskew2/HIVE_DEFAULT_LIST_BUCKETING_DIR_NAME
drwxrwxrwx   - mapr mapr          1 2016-05-05 15:11
/user/hive/warehouse/testskew2/a=3Daus

You can see, only "aus" directory got created...
=

--
T= hanks,
(Open KnowledgeBase for Hadoop/Database/OS/Network= /Tool)
--001a113ac280b304af05321e0df5--