From: Keith Wiley
Subject: bucketed table problems
Date: Fri, 7 Mar 2014 11:46:47 -0800
To: user@hive.apache.org

I want to convert a table to a bucketed table, so I made a new table with the same schema as the old table and specified a cluster column:

    create table foo_bucketed ( a string, b int, c float ) clustered by (b) into 10 buckets;

Then I populate it from my original table:

    set hive.enforce.bucketing = true;
    insert overwrite table foo_bucketed select * from foo;

All of the data goes into the first bucket, leaving the remaining 9 buckets empty (in the file system, the remaining 9 files are 0 size). Furthermore, the cluster column is now NULL. Its values have been completely erased by the insertion (which might explain how they all ended up in a single bucket, of course).

________________________________________________________________________________
Keith Wiley     kwiley@keithwiley.com     keithwiley.com     music.keithwiley.com

"Yet mark his perfect self-contentment, and hence learn his lesson, that to be
self-contented is to be vile and ignorant, and that to aspire is better than to
be blindly and impotently happy."
                                           --  Edwin A. Abbott, Flatland
________________________________________________________________________________
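
A rough sketch of why an all-NULL cluster column would put every row in a single bucket. It is a small Python model, not Hive code, and it rests on assumptions that are not stated in this message: that a row's bucket is (hash & 0x7fffffff) % number_of_buckets, that an int hashes to its own value, and that NULL hashes to 0. The names bucket_for and bucket_histogram are hypothetical helpers made up for the illustration.

```python
from collections import Counter

INT_MAX = 0x7FFFFFFF  # mask assumed to keep the hash non-negative


def bucket_for(value, num_buckets=10):
    """Return the bucket index for an int cluster-column value.

    None models a SQL NULL. Assumption: an int hashes to itself and
    NULL hashes to 0, so every NULL row maps to bucket 0.
    """
    h = 0 if value is None else value
    return (h & INT_MAX) % num_buckets


def bucket_histogram(values, num_buckets=10):
    """Count the rows landing in each bucket, including empty buckets."""
    counts = Counter(bucket_for(v, num_buckets) for v in values)
    return [counts.get(i, 0) for i in range(num_buckets)]


if __name__ == "__main__":
    # Distinct int values spread evenly over the 10 buckets.
    print(bucket_histogram(range(100)))
    # If column b arrives as NULL in every row, only bucket 0 is filled.
    print(bucket_histogram([None] * 100))
```

Under these assumptions the second histogram has 100 rows in the first bucket and 0 in the other nine, which matches the nine zero-size files. That would make the NULL values in b the thing to explain, with the lopsided buckets following from them.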