Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 52C7A9021 for ; Fri, 1 Jun 2012 14:50:13 +0000 (UTC) Received: (qmail 35485 invoked by uid 500); 1 Jun 2012 14:50:11 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 35376 invoked by uid 500); 1 Jun 2012 14:50:11 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 35359 invoked by uid 99); 1 Jun 2012 14:50:11 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Jun 2012 14:50:11 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of prvs=mgrover=492d6cf5a@oanda.com designates 98.158.95.75 as permitted sender) Received: from [98.158.95.75] (HELO ironport-01.sms.scalar.ca) (98.158.95.75) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 01 Jun 2012 14:50:02 +0000 Received: from unknown (HELO sms-zimbra-mta-01.sms.scalar.ca) ([192.168.32.56]) by ironport-01.sms.scalar.ca with ESMTP; 01 Jun 2012 10:49:40 -0400 Received: from localhost (localhost.localdomain [127.0.0.1]) by sms-zimbra-mta-01.sms.scalar.ca (Postfix) with ESMTP id 7CCD417F5C; Fri, 1 Jun 2012 10:49:40 -0400 (EDT) Received: from sms-zimbra-mta-01.sms.scalar.ca ([127.0.0.1]) by localhost (sms-zimbra-mta-01.sms.scalar.ca [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id l0E8eWKFioTQ; Fri, 1 Jun 2012 10:49:40 -0400 (EDT) Received: from sms-zimbra-message-store-03.sms.scalar.ca (unknown [172.17.19.202]) by sms-zimbra-mta-01.sms.scalar.ca (Postfix) with ESMTP id 1257F17F4A; Fri, 1 Jun 2012 10:49:40 -0400 (EDT) Date: Fri, 1 Jun 2012 10:49:40 -0400 (EDT) From: Mark Grover To: user@hive.apache.org Cc: dev@hive.apache.org Message-ID: <947836419.398853.1338562179993.JavaMail.root@sms-zimbra-message-store-03.sms.scalar.ca> In-Reply-To: Subject: Re: Behavior of Hive 2837: insert into external tables should not be allowed MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Originating-IP: [216.235.10.210] X-Mailer: Zimbra 7.1.2_GA_3268 (ZimbraWebClient - SAF3 (Linux)/7.1.2_GA_3268) Thanks, Ashutosh and Ed. Historically, I didn't have much reason choose managed over external tables or vice-versa since the semantics were very similar. I chose external because it allowed me a better handle on the table metadata. For example, if a new column got added to the file, I could just drop the external table and recreate with the new schema. With managed, I could do the same using ALTER TABLE commands but at that point, not all metadata for the table could be modified using ALTER TABLE commands so I decided to go with external tables. I think a lot of people use external tables on HDFS in preference to managed tables. I did see the property hive.insert.into.external.tables but it's a all-or-none switch. If I had an HBase external table and a HDFS external table, it might very well be the case that I want to be able to insert into the HDFS backed external but not the HBase table. So, to me disallowing insert into all the external tables doesn't seem like the right thing to do. Like Ed suggested, it's dependent on the storage handler not on the table being external. I could go ahead and use table locking in that case, but that kinda defeats the purpose of this feature and property. Thoughts? Mark ----- Original Message ----- From: "Ashutosh Chauhan" To: dev@hive.apache.org Cc: user@hive.apache.org Sent: Friday, June 1, 2012 10:24:24 AM Subject: Re: Behavior of Hive 2837: insert into external tables should not be allowed Hi Mark, I understand your concern w.r.t backward compatibility. But as Ed pointed out there is a config variable and by default semantic is unchanged so you can continue to insert into your external table. I have a question though. Why are you creating all your tables as "external" tables ? Why not regular tables? Thanks, Ashutosh On Thu, May 31, 2012 at 9:35 PM, Mark Grover < grover.markgrover@gmail.com > wrote: Hi folks, I have a question regarding HIVE 2837( https://issues.apache.org/jira/browse/HIVE-2837 ) that deals with disallowing external table from using insert into queries. >From looking at the JIRA, it seems like it applies to external tables on HDFS as well. Technically, insert into should be ok for external tables on HDFS (and S3 as well). Seems like a storage file system level thing to specify whether insert into is applied and implement it. Historically, there hasn't been any real difference between creating an external table on HDFS vs creating a managed one. However, if we disallow insert into on external tables, that would mean that folks with external tables on HDFS wouldn't be able to make use of insert into functionality even though they should be able to. Do we want to allow insert into on HDFS tables regardless of whether they are external or not? Mark