Return-Path: Delivered-To: apmail-hive-user-archive@www.apache.org Received: (qmail 46704 invoked from network); 15 Dec 2010 23:51:28 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 15 Dec 2010 23:51:28 -0000 Received: (qmail 49066 invoked by uid 500); 15 Dec 2010 23:51:28 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 49045 invoked by uid 500); 15 Dec 2010 23:51:28 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 49030 invoked by uid 99); 15 Dec 2010 23:51:28 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Dec 2010 23:51:28 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of dnquark@gmail.com designates 209.85.161.54 as permitted sender) Received: from [209.85.161.54] (HELO mail-fx0-f54.google.com) (209.85.161.54) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Dec 2010 23:51:20 +0000 Received: by fxm16 with SMTP id 16so2626143fxm.27 for ; Wed, 15 Dec 2010 15:51:00 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=k2gg87/TyGmnemxm3WlvRdzep9HaNbDhTqFyX3MpHfo=; b=cIKb0ubaOLNr47SfHAV5dyM3bTr5lXfkJ+W1rBGyOLpKYIQkcTlE+a1oUa6TUtfQn+ kynH7OFSutWFa58tNKZqnq+dPDk/msFK784s5Pou7B4TNgeKlig1KV1NCZV8vBASypYw VZBYzVGUfmHBj8s28IHl88pcZwo3i1QZoZ5dI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=NVAIN4RZavzu+hV3wz9S6wQ9/24y5bDVGP3L2OKNgdvmpYeirxyzko439DkXvYRaYo RhdAuWSZTg+4jvWdtWv38KEgqll2WnXH7OqnrsAILNYNKbFJbp0kaOGetHOQBssTCSuj xf76Ahw187EVjXiNfbzTUuWwTlSNy4fy9abgo= MIME-Version: 1.0 Received: by 10.223.95.197 with SMTP id e5mr4533191fan.104.1292457060325; Wed, 15 Dec 2010 15:51:00 -0800 (PST) Received: by 10.223.106.20 with HTTP; Wed, 15 Dec 2010 15:51:00 -0800 (PST) In-Reply-To: <4D09504E.3080809@gmail.com> References: <4D09504E.3080809@gmail.com> Date: Wed, 15 Dec 2010 15:51:00 -0800 Message-ID: Subject: Re: Tables and importing From: Leo Alekseyev To: user@hive.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org You can use CREATE EXTERNAL TABLE... LOCATION. See http://wiki.apache.org/hadoop/Hive/LanguageManual/DDL and examples thereof. When you LOAD DATA INPATH, the directory gets moved to the Hive warehouse dir; it does not get modified. On Wed, Dec 15, 2010 at 3:33 PM, Mark wrote: > Can someone explain what actually happens when you create a table and import > data into using "LOAD DATA INPATH..." > > I noticed that when I load the data from files already existing in HDFS that > it actually removes the original file from its location and moves it under > the /user/hive directory. Is there anyway I can prevent this from happening > or is this just the way things work? At this point is the file modified in > anyway? I have some other Hadoop jobs that rely on this data. Should I just > update those jobs to operate on the data within these directories? Thanks >