Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5C7EE9581 for ; Mon, 5 Mar 2012 02:54:13 +0000 (UTC) Received: (qmail 90828 invoked by uid 500); 5 Mar 2012 02:54:12 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 90787 invoked by uid 500); 5 Mar 2012 02:54:12 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 90765 invoked by uid 99); 5 Mar 2012 02:54:11 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 05 Mar 2012 02:54:11 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of edlinuxguru@gmail.com designates 209.85.210.44 as permitted sender) Received: from [209.85.210.44] (HELO mail-pz0-f44.google.com) (209.85.210.44) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 05 Mar 2012 02:54:06 +0000 Received: by dakl33 with SMTP id l33so4861448dak.31 for ; Sun, 04 Mar 2012 18:53:45 -0800 (PST) Received-SPF: pass (google.com: domain of edlinuxguru@gmail.com designates 10.68.125.195 as permitted sender) client-ip=10.68.125.195; Authentication-Results: mr.google.com; spf=pass (google.com: domain of edlinuxguru@gmail.com designates 10.68.125.195 as permitted sender) smtp.mail=edlinuxguru@gmail.com; dkim=pass header.i=edlinuxguru@gmail.com Received: from mr.google.com ([10.68.125.195]) by 10.68.125.195 with SMTP id ms3mr43403445pbb.62.1330916025886 (num_hops = 1); Sun, 04 Mar 2012 18:53:45 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=NH6YTvXbDx6Fx3urlUJbm++gvp6ayzXbQaBwYX5DTyU=; b=TYOmhXOIAovX4Y9ZAi5Lu5JJ97+bc97qVWuJi/nXJiRQdV11M43ObuCpR9nF5Kv/Os +CPPuPeZgFPuuAmIZc/7E5KfOWoT2Pzc1lAeJGNGWDaPanNo+Gn3I8xTyIf0Cy8niwv7 0P7fCJKSGRNBMj9kv34w7DpKChUC6txH+X+MlKeDObyP6JORKB9IwM+HVWdufY356CEv GKoJ6aWwD2CMGWSKovMB5vf+gNiNWmEOjZ8tE1T/56wbFKHgqisHU5KMm0BFE+NTLQxU C/s3DOz/mEFvWDyLF+Ro98kEMLWTkHW/tVoO2WhHGW+yDi7xsbseRrZaOktNOaCeNfl0 LrbQ== MIME-Version: 1.0 Received: by 10.68.125.195 with SMTP id ms3mr37227014pbb.62.1330916025714; Sun, 04 Mar 2012 18:53:45 -0800 (PST) Received: by 10.142.4.32 with HTTP; Sun, 4 Mar 2012 18:53:45 -0800 (PST) In-Reply-To: <489411E1EA33684CB6E6E8A81970EA8232D7C4@PROD-EXCH-M3.corp.microstrategy.com> References: <489411E1EA33684CB6E6E8A81970EA8232D7C4@PROD-EXCH-M3.corp.microstrategy.com> Date: Sun, 4 Mar 2012 21:53:45 -0500 Message-ID: Subject: Re: load zip file to hive table From: Edward Capriolo To: user@hive.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org If the file ends in .bz2 .gz or .deflate there is nothing special you need to. TextInputFormat (the default) will automatically unzip and read these. However these types are not split-table so if the file is large it can not be processed in parallel. On Sun, Mar 4, 2012 at 9:26 PM, Lu, Wei wrote: > Hi, > > > > I need to load data directly from a ctl A delimiter zipped file from the > Linux box directly. > > Do I need to 1) un-zip the files and then load them to Hive tables, or 2) is > there a direct command that can load zipped data to Hive table directly? > > > > Thanks, > > Wei