From: Bryan Talbot
To: user@hive.apache.org
Date: Tue, 14 Dec 2010 20:36:28 -0800
Subject: Re: hive newbie - importing data into hive

I'll help by recommending that you get started by looking at the "Getting Started Guide".

http://wiki.apache.org/hadoop/Hive/GettingStarted

-Bryan


On Tue, Dec 14, 2010 at 8:23 PM, Sean Curtis <sean.curtis@gmail.com> wrote:
> just wondering if i have a pipe delimited file, how i can just import this data into hive:
>
> basically i am using the microlens database, which is pipe separated. for example:
>
> user id | age | gender | occupation | zip code
>
> translates to
>
> 123 | 24 | M | worker | 12345
>
>
> i'd like to just import this straight into Hive. my initial thoughts:
> 1. use unix substitute command and change all "|" to "Ctrl-A".
> 2. import into mysql, then use sqoop
>
> seems it should be easier than this. can someone help?
>
> thanks for the help.
>
> sean
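
The first idea in the quoted message (changing every "|" to Ctrl-A) could look roughly like the Python sketch below. It is only an illustration under stated assumptions: Ctrl-A is taken to be the byte 0x01, Hive's default field delimiter for text tables; the spaces around each "|" in the sample row are assumed to be padding rather than data; and the names `pipe_to_ctrl_a` and `convert_file` are hypothetical helpers, not part of Hive.

```python
CTRL_A = "\x01"  # Ctrl-A, Hive's default field delimiter for text tables


def pipe_to_ctrl_a(line: str) -> str:
    """Convert one pipe-delimited record into a Ctrl-A-delimited record."""
    # Split on "|" and strip the padding around each field
    # (assumption: the spaces in the sample row are not part of the data).
    fields = [field.strip() for field in line.rstrip("\n").split("|")]
    return CTRL_A.join(fields)


def convert_file(src_path: str, dst_path: str) -> None:
    """Hypothetical helper: rewrite src_path into dst_path, one record per line."""
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            dst.write(pipe_to_ctrl_a(line) + "\n")


if __name__ == "__main__":
    # The sample row from the question, shown with repr() so the
    # non-printing delimiter is visible.
    print(repr(pipe_to_ctrl_a("123 | 24 | M | worker | 12345")))
```

The conversion may not be needed at all: Hive's table DDL can declare the delimiter directly with `ROW FORMAT DELIMITED FIELDS TERMINATED BY '|'`, after which the file can be loaded as-is. In that case any padding around the fields would stay in the data.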
