Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3AE5A10421 for ; Fri, 20 Sep 2013 19:48:28 +0000 (UTC) Received: (qmail 83647 invoked by uid 500); 20 Sep 2013 19:48:22 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 83395 invoked by uid 500); 20 Sep 2013 19:48:20 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 83372 invoked by uid 99); 20 Sep 2013 19:48:19 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 Sep 2013 19:48:19 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: local policy) Received: from [98.138.229.31] (HELO nm38.bullet.mail.ne1.yahoo.com) (98.138.229.31) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 20 Sep 2013 19:48:11 +0000 Received: from [127.0.0.1] by nm38.bullet.mail.ne1.yahoo.com with NNFMP; 20 Sep 2013 19:47:48 -0000 Received: from [98.138.90.51] by nm38.bullet.mail.ne1.yahoo.com with NNFMP; 20 Sep 2013 19:44:51 -0000 Received: from [66.196.81.170] by tm4.bullet.mail.ne1.yahoo.com with NNFMP; 20 Sep 2013 19:44:51 -0000 Received: from [98.139.212.208] by tm16.bullet.mail.bf1.yahoo.com with NNFMP; 20 Sep 2013 19:44:50 -0000 Received: from [127.0.0.1] by omp1017.mail.bf1.yahoo.com with NNFMP; 20 Sep 2013 19:44:50 -0000 X-Yahoo-Newman-Property: ymail-4 X-Yahoo-Newman-Id: 928542.47254.bm@omp1017.mail.bf1.yahoo.com Received: (qmail 88271 invoked by uid 60001); 20 Sep 2013 19:44:50 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1379706290; bh=7PR1/yFsLFpD4vRugOrSejSjhDR2gvPSdiErwqxQGBc=; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=i60QjCVhh5YBzZEtpi5rZGIREToTzcd5smKtz7+k043j0O3A5AfJtlcPlNsocuzkHOwDsLUB9r8k6byMMaWbs3tgGYq+Ub6JSof/zoXh6LV01JKhRh7L8K87yaNEXX0VdLI35fUTZOERshGxr/yyqhxBlKf0JJSTagOcxYZvsnk= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=depwrA2A8BXAp3zwvl+yZ6H5dQeCKHYRqOGtZH27zKL7mrslNuLCHQQzTmsIK/j4o/xWpKaXt1XfYAA8py/UX1XLOR/WUcs71OAZwJOWYEc1khzRbUhOXIN5yEn1CFb+EPXOrWFYjlkdJ47jvqiHmNtkkNzXdX+43QFyWU2emyw=; X-YMail-OSG: 2mMhsrkVM1k3UANkc9dMtlecS.OepYHmpGqYDRkJxxKPcd4 ryg6wt3WotgcPTTsb5w2ZP0Tksfe2QNG4fkRWd0gvbkiQxN0MOcUc3xZoerP 7OH4V.9GkFsSNyXYg.Y5ThO5FoEkwnD6NtmSrV88gzZdDztSDSduP_aEa2Jp 96F5X5sfZR.KxbONFNXLxah_q_zprAv49uhsoWC5mIArxlKCp_2fHVrTbs7F 3YrJazwGFUIbHFtxfvQvFujPjr_r5EWMmEF6A1u41faXIJUlA.SPA5Noy8fZ _bofVrYwCpQXN4nYJAkxbSblZs3B8.ko06gLJ_jP5QojHXkt2Vl5fHORu2Yp pFUJM788fH93pk21efZN7KVBzOvoROj4Gmf3Xt7FjDCavlJZTw93nHPpeVAU e5nBsCnPKFkRnkQ_q2vgebm7OQEJ5X2rV.0dXm6M_3OIzTxDPNl1N1yQQJKX U5qmLRu78WLgsOgWt0gv0D59rRKFeiYQHnf0RAi4d44idNY2eSvFkiwukIoa UYzC__ADAPQYFGEzh4kmmuIYYYqPwthqU9jLDVxCkh_HEB7DeVrw- Received: from [151.124.247.200] by web162206.mail.bf1.yahoo.com via HTTP; Fri, 20 Sep 2013 12:44:50 PDT X-Rocket-MIMEInfo: 002.001,SGkgTml0aW4sCsKgClRoYW5rcyBmb3IgdGhlIHJlcGx5LiBJIGhhdmUgYSBodWdlIGZpbGUgaW4gdW5peC4KwqAKQXMgcGVyIHRoZSBmaWxlIGRlZmluaXRpb24sIHRoZSBmaWxlIGlzIGEgdGFiIHNlcGFyYXRlZCBmaWxlIG9mIGZpZWxkcy4gQnV0IEkgYW0gc3VyZSB0aGF0IHdpdGhpbsKgc29tZSBmaWVsZCdzIEkgaGF2ZSBzb21lIG5ldyBsaW5lIGNoYXJhY3Rlci4gCsKgCkhvdyBzaG91bGQgSSBmaW5kIGEgcmVjb3JkPyBJdCBpcyBhIGh1Z2UgZmlsZS4gSXMgdGhlcmUgc29tZSBjb21tYW5kPwrCoApUaGEBMAEBAQE- X-Mailer: YahooMailWebService/0.8.157.561 References: <1379703865.46912.YahooMailNeo@web162201.mail.bf1.yahoo.com> <1379704127.62601.YahooMailNeo@web162205.mail.bf1.yahoo.com> Message-ID: <1379706290.87991.YahooMailNeo@web162206.mail.bf1.yahoo.com> Date: Fri, 20 Sep 2013 12:44:50 -0700 (PDT) From: Raj Hadoop Reply-To: Raj Hadoop Subject: Re: How to load /t /n file to Hive To: "user@hive.apache.org" In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="1120943518-1165846899-1379706290=:87991" X-Virus-Checked: Checked by ClamAV on apache.org --1120943518-1165846899-1379706290=:87991 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable Hi Nitin,=0A=A0=0AThanks for the reply. I have a huge file in unix.=0A=A0= =0AAs per the file definition, the file is a tab separated file of fields. = But I am sure that within=A0some field's I have some new line character. = =0A=A0=0AHow should I find a record? It is a huge file. Is there some comma= nd?=0A=A0=0AThanks,=0A=A0=0A=0A=0A________________________________=0AFrom: = Nitin Pawar =0ATo: "user@hive.apache.org" ; Raj Hadoop =0ASent: Friday, September= 20, 2013 3:15 PM=0ASubject: Re: How to load /t /n file to Hive=0A=0A=0A=0A= If your data contains new line chars, its better you write a custom map red= uce job and convert the data into a single line removing all unwanted chars= in column separator as well just having single new line char per line=A0= =0A=0A=0A=0AOn Sat, Sep 21, 2013 at 12:38 AM, Raj Hadoop wrote:=0A=0APlease note that there is an escape chacter in the fields w= here the /t and /n are present.=0A>=0A>=0A>=0A>From: Raj Hadoop =0A>To: Hive =0A>Sent: Friday, September 2= 0, 2013 3:04 PM=0A>Subject: How to load /t /n file to Hive=0A>=0A>=0A>=0A>H= i,=0A>=0A>I have a file which is delimted by a tab. Also, there are some fi= elds in the file which has a tab /t character and a new line /n character i= n=A0some fields.=0A>=0A>Is there any way to load this file using Hive load = command? Or do i have to use a Custom Map Reduce (custom) Input format with= java ? Please advise.=0A>=0A>Thanks,=0A>Raj=0A>=0A>=0A=0A=0A-- =0ANitin Pa= war --1120943518-1165846899-1379706290=:87991 Content-Type: text/html; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable
Hi Nitin,
 
Thanks for the reply= . I have a huge file in unix.
 
As per the file definition, t= he file is a tab separated file of fields. But I am sure that within s= ome field's I have some new line character.
 
How should I find a record? I= t is a huge file. Is there some command?
 
Thanks,
 

From: Nitin Pawar <nitinpawar432@gmail= .com>
To: "user@hive.= apache.org" <user@hive.apache.org>; Raj Hadoop <hadoopraj@yahoo.co= m>
Sent: Friday, Sep= tember 20, 2013 3:15 PM
Subject: Re: How to load /t /n file to Hive

If your data contains new line chars, = its better you write a custom map reduce job and convert the data into a si= ngle line removing all unwanted chars in column separator as well just havi= ng single new line char per line 


On Sat, Sep 21, 2013 at 12:38 AM, Raj= Hadoop <hadoopraj@yah= oo.com> wrote:
Please note that there is an escape chacter in the fields where = the /t and /n are present.

From: Raj Hadoop <hadoopraj@yahoo.com>
To: Hive <user@hive.apache.org>
Sent: Friday, September 20, 2013 3:04 PM
Subject: How to load /t /n file to Hive
<= /FONT>

Hi,
 
I have a file which is delimted by a tab. Also, there are some fields = in the file which has a tab /t character and a new line /n character in&nbs= p;some fields.
 
Is there any way to load this file using Hive load command? Or do i ha= ve to use a Custom Map Reduce (custom) Input format with java ? Please advi= se.
 
Thanks,
Raj


=



--
Nitin Pawar


=
--1120943518-1165846899-1379706290=:87991--