From: Rakesh Setty
To: "hive-user@hadoop.apache.org"
Date: Tue, 7 Jul 2009 12:00:12 -0700
Subject: RE: Issue with nested types
Message-ID: <929E373E02121B49BCCC534E03B137A253330E7FC7@SP1-EX07VS01.ds.corp.yahoo.com>

I think this solution will not deal with maps within maps and lists within lists.
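To make the concern concrete, here is a minimal Python sketch, not Hive code, of why one delimiter per container type may not be enough once a map is nested inside another map. It assumes Ctrl-C separates map records and Ctrl-D separates a key from its value at every depth. The function names and the sample value are hypothetical.

```python
CTRL_C, CTRL_D = "\x03", "\x04"  # map record and key delimiters (assumed)


def serialize_map(mapping):
    """Serialize a map, reusing the same two delimiters at every depth."""
    records = []
    for key, value in mapping.items():
        if isinstance(value, dict):
            value = serialize_map(value)
        records.append(key + CTRL_D + value)
    return CTRL_C.join(records)


def parse_flat_map(text):
    """Parse one level of map records; the first Ctrl-D ends the key."""
    pairs = (record.partition(CTRL_D) for record in text.split(CTRL_C))
    return {key: value for key, _, value in pairs}


# Hypothetical nested value, made up for illustration.
nested = {"a": {"x": "1", "y": "2"}, "b": {"z": "3"}}
encoded = serialize_map(nested)
decoded = parse_flat_map(encoded)

print(decoded)
```

In this sketch the inner key "y" comes back as a top-level key, because the reader cannot tell an inner record boundary from an outer one. Each nesting depth would need its own delimiter to avoid that.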

 

Thanks,

Rakesh

 


From: Rakesh Setty
Sent: Tuesday, July 07, 2009 11:37 AM
To: 'hive-user@hadoop.apache.org'
Subject: Issue with nested types

 

Hi,

 

The nested types support added recently through JIRA HIVE-603 is very useful, but I have an issue with the schema specification.

I have a table page_views with two columns. page_info is a map that uses Ctrl-D as the key delimiter and Ctrl-C as the key-value pair (record) delimiter. page_links is a list of maps in which list items are separated by Ctrl-B and each map uses the same Ctrl-D and Ctrl-C delimiters mentioned above.
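The layout described above can be sketched in Python as follows. This is only an illustration of the intended parse, not Hive's deserializer. The function names, keys, and sample values are hypothetical.

```python
# Delimiters as described in the message (Ctrl-B, Ctrl-C, Ctrl-D).
CTRL_B = "\x02"  # separates items of the page_links list
CTRL_C = "\x03"  # separates key-value pairs (records) inside a map
CTRL_D = "\x04"  # separates a key from its value


def parse_map(text):
    """Split a serialized map into a dict of string keys and values."""
    result = {}
    for record in text.split(CTRL_C):
        key, _, value = record.partition(CTRL_D)
        result[key] = value
    return result


def parse_list_of_maps(text):
    """Split a serialized list on Ctrl-B, then parse each item as a map."""
    return [parse_map(item) for item in text.split(CTRL_B)]


# Hypothetical sample values; the keys and values are made up for illustration.
page_info = "title" + CTRL_D + "Home" + CTRL_C + "lang" + CTRL_D + "en"
page_links = ("url" + CTRL_D + "/a" + CTRL_C + "text" + CTRL_D + "A"
              + CTRL_B
              + "url" + CTRL_D + "/b" + CTRL_C + "text" + CTRL_D + "B")

print(parse_map(page_info))
print(parse_list_of_maps(page_links))
```

Both columns share the same two map delimiters here. Only page_links needs the extra Ctrl-B level for its list items.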

In the DDL statement, if I do not specify the "collection items terminated by" and "map keys terminated by" clauses, page_links is deserialized properly but page_info is not. If I specify collection items terminated by '\003' and map keys terminated by '\004', page_info is deserialized properly but page_links is not. I think the reason is that, for page_links, '\003' (Ctrl-C) is then treated as the delimiter for both array items and map records, whereas my data uses Ctrl-B as the array delimiter and Ctrl-C as the map record delimiter.
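The suspected collision can be reproduced outside Hive with a small Python sketch. It models only the hypothesis stated above (Ctrl-C used for both list items and map records) and makes no claim about how Hive's SerDe actually assigns delimiters. The helper name and sample data are hypothetical.

```python
CTRL_B, CTRL_C, CTRL_D = "\x02", "\x03", "\x04"


def read_list_of_maps(text, item_sep, record_sep, key_sep):
    """Split into list items, then records, then key/value pairs."""
    items = []
    for item in text.split(item_sep):
        pairs = (record.partition(key_sep) for record in item.split(record_sep))
        items.append({key: value for key, _, value in pairs})
    return items


# Hypothetical page_links value: two list items, each a map with two entries.
page_links = ("url" + CTRL_D + "/a" + CTRL_C + "text" + CTRL_D + "A"
              + CTRL_B
              + "url" + CTRL_D + "/b" + CTRL_C + "text" + CTRL_D + "B")

# Delimiters that match the data: Ctrl-B for items, Ctrl-C for records.
expected = read_list_of_maps(page_links, CTRL_B, CTRL_C, CTRL_D)

# Assumed faulty reading: Ctrl-C used for both list items and map records.
collided = read_list_of_maps(page_links, CTRL_C, CTRL_C, CTRL_D)

print(len(expected), expected)
print(len(collided), collided)
```

With matching delimiters the value parses as two maps. With the shared delimiter it parses as three single-entry maps, one of which still has the raw Ctrl-B embedded in its value.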

I think we should replace the clause "collection items terminated by" with separate clauses like "list items terminated by" and "map items terminated by".

 

Thanks,

Rakesh
