From: Josh Ferguson
To: hive-user@hadoop.apache.org
Subject: Re: Failed Query
Date: Thu, 11 Dec 2008 19:57:04 -0800

Is there any word on what we should do with this? It's sort of blocking me from doing things. Anyone have any leads?

Josh

On Dec 10, 2008, at 8:32 PM, Josh Ferguson wrote:

> Forgot about this one
>
> hive> describe extended aggregations;
> OK
> value int
> account string
> application string
> dataset string
> hour int
> aggregation string
> aggregated_by string
> Detailed Table Information:
> Table(tableName:aggregations,dbName:default,owner:Josh,
> createTime:1228463486,lastAccessTime:0,retention:0,
> sd:StorageDescriptor(cols:[FieldSchema(name:value,type:int,comment:null)],
> location:/user/hive/warehouse/aggregations,
> inputFormat:org.apache.hadoop.mapred.TextInputFormat,
> outputFormat:org.apache.hadoop.hive.ql.io.IgnoreKeyTextOutputFormat,
> compressed:false,numBuckets:-1,
> serdeInfo:SerDeInfo(name:null,
> serializationLib:org.apache.hadoop.hive.serde2.dynamic_type.DynamicSerDe,
> parameters:{colelction.delim=44,mapkey.delim=58,
> serialization.format=org.apache.hadoop.hive.serde2.thrift.TCTLSeparatedProtocol}),
> bucketCols:[],sortCols:[],parameters:{}),
> partitionKeys:[FieldSchema(name:account,type:string,comment:null),
> FieldSchema(name:application,type:string,comment:null),
> FieldSchema(name:dataset,type:string,comment:null),
> FieldSchema(name:hour,type:int,comment:null),
> FieldSchema(name:aggregation,type:string,comment:null),
> FieldSchema(name:aggregated_by,type:string,comment:null)],
> parameters:{})
> Time taken: 2.884 seconds
>
> On Dec 10, 2008, at 8:31 PM, Josh Ferguson wrote:
>
>> hive> SELECT COUNT(actor_id) AS value FROM activities WHERE (
>>     > account='80c27664-b047-4c0a-86f3-342c0cdf36c7' AND application='myproduct'
>>     > AND dataset='purchase' AND hour=341165 );
>>
>> This actually works fine; it's writing to the new table that is broken.
>>
>> hive> describe extended activities;
>> OK
>> actor_id string
>> actee_id string
>> properties map
>> account string
>> application string
>> dataset string
>> hour int
>> Detailed Table Information:
>> Table(tableName:activities,dbName:default,owner:Josh,
>> createTime:1228208598,lastAccessTime:0,retention:0,
>> sd:StorageDescriptor(cols:[FieldSchema(name:actor_id,type:string,comment:null),
>> FieldSchema(name:actee_id,type:string,comment:null),
>> FieldSchema(name:properties,type:map,comment:null)],
>> location:/user/hive/warehouse/activities,
>> inputFormat:org.apache.hadoop.mapred.TextInputFormat,
>> outputFormat:org.apache.hadoop.hive.ql.io.IgnoreKeyTextOutputFormat,
>> compressed:false,numBuckets:32,
>> serdeInfo:SerDeInfo(name:null,
>> serializationLib:org.apache.hadoop.hive.serde2.dynamic_type.DynamicSerDe,
>> parameters:{colelction.delim=44,mapkey.delim=58,
>> serialization.format=org.apache.hadoop.hive.serde2.thrift.TCTLSeparatedProtocol}),
>> bucketCols:[actor_id, actee_id],sortCols:[],parameters:{}),
>> partitionKeys:[FieldSchema(name:account,type:string,comment:null),
>> FieldSchema(name:application,type:string,comment:null),
>> FieldSchema(name:dataset,type:string,comment:null),
>> FieldSchema(name:hour,type:int,comment:null)],
>> parameters:{})
>> Time taken: 2.656 seconds
>>
>> I'm not sure what describe extended partition is; it doesn't work for me.
>>
>> Josh
>>
>> On Dec 10, 2008, at 2:31 PM, Raghu Murthy wrote:
>>
>>> Can you check the output of the query without the insert clause?
>>>
>>> SELECT COUNT(actor_id) AS value FROM activities WHERE (
>>> account='80c27664-b047-4c0a-86f3-342c0cdf36c7' AND application='myproduct'
>>> AND dataset='purchase' AND hour=341165 )
>>>
>>> Is it empty?
>>>
>>>
>>> On 12/10/08 1:08 AM, "Josh Ferguson" wrote:
>>>
>>>> I should say this happens during the last reduce step of this query.
>>>>
>>>> Josh
>>>>
>>>> On Dec 10, 2008, at 1:06 AM, Josh Ferguson wrote:
>>>>
>>>>> This query
>>>>>
>>>>> INSERT OVERWRITE TABLE aggregations PARTITION (
>>>>> account='80c27664-b047-4c0a-86f3-342c0cdf36c7',
>>>>> application='myproduct',
>>>>> dataset='purchase', hour=341165, aggregation='count',
>>>>> aggregated_by='all' )
>>>>> SELECT COUNT(actor_id) AS value FROM activities WHERE (
>>>>> account='80c27664-b047-4c0a-86f3-342c0cdf36c7' AND application='myproduct'
>>>>> AND dataset='purchase' AND hour=341165 )
>>>>>
>>>>> generates this message
>>>>>
>>>>> java.lang.RuntimeException: Error while closing operators:
>>>>> org.apache.hadoop.hive.ql.metadata.HiveException:
>>>>> org.apache.hadoop.hive.serde2.SerDeException:
>>>>> org.apache.hadoop.hive.serde2.SerDeException: Trying to serialize 0 fields into a struct with 1
>>>>> at org.apache.hadoop.hive.ql.exec.ExecReducer.close(ExecReducer.java:202)
>>>>> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:440)
>>>>> at org.apache.hadoop.mapred.Child.main(Child.java:155)
>>>>>
>>>>>
>>>>> Josh Ferguson