From: Marcus Herou
To: hive-user@hadoop.apache.org
Reply-To: hive-user@hadoop.apache.org
Date: Wed, 4 Mar 2009 11:49:52 +0100
Subject: Cannot issue query
Message-ID: <7e536b1f0903040249n34be6049j10cafe6a96a8e2b3@mail.gmail.com>
In-Reply-To: <7e536b1f0903040232p2dfb5548t34400cd59d9ceef7@mail.gmail.com>
References: <7e536b1f0903040232p2dfb5548t34400cd59d9ceef7@mail.gmail.com>
Mailing-List: contact hive-user-help@hadoop.apache.org; run by ezmlm
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary=001636c59864b5c62e046448cf0d

--001636c59864b5c62e046448cf0d
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit

Hi.

I started experimenting with Hive today, since it seems to suit us quite well. We already process our weblog stats with Hadoop and end up writing SQL-like logic as Hadoop jobs, so it seems fair to try a system that does that in one step :)

I created a table and loaded data into Hive with the following statements:

hive> drop table DailyUniqueSiteVisitorSample;
OK
Time taken: 4.064 seconds
hive> CREATE TABLE DailyUniqueSiteVisitorSample (sampleDate date,uid bigint,site int,concreteStatistics int,network smallint,category smallint,country smallint,countryCode String,sessions smallint,pageImpressions smallint) COMMENT 'This is our weblog stats table' PARTITIONED BY(dt STRING) ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' LINES TERMINATED BY '\n' STORED AS TEXTFILE;
OK
Time taken: 0.248 seconds
hive> LOAD DATA LOCAL INPATH '/tmp/data-DenormalizedSiteVisitor.VisitsPi.2009-03-02.csv' INTO TABLE DailyUniqueSiteVisitorSample PARTITION(dt='2009-03-02');
Copying data from file:/tmp/data-2009-03-02.csv
Loading data to table dailyuniquesitevisitorsample partition {dt=2009-03-02}
OK
Time taken: 2.258 seconds

I am a little confused about the TEXTFILE part (the tutorial only uses SequenceFiles), but since the CSV I need to load is a text file, I used it, and it seems to work.

Anyway, this goes well, but when I issue a simple query like the one below, it throws an exception:

hive> select d.* from dailyuniquesitevisitorsample d where d.site=1;
Total MapReduce jobs = 1
Number of reduce tasks is set to 0 since there's no reduce operator
java.lang.AbstractMethodError: org.apache.hadoop.hive.ql.io.HiveInputFormat.validateInput(Lorg/apache/hadoop/mapred/JobConf;)V
        at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:735)
        at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:391)
        at org.apache.hadoop.hive.ql.Driver.run(Driver.java:239)
        at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:174)
        at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:207)
        at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:306)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.util.RunJar.main(RunJar.java:155)
        at org.apache.hadoop.mapred.JobShell.run(JobShell.java:54)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
        at org.apache.hadoop.mapred.JobShell.main(JobShell.java:68)

I run Hadoop 0.18.2.

I am not sure that I am doing this correctly. Please guide me if I am doing something stupid.

Kindly

//Marcus

--
Marcus Herou CTO and co-founder Tailsweep AB
+46702561312
marcus.herou@tailsweep.com
http://www.tailsweep.com/
http://blogg.tailsweep.com/

--001636c59864b5c62e046448cf0d--