Subject: Re: Configure Hive in Cluster
From: Nitin Pawar
To: user@hive.apache.org
Date: Wed, 23 Jan 2013 13:30:04 +0530

This is the error on the Hadoop job:

2013-01-23 12:15:44,884 INFO org.apache.hadoop.mapred.ReduceTask: Failed to fetch map-output from attempt_201301231151_0002_m_000001_0 even after MAX_FETCH_RETRIES_PER_MAP retries... or it is a read error, reporting to the JobTracker
2013-01-23 12:15:44,885 FATAL org.apache.hadoop.mapred.ReduceTask: Shuffle failed with too many fetch failures and insufficient progress! Killing task attempt_201301231151_0002_r_000000_0.
2013-01-23 12:15:45,220 FATAL org.apache.hadoop.mapred.Task: Failed to contact the tasktracker
org.apache.hadoop.ipc.RemoteException: java.io.IOException: JvmValidate Failed. Ignoring request from task: attempt_201301231151_0002_r_000000_0, with JvmId: jvm_201301231151_0002_r_1079250852

So something is wrong: either your network went down or the nodes went down. Hive tries to fetch the task log from that host (savitha-VirtualBox) and cannot figure out what that host is.

On Wed, Jan 23, 2013 at 1:28 PM, venkatramanan wrote:
> No, all the nodes are up and running. I don't know; when Hive takes the
> other node's "HOST NAME", that's the error, I guess.
>
> Revert me if I am wrong.
>
> On Wednesday 23 January 2013 01:07 PM, Nitin Pawar wrote:
>
> when you ran the query, did the VM shut down?
>
> On Wed, Jan 23, 2013 at 12:57 PM, venkatramanan <venkatramanann@smartek21.com> wrote:
>
>> Hi,
>>
>> I got the following error while executing "select count(1) from
>> tweettrend;"
>>
>> Below are the exact log messages from the JobTracker web interface.
>>
>> *Hive CLI error:*
>>
>> Exception in thread "Thread-21" java.lang.RuntimeException: Error while reading from task log url
>>     at org.apache.hadoop.hive.ql.exec.errors.TaskLogProcessor.getStackTraces(TaskLogProcessor.java:240)
>>     at org.apache.hadoop.hive.ql.exec.JobDebugger.showJobFailDebugInfo(JobDebugger.java:227)
>>     at org.apache.hadoop.hive.ql.exec.JobDebugger.run(JobDebugger.java:92)
>>     at java.lang.Thread.run(Thread.java:722)
>> Caused by: java.net.UnknownHostException: savitha-VirtualBox
>>     at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:178)
>>     at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:391)
>>     at java.net.Socket.connect(Socket.java:579)
>>     at java.net.Socket.connect(Socket.java:528)
>>     at sun.net.NetworkClient.doConnect(NetworkClient.java:180)
>>     at sun.net.www.http.HttpClient.openServer(HttpClient.java:378)
>>     at sun.net.www.http.HttpClient.openServer(HttpClient.java:473)
>>     at sun.net.www.http.HttpClient.<init>(HttpClient.java:203)
>>     at sun.net.www.http.HttpClient.New(HttpClient.java:290)
>>     at sun.net.www.http.HttpClient.New(HttpClient.java:306)
>>     at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:995)
>>     at sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:931)
>>     at sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:849)
>>     at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1299)
>>     at java.net.URL.openStream(URL.java:1037)
>>     at org.apache.hadoop.hive.ql.exec.errors.TaskLogProcessor.getStackTraces(TaskLogProcessor.java:192)
>>     ... 3 more
>> FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
>> MapReduce Jobs Launched:
>> Job 0: Map: 2  Reduce: 1  Cumulative CPU: 9.0 sec  HDFS Read: 408671053  HDFS Write: 0  FAIL
>> Total MapReduce CPU Time Spent: 9 seconds 0 msec
>>
>> *syslog logs*
>>
>> utCopier.copyOutput(ReduceTask.java:1394)
>>     at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.run(ReduceTask.java:1326)
>>
>> 2013-01-23 12:15:44,884 INFO org.apache.hadoop.mapred.ReduceTask: Task attempt_201301231151_0002_r_000000_0: Failed fetch #10 from attempt_201301231151_0002_m_000001_0
>> 2013-01-23 12:15:44,884 INFO org.apache.hadoop.mapred.ReduceTask: Failed to fetch map-output from attempt_201301231151_0002_m_000001_0 even after MAX_FETCH_RETRIES_PER_MAP retries... or it is a read error, reporting to the JobTracker
>> 2013-01-23 12:15:44,885 FATAL org.apache.hadoop.mapred.ReduceTask: Shuffle failed with too many fetch failures and insufficient progress! Killing task attempt_201301231151_0002_r_000000_0.
>> 2013-01-23 12:15:44,889 WARN org.apache.hadoop.mapred.ReduceTask: attempt_201301231151_0002_r_000000_0 adding host savitha-VirtualBox to penalty box, next contact in 137 seconds
>> 2013-01-23 12:15:44,889 INFO org.apache.hadoop.mapred.ReduceTask: attempt_201301231151_0002_r_000000_0: Got 1 map-outputs from previous failures
>> 2013-01-23 12:15:45,218 FATAL org.apache.hadoop.mapred.Task: attempt_201301231151_0002_r_000000_0 GetMapEventsThread Ignoring exception : org.apache.hadoop.ipc.RemoteException: java.io.IOException: JvmValidate Failed. Ignoring request from task: attempt_201301231151_0002_r_000000_0, with JvmId: jvm_201301231151_0002_r_1079250852
>>     at org.apache.hadoop.mapred.TaskTracker.validateJVM(TaskTracker.java:3278)
>>     at org.apache.hadoop.mapred.TaskTracker.getMapCompletionEvents(TaskTracker.java:3537)
>>     at sun.reflect.GeneratedMethodAccessor3.invoke(Unknown Source)
>>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>     at java.lang.reflect.Method.invoke(Method.java:601)
>>     at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:563)
>>     at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1388)
>>     at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1384)
>>     at java.security.AccessController.doPrivileged(Native Method)
>>     at javax.security.auth.Subject.doAs(Subject.java:415)
>>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
>>     at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1382)
>>
>>     at org.apache.hadoop.ipc.Client.call(Client.java:1070)
>>     at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225)
>>     at $Proxy1.getMapCompletionEvents(Unknown Source)
>>     at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.getMapCompletionEvents(ReduceTask.java:2846)
>>     at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(ReduceTask.java:2810)
>>
>> 2013-01-23 12:15:45,220 FATAL org.apache.hadoop.mapred.Task: Failed to contact the tasktracker
>> org.apache.hadoop.ipc.RemoteException: java.io.IOException: JvmValidate Failed. Ignoring request from task: attempt_201301231151_0002_r_000000_0, with JvmId: jvm_201301231151_0002_r_1079250852
>>     at org.apache.hadoop.mapred.TaskTracker.validateJVM(TaskTracker.java:3278)
>>     at org.apache.hadoop.mapred.TaskTracker.fatalError(TaskTracker.java:3520)
>>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>     at java.lang.reflect.Method.invoke(Method.java:601)
>>     at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:563)
>>     at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1388)
>>     at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1384)
>>     at java.security.AccessController.doPrivileged(Native Method)
>>     at javax.security.auth.Subject.doAs(Subject.java:415)
>>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
>>     at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1382)
>>
>>     at org.apache.hadoop.ipc.Client.call(Client.java:1070)
>>     at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225)
>>     at $Proxy1.fatalError(Unknown Source)
>>     at org.apache.hadoop.mapred.Task.reportFatalError(Task.java:298)
>>     at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$GetMapEventsThread.run(ReduceTask.java:2829)
>>
>> thanks,
>> Venkat
>>
>> -------- Original Message --------
>> Subject: Re: Configure Hive in Cluster
>> Date: Thu, 17 Jan 2013 17:23:03 +0530
>> From: venkatramanan
>> Reply-To: <user@hive.apache.org>
>> To: <user@hive.apache.org>
>>
>> Can you suggest the mandatory Hive parameters and the clustering
>> configuration steps?
>>
>> On Thursday 17 January 2013 12:56 PM, Nitin Pawar wrote:
>>
>> looks like a very small cluster with very limited memory to run MapReduce
>> jobs; also, the number of map/reduce slots on the nodes is low, so at a time only
>> one map is running.
>>
>> but still, 15 min is a lot of time for 600 MB.
>>
>> On Thu, Jan 17, 2013 at 12:47 PM, venkatramanan <venkatramanann@smartek21.com> wrote:
>>
>>> Below are the cluster configuration details:
>>>
>>> Configured Capacity : 82.8 GB
>>> DFS Used : 1.16 GB
>>> Non DFS Used : 31.95 GB
>>> DFS Remaining : 49.69 GB
>>> DFS Used% : 1.4 %
>>> DFS Remaining% : 60.01 %
>>> Live Nodes : 2
>>> Dead Nodes : 0
>>> Decommissioning Nodes : 0
>>> Number of Under-Replicated Blocks : 0
>>>
>>> My select query is:
>>>
>>> "select * from tweet where Id = 810;"
>>>
>>> This query takes 15 min to complete.
>>>
>>> On Thursday 17 January 2013 12:29 PM, Nitin Pawar wrote:
>>>
>>> how many nodes do you have for the select query?
>>> what's your select query?
>>>
>>> if it's just a "select * from table" then it does not run any MapReduce job,
>>> so it's just taking time to show the data on your screen if you are using
>>> that query
>>>
>>> On Thu, Jan 17, 2013 at 12:24 PM, venkatramanan <venkatramanann@smartek21.com> wrote:
>>>
>>>> I didn't set any Hive parameters, and my total table size is only 610 MB.
>>>>
>>>> On Thursday 17 January 2013 12:11 PM, Nitin Pawar wrote:
>>>>
>>>> a bit more detail on the size of the table and the select query will help;
>>>> also, did you set any Hive parameters?
>>>>
>>>> On Thu, Jan 17, 2013 at 12:12 PM, venkatramanan <venkatramanann@smartek21.com> wrote:
>>>>
>>>>> Hi All,
>>>>>
>>>>> I am a newbie to Apache Hive. I have created a table that points to
>>>>> an HDFS folder path, and it takes 15 min to execute a simple
>>>>> "select" statement. Can anyone suggest best practices and
>>>>> performance improvements for Hive?
>>>>>
>>>>> Thanks in advance,
>>>>>
>>>>> Venkat
>>>>
>>>> --
>>>> Nitin Pawar
>>>
>>> --
>>> Nitin Pawar
>>
>> --
>> Nitin Pawar
>
> --
> Nitin Pawar

--
Nitin Pawar
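[Editor's note] The root cause in the trace above is that the hostname savitha-VirtualBox does not resolve from the machine running the Hive CLI, so both the reducer's map-output fetch and Hive's task-log fetch fail with java.net.UnknownHostException. The usual remedy is to make every node's hostname resolvable from every other node, typically via /etc/hosts entries on each machine. A minimal sketch (not from the thread) for checking resolution from any node:

```python
import socket

def resolvable(hostname):
    """Return True if `hostname` resolves to an IP address on this machine.

    In a Hadoop/Hive cluster, every node (and the machine running the
    Hive CLI) must be able to resolve every other node's hostname, or
    task-log and map-output fetches fail as in the stack trace above.
    """
    try:
        socket.gethostbyname(hostname)
        return True
    except socket.gaierror:
        return False

# 'localhost' should resolve on any sanely configured machine; a cluster
# hostname missing from DNS and /etc/hosts (like savitha-VirtualBox in
# the trace, as seen from the CLI host) will not.
print(resolvable("localhost"))
```

If this returns False for a worker's hostname when run on the JobTracker or client machine, adding a line such as `192.168.1.10  savitha-VirtualBox` (address hypothetical) to /etc/hosts on that machine is the common fix.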