hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Amit Behera <amit.bd...@gmail.com>
Subject Re: Container launch failed Error
Date Mon, 24 Nov 2014 17:26:52 GMT
I did not modify in all the slaves. except slave

will it be a problem ?

But for small data (up to 20 GB table) it is running and for 300GB table
only count(*) running sometimes and sometimes failed

Thanks
Amit

On Mon, Nov 24, 2014 at 10:37 PM, Daniel Haviv <
daniel.haviv@veracity-group.com> wrote:

> did you copy the hosts file to all the nodes?
>
> Daniel
>
> On 24 בנוב׳ 2014, at 19:04, Amit Behera <amit.bdk10@gmail.com> wrote:
>
> hi Daniel,
>
>
> this stacktrace same for other query .
> for different run I am getting slave7 sometime slave8...
>
> And also I registered all machine IPs in /etc/hosts
>
> Regards
> Amit
>
>
>
> On Mon, Nov 24, 2014 at 10:22 PM, Daniel Haviv <
> daniel.haviv@veracity-group.com> wrote:
>
>> It seems that the application master can't resolve slave6's name to an IP
>>
>> Daniel
>>
>> On 24 בנוב׳ 2014, at 18:49, Amit Behera <amit.bdk10@gmail.com> wrote:
>>
>> Hi Users,
>>
>> *my cluster(1+8) configuration*:
>>
>> RAM  : 32 GB each
>> HDFS : 1.5 TB SSD
>> CPU   : 8 core each
>>
>> -----------------------------------------------
>>
>> I am trying to query on 300GB of table but I am able to run only select
>> query.
>>
>> Except select query , for all other query I am getting following
>> exception.
>>
>>
>>
>>
>>
>> Total jobs = 1
>>
>> Stage-1 is selected by condition resolver.
>>
>> Launching Job 1 out of 1
>>
>> Number of reduce tasks not specified. Estimated
>> from input data size: 183
>>
>> In order to change the average load for a
>> reducer (in bytes):
>>
>>   set
>> hive.exec.reducers.bytes.per.reducer=<number>
>>
>> In order to limit the maximum number of
>> reducers:
>>
>>   set hive.exec.reducers.max=<number>
>>
>> In order to set a constant number of reducers:
>>
>>   set mapreduce.job.reduces=<number>
>>
>> Starting Job = job_1416831990090_0005, Tracking
>> URL = http://master:8088/proxy/application_1416831990090_0005/
>>
>> Kill Command = /root/hadoop/bin/hadoop job
>> -kill job_1416831990090_0005
>>
>> Hadoop job information for Stage-1: number of
>> mappers: 679; number of reducers: 183
>>
>> 2014-11-24 19:43:01,523 Stage-1 map = 0%,
>> reduce = 0%
>>
>> 2014-11-24 19:43:22,730 Stage-1 map = 53%,
>> reduce = 0%, Cumulative CPU 625.19 sec
>>
>> 2014-11-24 19:43:23,778 Stage-1 map = 100%,
>> reduce = 100%
>>
>> MapReduce Total cumulative CPU time: 10 minutes
>> 25 seconds 190 msec
>>
>> Ended Job = job_1416831990090_0005 with errors
>>
>> Error during job, obtaining debugging
>> information...
>>
>> Examining task ID:
>> task_1416831990090_0005_m_000005 (and more) from job
>> job_1416831990090_0005
>>
>> Examining task ID:
>> task_1416831990090_0005_m_000042 (and more) from job
>> job_1416831990090_0005
>>
>> Examining task ID:
>> task_1416831990090_0005_m_000035 (and more) from job
>> job_1416831990090_0005
>>
>> Examining task ID:
>> task_1416831990090_0005_m_000065 (and more) from job
>> job_1416831990090_0005
>>
>> Examining task ID:
>> task_1416831990090_0005_m_000002 (and more) from job
>> job_1416831990090_0005
>>
>> Examining task ID:
>> task_1416831990090_0005_m_000007 (and more) from job
>> job_1416831990090_0005
>>
>> Examining task ID:
>> task_1416831990090_0005_m_000058 (and more) from job
>> job_1416831990090_0005
>>
>> Examining task ID:
>> task_1416831990090_0005_m_000043 (and more) from job
>> job_1416831990090_0005
>>
>>
>>  Task with the most failures(4):
>>
>> -----
>>
>> Task ID:
>>
>>   task_1416831990090_0005_m_000005
>>
>>
>>  URL:
>>
>>  http://master:8088/taskdetails.jsp?jobid=job_1416831990090_0005&tipid=task_1416831990090_0005_m_000005
>>
>> -----
>>
>> Diagnostic Messages for this Task:
>>
>> Container launch failed for
>> container_1416831990090_0005_01_000112 :
>> java.lang.IllegalArgumentException: java.net.UnknownHostException:
>> slave6
>>
>> 	at
>> org.apache.hadoop.security.SecurityUtil.buildTokenService(SecurityUtil.java:418)
>>
>> 	at
>> org.apache.hadoop.security.SecurityUtil.setTokenService(SecurityUtil.java:397)
>>
>> 	at
>> org.apache.hadoop.yarn.util.ConverterUtils.convertFromYarn(ConverterUtils.java:233)
>>
>> 	at
>> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy$ContainerManagementProtocolProxyData.newProxy(ContainerManagementProtocolProxy.java:211)
>>
>> 	at
>> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy$ContainerManagementProtocolProxyData.<init>(ContainerManagementProtocolProxy.java:189)
>>
>> 	at
>> org.apache.hadoop.yarn.client.api.impl.ContainerManagementProtocolProxy.getProxy(ContainerManagementProtocolProxy.java:110)
>>
>> 	at
>> org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl.getCMProxy(ContainerLauncherImpl.java:403)
>>
>> 	at
>> org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl$Container.launch(ContainerLauncherImpl.java:138)
>>
>> 	at
>> org.apache.hadoop.mapreduce.v2.app.launcher.ContainerLauncherImpl$EventProcessor.run(ContainerLauncherImpl.java:369)
>>
>> 	at
>> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>>
>> 	at
>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>>
>> 	at java.lang.Thread.run(Thread.java:745)
>>
>> Caused by: java.net.UnknownHostException: slave6
>>
>> 	... 12 more
>>
>>
>>
>>  FAILED: Execution Error, return code 2 from
>> org.apache.hadoop.hive.ql.exec.mr.MapRedTask
>>
>> MapReduce Jobs Launched:
>>
>> Job 0: Map: 679  Reduce: 183   Cumulative CPU:
>> 625.19 sec   HDFS Read: 0 HDFS Write: 0 FAIL
>>
>> Total MapReduce CPU Time Spent: 10 minutes 25
>> seconds 190 mse
>>
>>
>>
>>
>> Please help me to fix the issue.
>>
>> Thanks
>> Amit
>>
>>
>

Mime
View raw message