hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Devaraj k <devara...@huawei.com>
Subject RE: Debug MR tasks impossible.
Date Thu, 29 Mar 2012 16:11:19 GMT
Hi Pedro,

    You can use the below property to pass the debug params to the child jvm. And also you
should make sure that have only one task running at a time by giving the input appropriately.

  <description>Java opts for the task tracker child processes.
  The following symbol, if present, will be interpolated: @taskid@ is replaced
  by current TaskID. Any other occurrences of '@' will go unchanged.
  For example, to enable verbose gc logging to a file named for the taskid in
  /tmp and to set the heap maximum to be a gigabyte, pass a 'value' of:
        -Xmx1024m -verbose:gc -Xloggc:/tmp/@taskid@.gc

  The configuration variable mapred.child.ulimit can be used to control the
  maximum virtual memory of the child processes.

From: Ravi Prakash [ravihadoop@gmail.com]
Sent: Thursday, March 29, 2012 9:36 PM
To: mapreduce-user@hadoop.apache.org
Subject: Re: Debug MR tasks impossible.

Hi Pedro,

I know its sub-optimal but you should be able to put in as many System.out.println / log messages
as you want and you should be able to see them in stdout, and syslog files. Which version
of hadoop are you using?

On Thu, Mar 29, 2012 at 10:33 AM, Pedro Costa <psdc1978@gmail.com<mailto:psdc1978@gmail.com>>

I'm trying to debug map and reduce tasks for a quite long time, and it seems that it's impossible.
MR are launched in new process and there's no way to debug them. Even with IsolationRunner
class it's impossible. This isn't good because I really need to debug the class, to understand
some changes that I made to the code.

I wonder how MapReduce programmers could debug the code that they implemented?

Best regards,

View raw message