Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9192317FF3 for ; Mon, 6 Apr 2015 09:00:19 +0000 (UTC) Received: (qmail 3162 invoked by uid 500); 6 Apr 2015 09:00:13 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 2964 invoked by uid 500); 6 Apr 2015 09:00:13 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 2954 invoked by uid 99); 6 Apr 2015 09:00:12 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Apr 2015 09:00:12 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of sanjeev.tripurari@inmobi.com designates 209.85.213.170 as permitted sender) Received: from [209.85.213.170] (HELO mail-ig0-f170.google.com) (209.85.213.170) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 06 Apr 2015 08:59:47 +0000 Received: by igbqf9 with SMTP id qf9so15413076igb.1 for ; Mon, 06 Apr 2015 01:59:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=inmobi.com; s=google; h=mime-version:date:message-id:subject:from:to:content-type; bh=vL9LJO2GklwhNuV7F6D7FfZrhfMFQ9JFg/mOWQniNqM=; b=GK/X6c498SEK4tHgeZ4PEoOPeKJWfNKqwz3M2yQdGEZs+viX3t4XQoln03SCzK7OwW AeWI0Gs5dYJrGIN+GLAZlAz4Yx7daEAl2uyXxvNpOcvgb5K00AIa/TB4v1Gm1EJdRTZd czUT8C4GF0nh/c6yN6HFt4bnkIm8tMIgZCgCs= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:date:message-id:subject:from:to :content-type; bh=vL9LJO2GklwhNuV7F6D7FfZrhfMFQ9JFg/mOWQniNqM=; b=TCI776lmdtXbya36cAx9JTgfAyC6tLTP3RAO4f/+HZ0BVG0Iwu9HRfRuT/ihFDj3nz aFZLflsWWqGFYx1d32dn9WjwHQfOt95Ammu1VSMxAtrimA/pqlkzz8OqU0rbHNAX/C+w 0g8Vk+FvqKmRQWfzcq7+2fq6uDM3aMoCDREbAiMA4ylIjCCDOSfX6VjwRqA7W9tGuzIT X6ixTJoixN3TuDt91PtItNyqVbNa0+oSCaBg1DgkDOqfjTxqvkZplVYdfoBAi2ok4WMx 3rFWQ3iPmksn6Ps3KsmqdJOvfW0JWyKbxvjOh7O8riW1/OeFlFDQp/OsVNutdu2hkXlk uQxg== X-Gm-Message-State: ALoCoQmzBoKTvhRRyEyVQJ7pWx6N9yJM/xhmQf5j5LHMN6BR4cfWusd7O39qnPbMX4VX3S8rxSJI9iHJTLx25UOjrFfxSDmiCs097EMR/yDIg9anCZy3lxU= MIME-Version: 1.0 X-Received: by 10.50.132.66 with SMTP id os2mr44710511igb.6.1428310785092; Mon, 06 Apr 2015 01:59:45 -0700 (PDT) Received: by 10.36.137.84 with HTTP; Mon, 6 Apr 2015 01:59:45 -0700 (PDT) Date: Mon, 6 Apr 2015 14:29:45 +0530 Message-ID: Subject: Tracking job failure using APIs From: Sanjeev Tripurari To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=047d7b2e3f3cf9b06405130a84b4 X-Virus-Checked: Checked by ClamAV on apache.org --047d7b2e3f3cf9b06405130a84b4 Content-Type: text/plain; charset=UTF-8 Hi All, How can I track a job failure on node or list of nodes, using YARN apis. I could get the list of long running jobs, using yarn client API, but need to go further to AM, NM, task attempts for map or reduce. Say, I have a job running for long,(about 4hours), might be caused of some task failures. Please provide the sequence of APIs, or any reference. Thanks and Regards -Sanjeev -- _____________________________________________________________ The information contained in this communication is intended solely for the use of the individual or entity to whom it is addressed and others authorized to receive it. It may contain confidential or legally privileged information. If you are not the intended recipient you are hereby notified that any disclosure, copying, distribution or taking any action in reliance on the contents of this information is strictly prohibited and may be unlawful. If you have received this communication in error, please notify us immediately by responding to this email and then delete it from your system. The firm is neither liable for the proper and complete transmission of the information contained in this communication nor for any delay in its receipt. --047d7b2e3f3cf9b06405130a84b4 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Hi All,

How can I track a job failure o= n node or list of nodes, using YARN apis.
I could get the li= st of long running jobs, using yarn client API,=C2=A0
but need to= go further to AM, NM, task attempts for map or reduce.
Say, I have a job running for long,(about 4hours), might be cau= sed of some task failures.

Please provide the sequ= ence of APIs, or any reference.

Thanks and Regards=
-Sanjeev


_____________= ________________________________________________
The information contained in this communication is intended solely for th= e use of the individual or entity to whom it is addressed and others author= ized to receive it. It may contain confidential or legally privileged infor= mation. If you are not the intended recipient you are hereby notified that = any disclosure, copying, distribution or taking any action in reliance on t= he contents of this information is strictly prohibited and may be unlawful.= If you have received this communication in error, please notify us immedia= tely by responding to this email and then delete it from your system. The f= irm is neither liable for the proper and complete transmission of the infor= mation contained in this communication nor for any delay in its receipt. --047d7b2e3f3cf9b06405130a84b4--