Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 45264 invoked from network); 9 Apr 2008 04:02:14 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 9 Apr 2008 04:02:14 -0000 Received: (qmail 65051 invoked by uid 500); 9 Apr 2008 04:02:13 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 65027 invoked by uid 500); 9 Apr 2008 04:02:13 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 65018 invoked by uid 99); 9 Apr 2008 04:02:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Apr 2008 21:02:13 -0700 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 09 Apr 2008 04:01:30 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 882F7234C0C1 for ; Tue, 8 Apr 2008 20:59:25 -0700 (PDT) Message-ID: <823449010.1207713565542.JavaMail.jira@brutus> Date: Tue, 8 Apr 2008 20:59:25 -0700 (PDT) From: "Hemanth Yamijala (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Created: (HADOOP-3216) [HOD] Handle Torque error codes related to security / credential errors MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [HOD] Handle Torque error codes related to security / credential errors ----------------------------------------------------------------------- Key: HADOOP-3216 URL: https://issues.apache.org/jira/browse/HADOOP-3216 Project: Hadoop Core Issue Type: Bug Components: contrib/hod Affects Versions: 0.16.2 Reporter: Hemanth Yamijala There a bunch of credential / security related errors that come from Torque server, possibly under high load. HOD already handles one of this code specially, by retrying a bunch of times and giving up. We should probably do the same for other such errors. One of the frequently occuring one is error code 159. Other ones which Rajiv identified are: PBSE_IVALREQ PBSE_TOOMANY PBSE_UNKREQ PBSE_PERM PBSE_SYSTEM PBSE_INTERNAL PBSE_BADSTATE PBSE_BADCRED PBSE_EXPIRED PBSE_BADUSER PBSE_QUEBUSY PBSE_NOCONNECTS PBSE_ROUTEREJ PBSE_RESCUNAV PBSE_BADGRP PBSE_BADACLHOST -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.