From: Adam Phelps
Date: Fri, 12 Nov 2010 15:49:09 -0800
To: general@hadoop.apache.org
Subject: Starting hadoop services at boot
Message-ID: <4CDDD275.9020901@opendns.com>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding:
7bit

I'm sure there is some detail I'm missing, but I've been testing node failures and noticed that the various services (datanode, tasktracker, and regionserver in my case) don't automatically restart on boot (I'm running Ubuntu on EC2). It's currently hiding from me, but I assume there is a configuration setting somewhere that enables this?

I noticed that there are various start scripts in /etc/init.d:

root@ip-10-123-1-151:~/control# ls /etc/init.d/hadoop*
/etc/init.d/hadoop-0.20-datanode /etc/init.d/hadoop-0.20-namenode /etc/init.d/hadoop-0.20-tasktracker /etc/init.d/hadoop-hbase-regionserver
/etc/init.d/hadoop-0.20-jobtracker /etc/init.d/hadoop-0.20-secondarynamenode /etc/init.d/hadoop-hbase-master /etc/init.d/hadoop-hbase-thrift

However, these don't actually appear to work in this setup:

root@ip-10-123-1-151:~/control# /etc/init.d/hadoop-0.20-namenode stop
Stopping Hadoop namenode daemon: no namenode to stop
ERROR
root@ip-10-123-1-151:~/control# /etc/init.d/hadoop-0.20-namenode start
Starting Hadoop namenode daemon: starting namenode, logging to /usr/lib/hadoop-0.20/bin/../logs/hadoop-root-namenode-ip-10-123-1-151.out
/usr/lib/hadoop-0.20/bin/hadoop-daemon.sh: line 114: /var/run/hadoop/hadoop-root-namenode.pid: Permission denied
nice: cannot set niceness: Permission denied
ERROR.

I'm currently starting services using "hadoop-daemon.sh start XXX".

Thanks
- Adam
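
One way to check whether any of the init scripts listed above are registered to start at boot is to look for start links in the runlevel directory. The following is a minimal sketch, assuming a SysV-style layout in which boot-enabled services have S<NN><name> symlinks under /etc/rc2.d (runlevel 2 being the usual Ubuntu default). The function names is_boot_enabled and report_boot_status are hypothetical helpers written for illustration; they are not part of Hadoop or its packaging.

```shell
#!/bin/sh
# Sketch only: assumes SysV-style rc directories (an assumption, not
# something the original message confirms for this setup).

# is_boot_enabled NAME [RC_DIR]
# Succeeds if RC_DIR holds a start link named S + two digits + NAME.
is_boot_enabled() {
    svc=$1
    dir=${2:-/etc/rc2.d}
    for link in "$dir"/S[0-9][0-9]"$svc"; do
        # An unmatched glob stays literal, so confirm the entry exists.
        if [ -e "$link" ] || [ -L "$link" ]; then
            return 0
        fi
    done
    return 1
}

# report_boot_status [INIT_DIR] [RC_DIR]
# Prints one line per hadoop* init script saying whether it has a start link.
report_boot_status() {
    init_dir=${1:-/etc/init.d}
    rc_dir=${2:-/etc/rc2.d}
    for script in "$init_dir"/hadoop*; do
        [ -e "$script" ] || continue
        name=$(basename "$script")
        if is_boot_enabled "$name" "$rc_dir"; then
            echo "$name: enabled"
        else
            echo "$name: not enabled"
        fi
    done
}
```

Calling report_boot_status with no arguments inspects /etc/init.d and /etc/rc2.d. On Debian and Ubuntu, a script reported as not enabled can normally be registered with "update-rc.d <name> defaults", though whether that alone is sufficient here is not established by the message above, since the init script itself fails with the "Permission denied" errors shown.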