Return-Path: Delivered-To: apmail-hadoop-mapreduce-user-archive@minotaur.apache.org Received: (qmail 34163 invoked from network); 25 Nov 2010 09:05:36 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 25 Nov 2010 09:05:36 -0000 Received: (qmail 56246 invoked by uid 500); 25 Nov 2010 08:38:56 -0000 Delivered-To: apmail-hadoop-mapreduce-user-archive@hadoop.apache.org Received: (qmail 56045 invoked by uid 500); 25 Nov 2010 08:38:54 -0000 Mailing-List: contact mapreduce-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mapreduce-user@hadoop.apache.org Delivered-To: mailing list mapreduce-user@hadoop.apache.org Received: (qmail 56013 invoked by uid 99); 25 Nov 2010 08:38:53 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 25 Nov 2010 08:38:53 +0000 X-ASF-Spam-Status: No, hits=0.6 required=10.0 tests=FREEMAIL_FROM,HK_RANDOM_ENVFROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of zjffdu@gmail.com designates 209.85.161.48 as permitted sender) Received: from [209.85.161.48] (HELO mail-fx0-f48.google.com) (209.85.161.48) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 25 Nov 2010 08:38:46 +0000 Received: by fxm2 with SMTP id 2so567179fxm.35 for ; Thu, 25 Nov 2010 00:38:26 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=8bvD9Xzcd8GaPKYehT94cg+JF6XR91Of3IimAjzoc1g=; b=s7H1pdAmjp7B8/mYiSuEExUujoMt4Hdgnv0el08Zyb1KrJ8EpFyV1l3ZNjNtu6PTK4 oq2OYnPEQezju2rFkzWyVU02rjL2hQV0TjCDXmt6tFM2wCI3y+ADG9FCeFZS1s6IOLoB GYtp7/leC5NTWgG5kg5V4N5iTFouZKLM83PGk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=aUUxmeBnnUShm/wEb/AjwY1Ik/N56cqp2bYUyUkHlC2Z1hN75v9aAsKWK9jk6Iyv6y DJ5P6czG95nAKdATYh2mnFWJUypwVLfoYRVlLilcmO2ojRKX6vio9lUXp3zZtBuJAb64 Q2ehgVcSnhuJnAxpqrmCOejbR97qYoj4ZIE8Y= MIME-Version: 1.0 Received: by 10.223.114.135 with SMTP id e7mr396533faq.78.1290674306514; Thu, 25 Nov 2010 00:38:26 -0800 (PST) Received: by 10.223.115.209 with HTTP; Thu, 25 Nov 2010 00:38:26 -0800 (PST) In-Reply-To: References: Date: Thu, 25 Nov 2010 16:38:26 +0800 Message-ID: Subject: Re: Starting a Hadoop job programtically From: Jeff Zhang To: mapreduce-user@hadoop.apache.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Please check the status of job tracker since you can not find port 9001 using netstat 2010/11/25 li ping : > Hi: > I am trying to run a job in my own application. > So far, I can run the job on the server which the hadoop server is runnin= g > on. > But what I expect is the hadoop server is running on server A. and the > application will run on other server (B server). > If I run the application on Server B, it will throw some error, see blow: > 10/11/25 15:54:10 INFO ipc.Client: Retrying connect to server: > xi-pli/xxx.xxx.xxx.xxx:9001. Already tried 0 time(s). > 10/11/25 15:54:11 INFO ipc.Client: Retrying connect to > server:=C2=A0xi-pli/xxx.xxx.xxx.xxx:9001. Already tried 1 time(s). > 10/11/25 15:54:12 INFO ipc.Client: Retrying connect to > server:=C2=A0xi-pli/xxx.xxx.xxx.xxx:9001. Already tried 2 time(s). > ........... > Here I replace the real ip address with the xxx.xxx.xxx.xxx. > This is my configuration file: > > =C2=A0=C2=A0 =C2=A0 > =C2=A0=C2=A0 =C2=A0 =C2=A0fs.default.name > =C2=A0=C2=A0 =C2=A0 =C2=A0hdfs://xi-pli:9000 > =C2=A0=C2=A0 =C2=A0 =C2=A0 > =C2=A0=C2=A0 =C2=A0 > > =C2=A0=C2=A0mapred.job.tracker > =C2=A0=C2=A0xi-pli:9001 > =C2=A0=C2=A0 > > > The=C2=A0configuration=C2=A0file on server side is almost the same. > The server A and B can ping each other. and Also can ssh to each other > without the password. > The A,B server has the same username "hadoop" > The iptable also has been shutdown. > I can use the "telnet" command to connect to 9001 port on hadoop server > (Server A). but on Server B, I can't telnet to Server A.(The hadoop serve= r > is running on Server A ) > If I use the netstat -a to check the port. I can't find the 9001 port. > I have no idea why I can't run the job on the other server. If anyone can > give me some suggestion, that's very=C2=A0appreciated. > Thanks > Best Regards > -- > -----=E6=9D=8E=E5=B9=B3 > -- > -----=E6=9D=8E=E5=B9=B3 > --=20 Best Regards Jeff Zhang