From: samir das mohapatra
To: user@hadoop.apache.org, cdh-user@cloudera.org
Date: Thu, 28 Feb 2013 21:10:52 +0530
Subject: Re: Issue in Datanode (using CDH4.1.2)

A few more details:

The same setup works on Ubuntu machines (dev cluster); it fails only under CentOS 6.3 (prod cluster).

On Thu, Feb 28, 2013 at 9:06 PM, samir das mohapatra <samir.helpdoc@gmail.com> wrote:

> Hi All,
>
> I am facing a strange issue. In a cluster of 1k machines, I am able to
> start and stop NN, DN, JT, TT, SSN. The problem is that the NameNode
> web URL shows only one datanode. I tried connecting to the nodes through
> ssh and that works fine, and I have assigned the NN URL and port in
> core-site.
> http://namenode:50070
>
> I also checked the datanode logs and found messages like this:
>
> 2013-02-28 06:59:01,652 WARN org.apache.hadoop.hdfs.server.datanode.DataNode: Problem connecting to server: hadoophost1/192.168.1.1:54310
> 2013-02-28 06:59:07,660 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: hadoophost1/192.168.1.1:54310. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1 SECONDS)
>
> Regards,
> samir.
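
The datanode log above shows repeated failures to connect to hadoophost1/192.168.1.1:54310. One thing that may be worth checking is whether that TCP port is reachable from a datanode host at all. The following is a minimal Python sketch, not part of the original thread; the helper name `can_connect` and the 3-second default timeout are assumptions chosen for illustration.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper for illustration; it is not part of Hadoop.
    """
    try:
        # create_connection resolves the host name and opens a TCP socket.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

For example, calling `can_connect("hadoophost1", 54310)` on a datanode host would return `False` if the name does not resolve, the port is blocked, or nothing is listening there. A `True` result only shows that the port accepts TCP connections; it does not confirm that the NameNode RPC service itself is healthy.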