Return-Path: X-Original-To: apmail-incubator-ambari-user-archive@minotaur.apache.org Delivered-To: apmail-incubator-ambari-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 3B7F7D292 for ; Tue, 5 Mar 2013 07:14:32 +0000 (UTC) Received: (qmail 55978 invoked by uid 500); 5 Mar 2013 07:14:32 -0000 Delivered-To: apmail-incubator-ambari-user-archive@incubator.apache.org Received: (qmail 55853 invoked by uid 500); 5 Mar 2013 07:14:31 -0000 Mailing-List: contact ambari-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: ambari-user@incubator.apache.org Delivered-To: mailing list ambari-user@incubator.apache.org Received: (qmail 55818 invoked by uid 99); 5 Mar 2013 07:14:30 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 05 Mar 2013 07:14:30 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of yusaku@hortonworks.com designates 209.85.217.182 as permitted sender) Received: from [209.85.217.182] (HELO mail-lb0-f182.google.com) (209.85.217.182) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 05 Mar 2013 07:14:23 +0000 Received: by mail-lb0-f182.google.com with SMTP id gg6so4484051lbb.41 for ; Mon, 04 Mar 2013 23:14:03 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:content-type:x-gm-message-state; bh=G4YvieAp46MWbXicp1cGZxLzYDs1KkMcO+TlhUChl/M=; b=Qp/Ez1XJGZS4gSgG2qetrE5wgvUGloa7j+p1MM7J/vS9ee0JoudV0OLmvs+AbThgpe mnz85RWwRtMaLYruDDsnLrnadRNAHR69ptWlTAhAzwIS6xp+fLn+zb8JqkDU0D4Ba5YL 8lrkEu3RYRxb/nv17iGJWeM/ZX0Q5xttoadx9cgKoPxcO0X0zAo4iZqiPGiqLcfHSnvx g2B+72sOJ9ESo1GsJHNGlRKg1VTJaRrtGSRxzOqLy2Dq4YO1SL5vpW18/r4xjoulFhNg Ffhpx3iRsKeuvcw64slFNGKRmkS35keFHiDa8Cl4IbQ6JvB/6dYCmfw7lr7Y0Kq3aCoO zYeA== MIME-Version: 1.0 X-Received: by 10.152.135.205 with SMTP id pu13mr3866210lab.48.1362467643444; Mon, 04 Mar 2013 23:14:03 -0800 (PST) Received: by 10.112.163.228 with HTTP; Mon, 4 Mar 2013 23:14:03 -0800 (PST) In-Reply-To: <51358E51.7020205@thecyberguardian.com> References: <5135591A.7060601@thecyberguardian.com> <5135648A.907@thecyberguardian.com> <51358E51.7020205@thecyberguardian.com> Date: Mon, 4 Mar 2013 23:14:03 -0800 Message-ID: Subject: Re: Trouble during deploy From: Yusaku Sako To: ambari-user@incubator.apache.org Content-Type: multipart/alternative; boundary=f46d044284f8e7e6a204d728374c X-Gm-Message-State: ALoCoQnTT2W/ptPBfXnnUbOM7jBBvHINf0w+gCrHj6yGKo3RYjQXQgKprrnFNf8zlYnaQssNoue1 X-Virus-Checked: Checked by ClamAV on apache.org --f46d044284f8e7e6a204d728374c Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi Dustine, That's a strange place for the install process to get stuck at. Can you try page refresh on your browser? Does it continue making progress= ? If something fails, you would see the progress bar turn red (fatal error) or orange (warning). Yusaku On Mon, Mar 4, 2013 at 10:18 PM, Dustine Rene Bernasor < dustine@thecyberguardian.com> wrote: > Hello, > > I tried stopping the Ambari server, then resetting, then starting it. > Did everything from scratch and this time, after clicking the Deploy > button, > I am redirected to the Install, Start and Test page. Installation proceed= s > but after a certain point, I am stuck. > > Crawler51 9% Installing JobTracker > Crawler52 11% Installing HDFS Client > Crawler53 16% Installing MapReduce Client > > I am getting the following from stdout: > > warning: Could not retrieve fact fqdn > warning: Host is missing hostname and/or domain: crawler51 > warning: Dynamic lookup of $service_state at /var/lib/ambari-agent/puppet= /modules/hdp-hadoop/manifests/init.pp:161 is deprecated. Support will be r= emoved in Puppet 2.8. Use a fully-qualified variable name (e.g., $classnam= e::variable) or parameterized classes. > warning: Dynamic lookup of $service_state at /var/lib/ambari-agent/puppet= /modules/hdp-hadoop/manifests/service.pp:74 is deprecated. Support will be= removed in Puppet 2.8. Use a fully-qualified variable name (e.g., $classn= ame::variable) or parameterized classes. > warning: Dynamic lookup of $service_state at /var/lib/ambari-agent/puppet= /modules/hdp-hadoop/manifests/service.pp:83 is deprecated. Support will be= removed in Puppet 2.8. Use a fully-qualified variable name (e.g., $classn= ame::variable) or parameterized classes. > warning: Dynamic lookup of $ambari_db_server_host is deprecated. Support= will be removed in Puppet 2.8. Use a fully-qualified variable name (e.g.,= $classname::variable) or parameterized classes. > notice: /Stage[1]/Hdp::Snappy::Package/Hdp::Snappy::Package::Ln[32]/Hdp::= Exec[hdp::snappy::package::ln 32]/Exec[hdp::snappy::package::ln 32]/returns= : executed successfully > notice: /Stage[2]/Hdp-hadoop::Initialize/Configgenerator::Configfile[core= -site]/File[/etc/hadoop/conf/core-site.xml]/content: content changed '{md5}= aa21ba6ff20cc6766211e37e4f364395' to '{md5}4a8180bd03474a5be7e13a3530ab641a= ' > notice: /Stage[2]/Hdp-hadoop::Initialize/Configgenerator::Configfile[mapr= ed-site]/File[/etc/hadoop/conf/mapred-site.xml]/content: content changed '{= md5}864fa2060a7271cca6769742fdf00b16' to '{md5}ae167014591c96734bba8a438f80= 5548' > notice: Finished catalog run in 1.55 seconds > > > My nodes do not have an FQDN since I have no other IP I can use for the > domain. > > Thanks. > > Dustine > > > > > On 3/5/2013 11:20 AM, Dustine Rene Bernasor wrote: > > Hello Yusaku, > > When I click the Deploy button,a loader gif appears (sometimes) but I am > stuck in the same screen. > I am not redirected to the Install, Start and Test page. > > I will try to do the "ambari-server stop" first then reset then start and > see if I still get the same problem. > If I still get it, I might have to switch to 1.2.1 as you suggested. > > By the way, I have attached the ambari-server log. > > Thanks. > > Dustine > > On 3/5/2013 11:01 AM, Yusaku Sako wrote: > > Hi Dustine, > > What happens after you click on the Deploy button? It just gets stuck > on the same screen? Or does it go to the "Install, Start and Test" page > with progress bars? > If you can post /var/log/ambari-server/ambari-server.log, it would be > helpful to troubleshoot. > > Also, it sounds like you are using Ambari 1.2.0? > With 1.2.0, you should "ambari-server stop", followed by "ambari-server > reset", then "ambari-server start" if deploy gets stuck. Clear the browse= r > cache and hit http://:8080. > > BTW, Ambari 1.2.1 handles retrying deploy much better than 1.2.0. > If deploy gets stuck for whatever reason, you can hit refresh on the > browser and hit "Deploy" again (no need to do "ambari-server reset", etc)= . > You will not get a message saying you already have a cluster with the sam= e > name, etc. > I highly recommend trying out 1.2.1, rather than 1.2.0 (if you are not > already). In addition to handling retries better, it has 136 fixes over > 1.2.0: > https://issues.apache.org/jira/issues/?jql=3DfixVersion%20%3D%20%221.2.1%= 22%20AND%20project%20%3D%20AMBARI > > Yusaku > > On Mon, Mar 4, 2013 at 6:31 PM, Dustine Rene Bernasor < > dustine@thecyberguardian.com> wrote: > >> Hello, >> >> I am trying to deploy a Hadoop cluster with 3 nodes using Ambari. >> >> This is my set-up: >> >> HDFS >> NameNode: NodeA >> SecondaryNameNode: NodeA >> DataNodes: 2 hosts >> >> MapReduce >> JobTracker: NodeA >> TaskTracker: 2 hosts >> >> Nagios >> Server: NodeA >> >> Ganglia >> Server: NodeA >> >> However, after clicking the deploy button, the process seems to be stuck= . >> >> I got something like this on the server log: >> >> \"component\":\"JOBTRACKER\",\"hostName\":\"Crawler51\",\"serviceId\":\"= MAPREDUCE\",\"isInstalled\":false},{\"display_name\":\"Nagios >> Server\",\"component\":\"NAGIOS_SERVER\",\"hostName\":\"Crawler51\",\"se= rviceId\":\"NAGIOS\",\"isInstalled\":false},{\"display_name\":\"Ganglia >> Collector\",\"component\":\"GANGLIA_SERVER\",\"hostName\":\"Crawler51\",= \"serviceId\":\"GANGLIA\",\"isInstalled\":false}],\"slaveComponentHosts\":[= {\"componentName\":\"DATANODE\",\"displayName\":\"DataNode\",\"hosts\":[{\"= hostName\":\"Crawler52\",\"group\":\"Default\",\"isInstalled\":false},{\"ho= stName\":\"Crawler53\",\"group\":\"Default\",\"isInstalled\":false}]},{\"co= mponentName\":\"TASKTRACKER\",\"displayName\":\"TaskTracker\",\"hosts\":[{\= "hostName\":\"Crawler52\",\"group\":\"Default\",\"isInstalled\":false},{\"h= ostName\":\"Crawler53\",\"group\":\"Default\",\"isInstalled\":false}]},{\"c= omponentName\":\"CLIENT\",\"displayName\":\"client\",\"hosts\":[{\"hostName= \":\"Crawler52\",\"group\":\"Default\",\"isInstalled\":false},{\"hostName\"= :\"Crawler53\",\"group\":\"Default\",\"isInstalled\":false}]}]},\"AddHost\"= :{},\"AddService\":{}}}"} >> >> >> So after waiting for hours and hours, I tried to do it all over again. >> First I did a reset (ambari-server reset) on the Ambari host >> then did everything from scratch. When I reach the Deploy part, this >> time, I get a message that a cluster with the same name already exists. >> >> Here are my questions: >> 1. What to do with the stuck deploy? >> 2. How to remove the cluster that supposedly exist already? When I log i= n >> to Ambari, I am redirected to the install wizard. >> >> >> Thanks. >> >> Dustine >> >> > > > --f46d044284f8e7e6a204d728374c Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hi Dustine,

That's a strange place for the install process to ge= t stuck at.
Can you try page refresh on your browser?=A0 Does it continu= e making progress?
If something fails, you would see the progress bar tu= rn red (fatal error) or orange (warning).

Yusaku

On Mon, Mar 4, 2013 at 10:18 P= M, Dustine Rene Bernasor <dustine@thecyberguardian.com><= /span> wrote:
=20 =20 =20
Hello,

I tried stopping the Ambari server, then resetting, then starting it.
Did everything from scratch and this time, after clicking the Deploy button,
I am redirected to the Install, Start and Test page. Installation proceeds
but after a certain point, I am stuck.

Crawler51 9% Installing JobTracker
Crawler52 11% Installing HDFS Client
Crawler53 16% Installing MapReduce Client

I am getting the following from stdout:

warning: Could not retrieve fact fqdn
warning: Host is missing hostname and/or domain: crawler51
warning: Dynamic lookup of $service_state at /var/lib/ambari-agent/puppet/m=
odules/hdp-hadoop/manifests/init.pp:161 is deprecated.  Support will be rem=
oved in Puppet 2.8.  Use a fully-qualified variable name (e.g., $classname:=
:variable) or parameterized classes.
warning: Dynamic lookup of $service_state at /var/lib/ambari-agent/puppet/m=
odules/hdp-hadoop/manifests/service.pp:74 is deprecated.  Support will be r=
emoved in Puppet 2.8.  Use a fully-qualified variable name (e.g., $classnam=
e::variable) or parameterized classes.
warning: Dynamic lookup of $service_state at /var/lib/ambari-agent/puppet/m=
odules/hdp-hadoop/manifests/service.pp:83 is deprecated.  Support will be r=
emoved in Puppet 2.8.  Use a fully-qualified variable name (e.g., $classnam=
e::variable) or parameterized classes.
warning: Dynamic lookup of $ambari_db_server_host is deprecated.  Support w=
ill be removed in Puppet 2.8.  Use a fully-qualified variable name (e.g., $=
classname::variable) or parameterized classes.
notice: /Stage[1]/Hdp::Snappy::Package/Hdp::Snappy::Package::Ln[32]/Hdp::Ex=
ec[hdp::snappy::package::ln 32]/Exec[hdp::snappy::package::ln 32]/returns: =
executed successfully
notice: /Stage[2]/Hdp-hadoop::Initialize/Configgenerator::Configfile[core-s=
ite]/File[/etc/hadoop/conf/core-site.xml]/content: content changed '{md=
5}aa21ba6ff20cc6766211e37e4f364395' to '{md5}4a8180bd03474a5be7e13a=
3530ab641a'
notice: /Stage[2]/Hdp-hadoop::Initialize/Configgenerator::Configfile[mapred=
-site]/File[/etc/hadoop/conf/mapred-site.xml]/content: content changed '=
;{md5}864fa2060a7271cca6769742fdf00b16' to '{md5}ae167014591c96734b=
ba8a438f805548'
notice: Finished catalog run in 1.55 seconds

My nodes do not have an FQDN since I have no other IP I can use for the domain.

Thanks.

Dustine




On 3/5/2013 11:20 AM, Dustine Rene Bernasor wrote:
=20
Hello Yusaku,

When I click the Deploy button,a loader gif appears (sometimes) but I am stuck in the same screen.
I am not redirected to the Install, Start and Test page.

I will try to do the "ambari-server stop" first then rese= t then start and see if I still get the same problem.
If I still get it, I might have to switch to 1.2.1 as you suggested.

By the way, I have attached the ambari-server log.

Thanks.

Dustine

On 3/5/2013 11:01 AM, Yusaku Sako wrote:
Hi Dustine,

What happens after you click on the Deploy button? =A0It just gets stuck on the same screen? =A0Or does it go to the "Inst= all, Start and Test" page with progress bars?
If you can post /var/log/ambari-server/ambari-server.log, it would be helpful to troubleshoot.

Also, it sounds like you are using Ambari 1.2.0?
With 1.2.0, you should "ambari-server stop", followe= d by "ambari-server reset", then "ambari-server start&q= uot; if deploy gets stuck. Clear the browser cache and hit http://<amb= ari-server>:8080. =A0

BTW, Ambari 1.2.1 handles retrying deploy much better than 1.2.0.
If deploy gets stuck for whatever reason, you can hit refresh on the browser and hit "Deploy" again (no need = to do "ambari-server reset", etc).
You will not get a message saying you already have a cluster with the same name, etc.
I highly recommend trying out 1.2.1, rather than 1.2.0 (if you are not already). =A0In addition to handling retries better, it has 136 fixes over 1.2.0:=A0https://issues.apache.org/jira/issues/?jql=3Dfi= xVersion%20%3D%20%221.2.1%22%20AND%20project%20%3D%20AMBARI

Yusaku

On Mon, Mar 4, 2013 at 6:31 PM, Dustine Rene Bernasor <dustine@thecyberguardian.com= > wrote:
Hello,

I am trying to deploy a Hadoop cluster with 3 nodes using Ambari.

This is my set-up:

HDFS
=A0 NameNode: NodeA
=A0 SecondaryNameNode: NodeA
=A0 DataNodes: 2 hosts

MapReduce
=A0 JobTracker: NodeA
=A0 TaskTracker: 2 hosts

Nagios
=A0 Server: NodeA

Ganglia
=A0 Server: NodeA

However, after clicking the deploy button, the process seems to be stuck.

I got something like this on the server log:

\"component\":\"JOBTRACKER\",\"hostN= ame\":\"Crawler51\",\"serviceId\":\"MAPREDUCE= \",\"isInstalled\":false},{\"display_name\":\"= ;Nagios Server\",\"component\":\"NAGIOS_SERVER\&q= uot;,\"hostName\":\"Crawler51\",\"serviceId\"= :\"NAGIOS\",\"isInstalled\":false},{\"display_name= \":\"Ganglia Collector\",\"component\":\"GANGLIA_SERVER\",\&quo= t;hostName\":\"Crawler51\",\"serviceId\":\"GA= NGLIA\",\"isInstalled\":false}],\"slaveComponentHosts\&= quot;:[{\"componentName\":\"DATANODE\",\"displayNa= me\":\"DataNode\",\"hosts\":[{\"hostName\&quo= t;:\"Crawler52\",\"group\":\"Default\",\"= ;isInstalled\":false},{\"hostName\":\"Crawler53\",= \"group\":\"Default\",\"isInstalled\":false}]= },{\"componentName\":\"TASKTRACKER\",\"displayName= \":\"TaskTracker\",\"hosts\":[{\"hostName\&qu= ot;:\"Crawler52\",\"group\":\"Default\",\&quo= t;isInstalled\":false},{\"hostName\":\"Crawler53\"= ,\"group\":\"Default\",\"isInstalled\":false}= ]},{\"componentName\":\"CLIENT\",\"displayName\&qu= ot;:\"client\",\"hosts\":[{\"hostName\":\&quo= t;Crawler52\",\"group\":\"Default\",\"isInsta= lled\":false},{\"hostName\":\"Crawler53\",\"g= roup\":\"Default\",\"isInstalled\":false}]}]},\&qu= ot;AddHost\":{},\"AddService\":{}}}"}


So after waiting for hours and hours, I tried to do it all over again. First I did a reset (ambari-server reset) on the Ambari host
then did everything from scratch. When I reach the Deploy part, this time, I get a message that a cluster with the same name already exists.

Here are my questions:
1. What to do with the stuck deploy?
2. How to remove the cluster that supposedly exist already? When I log in to Ambari, I am redirected to the install wizard.


Thanks.

Dustine





--f46d044284f8e7e6a204d728374c--