Return-Path: X-Original-To: apmail-incubator-accumulo-user-archive@minotaur.apache.org Delivered-To: apmail-incubator-accumulo-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2EB009544 for ; Thu, 15 Mar 2012 21:32:02 +0000 (UTC) Received: (qmail 96605 invoked by uid 500); 15 Mar 2012 21:32:02 -0000 Delivered-To: apmail-incubator-accumulo-user-archive@incubator.apache.org Received: (qmail 96583 invoked by uid 500); 15 Mar 2012 21:32:02 -0000 Mailing-List: contact accumulo-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: accumulo-user@incubator.apache.org Delivered-To: mailing list accumulo-user@incubator.apache.org Received: (qmail 96573 invoked by uid 99); 15 Mar 2012 21:32:02 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Mar 2012 21:32:02 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [206.112.75.238] (HELO iron-u-a-out.osis.gov) (206.112.75.238) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 15 Mar 2012 21:31:55 +0000 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AgUFAC1fYk+sEAbx/2dsb2JhbABDtiuBD4IJAQEBAwESAmoLCwsNLiITBR0ZGweHY58cCpVekQcElWGQJ4JmgVw X-IronPort-AV: E=Sophos;i="4.73,593,1325480400"; d="scan'208";a="8745323" Received: from ghost-a.center.osis.gov (HELO mail-vx0-f175.google.com) ([172.16.6.241]) by iron-u-a-in.osis.gov with ESMTP/TLS/RC4-SHA; 15 Mar 2012 17:30:00 -0400 Received: by vcbfl13 with SMTP id fl13so4106623vcb.6 for ; Thu, 15 Mar 2012 14:31:33 -0700 (PDT) Received: by 10.52.88.235 with SMTP id bj11mr88336vdb.119.1331847093280; Thu, 15 Mar 2012 14:31:33 -0700 (PDT) MIME-Version: 1.0 Received: by 10.220.148.199 with HTTP; Thu, 15 Mar 2012 14:31:13 -0700 (PDT) In-Reply-To: <1701454682.215004.1331836655508.JavaMail.root@linzimmb04o.imo.intelink.gov> References: <7DF22D8C-7014-4100-9263-CB02CE9985E3@cray.com> <1701454682.215004.1331836655508.JavaMail.root@linzimmb04o.imo.intelink.gov> From: John Vines Date: Thu, 15 Mar 2012 21:31:13 +0000 Message-ID: Subject: Re: start-all.sh should do some basic verification (was Accumulo fails to start) To: accumulo-user@incubator.apache.org Content-Type: multipart/alternative; boundary=20cf3071d0cee3df8504bb4ed075 X-Virus-Checked: Checked by ClamAV on apache.org --20cf3071d0cee3df8504bb4ed075 Content-Type: text/plain; charset=ISO-8859-1 In the debian scripts, I attempt to find JAVA_HOME by tracing /usr/bin/java with readlink, but this doesn't exist on OSX and readlink behaves differently on RHEL. So then I changed it to just iterating over a list of known java directories, which it checks for until it works. I would love to have a more standard solution for that as well, if anyone can think of one. On Thu, Mar 15, 2012 at 6:37 PM, Robert Vesse wrote: > Hi All > > This is an RFE based on my troubles getting started with Accumulo which > I feel would benefit other users. > > As seen in my original thread it is possible to end up with a > configuration where start-all.sh will appear to work fine but Accumulo will > actually not have started and no logs will have been generated. This is > very unhelpful to the user since they have no visibility that something has > gone wrong > > Would it be possible for the start-all script to do some basic > verification to check that servers actually started up ok and if not print > an error "Accumulo did not start correctly, please check your configuration" > > Rob > > On Mar 14, 2012, at 5:39 PM, Eric Newton wrote: > > Well, JAVA_HOME expects a certain file layout... for example, on my mac > laptop, /Library/Java/Home works, but on Linux it's completely different. > > It is especially sensitive when building the JNI libs for the native map. > > Since it is hard to debug different configurations on different > platforms, it would be great if you could document what works for you. > > -Eric > > On Wed, Mar 14, 2012 at 4:05 PM, Robert Vesse wrote: > >> I have tried to set it in that script but the scripts don't seem to >> accept it - this is further confused by the fact that a couple of days ago >> everything worked without any issues and I haven't changed my config at all >> >> A which java on my command line will give /usr/bin/java as the path but >> setting JAVA_HOME to this causes scripts to fail completely because they >> blindly append /bin/java to the path, yet bizarrely setting it to /usr >> gives the warning seen in my email >> >> I will experiment and see whether this is the cause of the issue but >> should there not be a way to actually show a useful error message to the >> user in this case? >> >> Rob >> >> On Mar 14, 2012, at 12:56 PM, Eric Newton wrote: >> >> Are you setting JAVA_HOME in conf/accumulo-env.sh ? >> >> If you are setting configuration in your environment, it may not >> survive an ssh to localhost. >> >> -Eric >> >> On Wed, Mar 14, 2012 at 3:51 PM, Robert Vesse wrote: >> >>> I'm having problems with Accumulo not starting correctly - yet from the >>> command line it looks to start fine i.e. when I run bin/start-all.sh I see >>> the following output: >>> >>> Starting tablet servers and loggers .... done >>> Unable to find a $JAVA_HOME at "/usr", continuing with system-provided >>> Java... >>> Unable to find a $JAVA_HOME at "/usr", continuing with system-provided >>> Java... >>> Starting logger on localhost >>> Starting tablet server on localhost >>> Starting master on localhost >>> Starting garbage collector on localhost >>> Starting monitor on localhost >>> Starting tracer on localhost >>> >>> Yet if I try to start an accumulo shell with bin/accumulo shell -u root >>> I get the following error: >>> >>> 14 12:41:27,472 [impl.ServerClient] WARN : Failed to find an available >>> server in the list of servers: [] >>> 14 12:41:27,473 [shell.Shell] ERROR: >>> org.apache.accumulo.core.client.AccumuloException: >>> org.apache.thrift.transport.TTransportException: Failed to connect to a >>> server >>> >>> I looked under the logs/ directory and see that a bunch of logs are >>> created but they are all zero bytes in size so no helpful hints there :-( >>> >>> I have followed the README to configure everything appropriately for a >>> single node setup (I have Hadoop and Zookeeper running fine) and I actually >>> had Accumulo working correctly a couple of days ago yet now I am unable to >>> start things up again. I have tried reformatting my HDFS (as this is just >>> a dev setup) and starting from scratch but I consistently hit this issue >>> that Accumulo just does not start properly. >>> >>> Any help or guidance on this would be much appreciated >>> >>> Rob >> >> >> >> > > --20cf3071d0cee3df8504bb4ed075 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable In the debian scripts, I attempt to find JAVA_HOME by tracing /usr/bin/java= with readlink, but this doesn't exist on OSX and readlink behaves diff= erently on RHEL. So then I changed it to just iterating over a list of know= n java directories, which it checks for until it works. I would love to hav= e a more standard solution for that as well, if anyone can think of one.
On Thu, Mar 15, 2012 at 6:37 PM, Robert Vess= e <rvesse@yarcd= ata.com> wrote:
Hi All

This is an RFE based on my troubles getting started with Accumulo whic= h I feel would benefit other users.

As seen in my original thread it is possible to end up with a configur= ation where start-all.sh will appear to work fine but Accumulo will actuall= y not have started and no logs will have been generated. =A0This is very un= helpful to the user since they have no visibility that something has gone wrong

Would it be possible for the start-all script to do some basic verific= ation to check that servers actually started up ok and if not print an erro= r "Accumulo did not start correctly, please check your configuration&q= uot;

Rob

On Mar 14, 2012, at 5:39 PM, Eric Newton wrote:

Well, JAVA_HOME expects a certain file layout... = for example, on my mac laptop, /Library/Java/Home works, but on Linux it= 9;s completely different.

It is especially sensitive when building the JNI libs for the native m= ap.

Since it is hard to debug different configurations on different platfo= rms, it would be great if you could document what works for you.

-Eric

On Wed, Mar 14, 2012 at 4:05 PM, Robert Vesse <rvesse@yarcdat= a.com> wrote:
I have tried to set it in that script b= ut the scripts don't seem to accept it - this is further confused by th= e fact that a couple of days ago everything worked without any issues and I= haven't changed my config at all

A which java on my command line will give /usr/bin/java as the path bu= t setting JAVA_HOME to this causes scripts to fail completely because they = blindly append /bin/java to the path, yet bizarrely setting it to /usr give= s the warning seen in my email

I will experiment and see whether this is the cause of the issue but s= hould there not be a way to actually show a useful error message to the use= r in this case?

Rob

On Mar 14, 2012, at 12:56 PM, Eric Newton wrote:

Are you setting JAVA_HOME in conf/accumulo-env.sh= ?

If you are setting configuration in your environment, it may not survi= ve an ssh to localhost.

-Eric

On Wed, Mar 14, 2012 at 3:51 PM, Robert Vesse <rvesse@yarcdat= a.com> wrote:
I'm having problems with Accumulo not starting correctly - yet from the= command line it looks to start fine i.e. when I run bin/start-all.sh I see= the following output:

Starting tablet servers and loggers .... done
Unable to find a $JAVA_HOME at "/usr", continuing with system-pro= vided Java...
Unable to find a $JAVA_HOME at "/usr", continuing with system-pro= vided Java...
Starting logger on localhost
Starting tablet server on localhost
Starting master on localhost
Starting garbage collector on localhost
Starting monitor on localhost
Starting tracer on localhost

Yet if I try to start an accumulo shell with bin/accumulo shell -u root I g= et the following error:

14 12:41:27,472 [impl.ServerClient] WARN : Failed to find an available serv= er in the list of servers: []
14 12:41:27,473 [shell.Shell] ERROR: org.apache.accumulo.core.client.Accumu= loException: org.apache.thrift.transport.TTransportException: Failed to con= nect to a server

I looked under the logs/ directory and see that a bunch of logs are created= but they are all zero bytes in size so no helpful hints there :-(

I have followed the README to configure everything appropriately for a sing= le node setup (I have Hadoop and Zookeeper running fine) and I actually had= Accumulo working correctly a couple of days ago yet now I am unable to sta= rt things up again. =A0I have tried reformatting my HDFS (as this is just a dev setup) and starting from scrat= ch but I consistently hit this issue that Accumulo just does not start prop= erly.

Any help or guidance on this would be much appreciated

Rob





--20cf3071d0cee3df8504bb4ed075--