Return-Path: X-Original-To: apmail-accumulo-dev-archive@www.apache.org Delivered-To: apmail-accumulo-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7288EDF70 for ; Tue, 11 Sep 2012 21:43:36 +0000 (UTC) Received: (qmail 87828 invoked by uid 500); 11 Sep 2012 21:43:36 -0000 Delivered-To: apmail-accumulo-dev-archive@accumulo.apache.org Received: (qmail 87794 invoked by uid 500); 11 Sep 2012 21:43:36 -0000 Mailing-List: contact dev-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@accumulo.apache.org Delivered-To: mailing list dev@accumulo.apache.org Received: (qmail 87784 invoked by uid 99); 11 Sep 2012 21:43:36 -0000 Received: from minotaur.apache.org (HELO minotaur.apache.org) (140.211.11.9) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Sep 2012 21:43:36 +0000 Received: from localhost (HELO mail-pz0-f41.google.com) (127.0.0.1) (smtp-auth username afuchs, mechanism plain) by minotaur.apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Sep 2012 21:43:36 +0000 Received: by dadi14 with SMTP id i14so644295dad.0 for ; Tue, 11 Sep 2012 14:43:35 -0700 (PDT) MIME-Version: 1.0 Received: by 10.68.237.38 with SMTP id uz6mr13163148pbc.23.1347399815644; Tue, 11 Sep 2012 14:43:35 -0700 (PDT) Received: by 10.68.196.165 with HTTP; Tue, 11 Sep 2012 14:43:35 -0700 (PDT) In-Reply-To: References: Date: Tue, 11 Sep 2012 17:43:35 -0400 Message-ID: Subject: Re: Intermittent error connecting to Accumulo Instance From: Adam Fuchs To: dev@accumulo.apache.org Content-Type: multipart/alternative; boundary=047d7b33d6ce61c7d004c973f79e --047d7b33d6ce61c7d004c973f79e Content-Type: text/plain; charset=ISO-8859-1 Duane, The INVALID_INSTANCEID message implies that your client is trying to contact a server that thinks the instance id is something else. The 1 out of 5 likelihood probably indicates that one out of the five server processes (one master and four tablet servers) has some misconfiguration. When you do a "deleteTable" operation, the client code will get the list of active servers and pick one at random. This would seem to imply that zookeeper is configured correctly across your cluster. Tablet servers read their instance name and ID out of the accumulo directory in HDFS. I wonder if this could be caused by something like HDFS not being configured the same on one of the tablet servers, so when it launches it doesn't get the right instance ID? (I'm thinking out loud here) Do you see any warning or error messages propagating to your monitor page? Adam On Tue, Sep 11, 2012 at 5:25 PM, Cornish, Duane C. wrote: > I stood up a 4 node Accumulo Cloud. I wrote a Java interface to it and > when I attempt to connect to the table through my interface, I'm getting an > intermittent error that only pops up maybe 1 out of every 5 runs of the > code. I have zookeeper running on all 4 nodes but in my accumulo config > file, I only mentioned the one on the Accumulo name node. I don't know if > that would be a problem or not. Also, the zookeeper stuff is all owned by > root, while hadoop and Accumulo are owned by a different non-root user. I > built this cluster from the ground up (i.e. didn't use Cloudera, etc.), so > it's possible that I missed something. > > I'm running Accumulo 1.4.1 with hadoop 1.0.3 and zookeeper 3.3.6. > > As you can see from the error below, it is saying that I have an > INVALID_INSTANCEID. However my instance (and credentials) are static final > Strings and are not changing. I haven't had any trouble connecting to the > instance via the shell. Also, the monitor webpage works just fine and > shows no errors. > > Here is the error below: > > Exception in thread "main" > org.apache.accumulo.core.client.AccumuloSecurityException: Error > INVALID_INSTANCEID - Unknown security exception > at > org.apache.accumulo.core.client.impl.ServerClient.execute(ServerClient.java:73) > at > org.apache.accumulo.core.client.impl.ConnectorImpl.(ConnectorImpl.java:75) > at > org.apache.accumulo.core.client.ZooKeeperInstance.getConnector(ZooKeeperInstance.java:218) > at > jhuapl.redd.coremax.shared.accumuloInterface.AccumuloConnector.deleteTable(AccumuloConnector.java:583) > at > jhuapl.redd.coremax.shared.accumuloInterface.TestAccumuloInterface.main(TestAccumuloInterface.java:27) > Caused by: ThriftSecurityException(user:root, code:INVALID_INSTANCEID) > at > org.apache.accumulo.core.client.impl.thrift.ClientService$authenticateUser_result.read(ClientService.java:7755) > at > org.apache.accumulo.core.client.impl.thrift.ClientService$Client.recv_authenticateUser(ClientService.java:415) > at > org.apache.accumulo.core.client.impl.thrift.ClientService$Client.authenticateUser(ClientService.java:387) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > at java.lang.reflect.Method.invoke(Method.java:616) > at > org.apache.accumulo.cloudtrace.instrument.thrift.TraceWrap$2.invoke(TraceWrap.java:84) > at $Proxy0.authenticateUser(Unknown Source) > at > org.apache.accumulo.core.client.impl.ConnectorImpl$1.execute(ConnectorImpl.java:78) > at > org.apache.accumulo.core.client.impl.ConnectorImpl$1.execute(ConnectorImpl.java:75) > at > org.apache.accumulo.core.client.impl.ServerClient.executeRaw(ServerClient.java:108) > at > org.apache.accumulo.core.client.impl.ServerClient.execute(ServerClient.java:71) > ... 4 more > > Any help would be greatly appreciated! > Thanks, > Duane > > --047d7b33d6ce61c7d004c973f79e--