accumulo-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Eric Newton <eric.new...@gmail.com>
Subject Re: table data missing
Date Thu, 19 Jul 2012 16:43:32 GMT
You should have as many loggers as you have tablet servers.

Your log recovery is failing because the loggers are not running.

Please start all your loggers, and/or determine while they are going
down.  Then restart the master and the system should recover.

-Eric

On Thu, Jul 19, 2012 at 12:39 PM, Perko, Ralph J <Ralph.Perko@pnnl.gov> wrote:
> From the master log file at startup:
>
> 9 08:38:40,612 [master.CoordinateRecoveryTask] WARN : Unable to recover
> 192.168.1.244:11224/65911601-d684-43e8-94b3-cdf959590298(java.io.IOExceptio
> n: org.apache.thrift.transport.TTransportException:
> java.net.ConnectException: Connection refused)
> java.io.IOException: org.apache.thrift.transport.TTransportException:
> java.net.ConnectException: Connection refused
>         at
> org.apache.accumulo.server.tabletserver.log.RemoteLogger.<init>(RemoteLogge
> r.java:99)
>         at
> org.apache.accumulo.server.master.CoordinateRecoveryTask$RecoveryJob.startC
> opy(CoordinateRecoveryTask.java:132)
>         at
> org.apache.accumulo.server.master.CoordinateRecoveryTask$RecoveryJob.access
> $400(CoordinateRecoveryTask.java:114)
>         at
> org.apache.accumulo.server.master.CoordinateRecoveryTask.recover(Coordinate
> RecoveryTask.java:289)
>         at
> org.apache.accumulo.server.master.Master$TabletGroupWatcher.run(Master.java
> :1351)
> Caused by: org.apache.thrift.transport.TTransportException:
> java.net.ConnectException: Connection refused
>         at
> org.apache.accumulo.core.client.impl.ThriftTransportPool.createNewTransport
> (ThriftTransportPool.java:475)
>         at
> org.apache.accumulo.core.client.impl.ThriftTransportPool.getTransport(Thrif
> tTransportPool.java:464)
>         at
> org.apache.accumulo.core.client.impl.ThriftTransportPool.getTransport(Thrif
> tTransportPool.java:441)
>         at org.apache.accumulo.core.util.ThriftUtil.getClient(ThriftUtil.java:67)
>         at
> org.apache.accumulo.server.tabletserver.log.RemoteLogger.<init>(RemoteLogge
> r.java:96)
>         ... 4 more
> Caused by: java.net.ConnectException: Connection refused
>         at sun.nio.ch.Net.connect(Native Method)
>         at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:500)
>         at sun.nio.ch.SocketAdaptor.connect(SocketAdaptor.java:81)
>         at sun.nio.ch.SocketAdaptor.connect(SocketAdaptor.java:65)
>         at
> org.apache.accumulo.core.util.TTimeoutTransport.create(TTimeoutTransport.ja
> va:39)
>         at
> org.apache.accumulo.core.client.impl.ThriftTransportPool.createNewTransport
> (ThriftTransportPool.java:473)
>         ... 8 more
> 19 08:38:40,652 [master.CoordinateRecoveryTask] WARN : Recovery of
> 192.168.1.244:11224:65911601-d684-43e8-94b3-cdf959590298 failed
> 19 08:38:45,071 [master.CoordinateRecoveryTask] INFO : Deleting recovery
> directory org.apache.hadoop.fs.FileStatus@75641fd
> 19 09:08:40,848 [master.CoordinateRecoveryTask] WARN : Recovery taking too
> long, giving up
> 19 09:08:40,848 [master.EventCoordinator] INFO : Log recovery
> 192.168.1.244:11224/65911601-d684-43e8-94b3-cdf959590298 complete
>
>
>
>
> On 7/19/12 9:34 AM, "Keith Turner" <keith@deenlo.com> wrote:
>
>>What you are describing sounds like ZooKeeper is up and running (this
>>is where table config info is stored, so thats why you can list
>>tables), but not tablets are assigned to tablet servers.  Need to
>>determine why no tablets are assigned.  Look in the master log for
>>anything suspicious related to tablet assignment.
>>
>>
>>On Thu, Jul 19, 2012 at 12:28 PM, Perko, Ralph J <Ralph.Perko@pnnl.gov>
>>wrote:
>>> Hi,
>>>
>>> I restarted my cluster and now the Accumulo Overview page says there
>>>are 0 tables.  However, when I go to the Table List page, all my tables
>>>are listed with a status of "ONLINE" but nothing else.  From the
>>>Accumulo shell I cannot access any of my tables but I can list them,
>>>like the web site.  Hadoop is up and healthy.  The tablet servers are up
>>>but each states 0 for Hosted Tablets.  Do you know what is causing this
>>>and how to fix it?
>>>
>>> Thanks,
>>> Ralph
>>>
>>>
>

Mime
View raw message