incubator-cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Brian Tarbox <tar...@cabotresearch.com>
Subject Re: nodetool repair saying "starting" and then nothing, and nothing in any of the server logs either
Date Tue, 01 Jul 2014 18:20:46 GMT
Does this output from jstack indicate a problem?

"ReadRepairStage:12170" daemon prio=10 tid=0x00007f9dcc018800 nid=0x7361
waiting on condition [0x00007f9db540c000]
   java.lang.Thread.State: TIMED_WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x0000000613e049d8> (a
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
        at
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
        at
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082)
        at
java.util.concurrent.LinkedBlockingQueue.poll(LinkedBlockingQueue.java:467)
        at
java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:1068)
        at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1130)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:744)

"ReadRepairStage:12169" daemon prio=10 tid=0x00007f9dd4009000 nid=0x7340
waiting on condition [0x00007f9db53cb000]
   java.lang.Thread.State: TIMED_WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x0000000613e049d8> (a
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
        at
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
        at
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082)
        at
java.util.concurrent.LinkedBlockingQueue.poll(LinkedBlockingQueue.java:467)
        at
java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:1068)
        at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1130)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:744)

"ReadRepairStage:12168" daemon prio=10 tid=0x00007f9dd001d000 nid=0x733f
waiting on condition [0x00007f9db51a6000]
   java.lang.Thread.State: TIMED_WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x0000000613e049d8> (a
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
        at
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
        at
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082)
        at
java.util.concurrent.LinkedBlockingQueue.poll(LinkedBlockingQueue.java:467)
        at
java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:1068)
        at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1130)
        at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:744)




On Tue, Jul 1, 2014 at 2:09 PM, Brian Tarbox <tarbox@cabotresearch.com>
wrote:

> We're running 1.2.13.
>
> Any chance that doing a rolling-restart would help?
>
> Would running without the "-pr" improve the odds?
>
> Thanks.
>
>
> On Tue, Jul 1, 2014 at 1:40 PM, Robert Coli <rcoli@eventbrite.com> wrote:
>
>> On Tue, Jul 1, 2014 at 9:24 AM, Brian Tarbox <tarbox@cabotresearch.com>
>> wrote:
>>
>>> I have a six node cluster in AWS (repl:3) and recently noticed that
>>> repair was hanging.  I've run with the "-pr" switch.
>>>
>>
>> It'll do that.
>>
>> What version of Cassandra?
>>
>> =Rob
>>
>>
>
>

Mime
View raw message