accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Keith Turner (JIRA)" <>
Subject [jira] [Commented] (ACCUMULO-1345) Provide feedback that a compaction is "stuck"
Date Fri, 06 Sep 2013 22:50:51 GMT


Keith Turner commented on ACCUMULO-1345:

Here is an example of what this the commit I just made does.  Below shows configuring accumulo
to warn if a compaction does not make progress in 30 seconds AND then setting the slow iterator
to sleep for 60 seconds.

root@test16> config -s tserver.compaction.warn.time=30s
root@test16> table foo
root@test16 foo> config -t foo -s table.iterator.minc.slow=100,org.apache.accumulo.test.functional.SlowIterator
root@test16 foo> config -t foo -s table.iterator.minc.slow.opt.sleepTime=60000
root@test16 foo> insert r1 cf1 cq1 v1
root@test16 foo> flush -t foo 
2013-09-06 18:43:38,299 [shell.Shell] INFO : Flush of table foo initiated...

Eventually, the following shows up in the tserver logs.   

2013-09-06 18:44:27,044 [tabletserver.CompactionWatcher] WARN : Compaction of 2<< has
not made progress for at least 39999ms
java.lang.Exception: Possible stack trace of compaction stuck on 2<<
        at java.lang.Thread.sleep(Native Method)
        at org.apache.accumulo.core.util.UtilWaitThread.sleep(
        at org.apache.accumulo.server.tabletserver.Compactor.compactLocalityGroup(
        at org.apache.accumulo.server.tabletserver.Tablet.minorCompact(
        at org.apache.accumulo.server.tabletserver.Tablet.access$4300(
        at org.apache.accumulo.server.tabletserver.Tablet$
        at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(
        at java.util.concurrent.ThreadPoolExecutor$
2013-09-06 18:44:38,384 [tabletserver.Compactor] DEBUG: Compaction 2<< 1 read | 1 written
|      0 entries/sec | 60.006 secs
2013-09-06 18:44:47,043 [tabletserver.CompactionWatcher] INFO : Compaction of 2<< is
no longer stuck

> Provide feedback that a compaction is "stuck"
> ---------------------------------------------
>                 Key: ACCUMULO-1345
>                 URL:
>             Project: Accumulo
>          Issue Type: Bug
>          Components: tserver
>            Reporter: Mike Drob
>            Assignee: Keith Turner
>             Fix For: 1.6.0
> The system should be able to detect when a compaction has not read or written data in
a while, indicating that it may be stuck on something (e.g. an infinite loop in a user iterator).

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see:

View raw message