accumulo-notifications mailing list archives

From "Eric Newton (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ACCUMULO-2645) tablet stuck unloading
Date Mon, 07 Apr 2014 19:55:14 GMT

    [ https://issues.apache.org/jira/browse/ACCUMULO-2645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13962188#comment-13962188 ]

Eric Newton commented on ACCUMULO-2645:
---------------------------------------

Possible ways of detecting this problem in the future:

* UnloadTabletHandler could issue a warning if a tablet does not unload
* master could generate warnings about unload requests that are old
* the monitor could display the number of unload requests outstanding in the tserver
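
A minimal sketch of how the three ideas above could share one piece of bookkeeping: record when each unload request arrives, drop it when the unload completes, and expose both a count (for the monitor) and a list of old requests (for warnings). The class UnloadRequestTracker, its methods, and the threshold parameter are hypothetical names for illustration; none of this is existing Accumulo code.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical helper (not part of Accumulo): tracks outstanding unload requests. */
class UnloadRequestTracker {
    // Age in milliseconds after which an outstanding request is considered stale (assumed knob).
    private final long warnAfterMillis;
    // Tablet identifier -> time the first unload request for it was seen.
    private final Map<String, Long> outstanding = new ConcurrentHashMap<>();

    UnloadRequestTracker(long warnAfterMillis) {
        this.warnAfterMillis = warnAfterMillis;
    }

    /** Record an unload request; repeated requests keep the oldest timestamp. */
    void requested(String tablet, long nowMillis) {
        outstanding.putIfAbsent(tablet, nowMillis);
    }

    /** Call when the tablet has actually finished unloading. */
    void completed(String tablet) {
        outstanding.remove(tablet);
    }

    /** Number of unload requests still outstanding, e.g. for display on a monitor page. */
    int outstandingCount() {
        return outstanding.size();
    }

    /** Tablets whose unload request is at least warnAfterMillis old at the given time. */
    List<String> stale(long nowMillis) {
        List<String> result = new ArrayList<>();
        for (Map.Entry<String, Long> e : outstanding.entrySet()) {
            if (nowMillis - e.getValue() >= warnAfterMillis) {
                result.add(e.getKey());
            }
        }
        return result;
    }
}
```

A periodic task could call stale(System.currentTimeMillis()) and log a warning for each tablet returned. Keeping the oldest timestamp per tablet means repeated requests for the same tablet do not reset its age.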



> tablet stuck unloading
> ----------------------
>
>                 Key: ACCUMULO-2645
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-2645
>             Project: Accumulo
>          Issue Type: Bug
>          Components: tserver
>    Affects Versions: 1.4.4
>         Environment: very large production cluster, CDH3u5
>            Reporter: Eric Newton
>            Assignee: Eric Newton
>
>  * master failed to balance
>  * custom balancer refused to balance while migrations were in place
>  * tablet server was not unloading the tablet
>  * tablet server was otherwise serving tablets, providing status
>  * memory dump determined that there were 21K UnloadTabletHandler objects
>  * jstack showed UnloadTabletHandler in Tablet.completeClose, line 2674
>  * the last print of the debug message "completeClose(safeState=true, completeClose=true)" occurred 9 days ago
>  * there was a query that had been running for 9 days



--
This message was sent by Atlassian JIRA
(v6.2#6252)
