accumulo-notifications mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Eric Newton (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (ACCUMULO-2351) Master memory leak
Date Tue, 11 Feb 2014 21:39:20 GMT

    [ https://issues.apache.org/jira/browse/ACCUMULO-2351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13898349#comment-13898349
] 

Eric Newton commented on ACCUMULO-2351:
---------------------------------------

The master is regularly run over clusters with "hundreds and hundreds" of nodes, half a million
tablets and 40+ tables, with as little as Xmx3g.

Can you provide the logs? Especially the specific text of the OoM?


> Master memory leak
> ------------------
>
>                 Key: ACCUMULO-2351
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-2351
>             Project: Accumulo
>          Issue Type: Bug
>          Components: master
>    Affects Versions: 1.5.0
>            Reporter: John Vines
>
> This has been seen a few times. Master with Xms1g and Xmx4g, which should be more than
enough. Most recent case, with 44 nodes, 1.74k tablets, and 22 tables including !METADATA.
There are NO conspicuous messages in the master (just DefaultLoadBalancer messages for each
table). Possible exception for an error of "received invalid frame size of -..., are you using
TTframeProtocol (can't remember exact message). But then the master out file has a message
about OoM received, kill -9. 
> I don't really know how to get more information out of it for when this does occur again.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Mime
View raw message