hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "stack (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HBASE-14420) Zombie Stomping Session
Date Fri, 02 Oct 2015 19:23:27 GMT

     [ https://issues.apache.org/jira/browse/HBASE-14420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

stack updated HBASE-14420:
--------------------------
    Attachment: hangers.txt

Little report on last 20 patch runs created by doing:

{code}
 for i in `seq 15839 15859`; do python ./dev-support/findHangingTests.py https://builds.apache.org/view/H-L/view/HBase/job/PreCommit-HBASE-Build/$i/consoleText
>> /tmp/report.txt ; done
{code}

4 of 20 passed
1 failed because patch did not apply
1 was a 0.98 build that failed a DLR test
4 had hanging tests/zombies.
Others were test failures... Flakies

Hanger incidence is falling but not cured yet.

> Zombie Stomping Session
> -----------------------
>
>                 Key: HBASE-14420
>                 URL: https://issues.apache.org/jira/browse/HBASE-14420
>             Project: HBase
>          Issue Type: Umbrella
>          Components: test
>            Reporter: stack
>            Assignee: stack
>            Priority: Critical
>         Attachments: hangers.txt
>
>
> Patch build are now failing most of the time because we are dropping zombies. I confirm
we are doing this on non-apache build boxes too.
> Left-over zombies consume resources on build boxes (OOME cannot create native threads).
Having to do multiple test runs in the hope that we can get a non-zombie-making build or making
(arbitrary) rulings that the zombies are 'not related' is a productivity sink. And so on...
> This is an umbrella issue for a zombie stomping session that started earlier this week.
Will hang sub-issues of this one. Am running builds back-to-back on little cluster to turn
out the monsters.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message