hbase-issues mailing list archives

From "stack (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-14420) Zombie Stomping Session
Date Fri, 09 Oct 2015 18:26:05 GMT

    [ https://issues.apache.org/jira/browse/HBASE-14420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14950930#comment-14950930 ]

stack commented on HBASE-14420:

Here is a report on what failed in the last ten hadoopqa runs:

   1 Hanging test : org.apache.hadoop.hbase.mapreduce.TestImportExport
   1 Hanging test : org.apache.hadoop.hbase.mapreduce.TestWALPlayer
   1 Hanging test : org.apache.hadoop.hbase.regionserver.TestHRegionFileSystem
   1 Hanging test : org.apache.hadoop.hbase.util.TestHBaseFsck
   1 Hanging test : org.apache.hadoop.hbase.wal.TestWALFiltering
   1 Hanging test : org.apache.hadoop.hbase.wal.TestWALSplit
   1 Hanging test : org.apache.hadoop.hbase.wal.TestWALSplitCompressed

   2 Failing test : org.apache.hadoop.hbase.client.TestMobSnapshotCloneIndependence
   1 Failing test : org.apache.hadoop.hbase.client.TestReplicaWithCluster
   1 Failing test : org.apache.hadoop.hbase.client.TestSnapshotCloneIndependence
   1 Failing test : org.apache.hadoop.hbase.master.TestZKLessAMOnCluster
   1 Failing test : org.apache.hadoop.hbase.mob.mapreduce.TestMobSweeper
   1 Failing test : org.apache.hadoop.hbase.security.token.TestGenerateDelegationToken
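
A tally like the one above could be produced with a small script. What follows is only a minimal sketch, not the tooling actually used for this report: it assumes each run's output has already been reduced to lines of the form "Hanging test : <class>" or "Failing test : <class>" (one line per occurrence per run), and the names `tally` and `format_report` are hypothetical.

```python
import re
from collections import Counter

# Assumed input line shape (not confirmed by the report itself):
#   "<Hanging|Failing> test : <fully.qualified.ClassName>"
LINE = re.compile(r"^\s*(Hanging|Failing) test\s*:\s*(\S+)\s*$")


def tally(lines):
    """Count how often each (kind, test class) pair occurs; skip other lines."""
    counts = Counter()
    for line in lines:
        m = LINE.match(line)
        if m:
            counts[(m.group(1), m.group(2))] += 1
    return counts


def format_report(counts):
    """Return report rows: hanging tests first, then failing, each sorted by name."""
    rows = sorted(counts.items(), key=lambda kv: (kv[0][0] != "Hanging", kv[0][1]))
    return [f"{n:4d} {kind} test : {name}" for (kind, name), n in rows]
```

Feeding it the concatenated lines from the last ten runs and printing each row of `format_report(tally(lines))` would give a listing in the same shape as the one above.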

Elliott did work to tighten up TestSnapshotCloneIndependence, which should also help
TestMobSnapshotCloneIndependence (because Matteo did a refactor so one is a subclass of the other).
[~jingcheng.du@intel.com] just did work on TestMobSweeper, so that should help too. TestImportExport
should be better after a recent commit. Let me look at the others.

> Zombie Stomping Session
> -----------------------
>                 Key: HBASE-14420
>                 URL: https://issues.apache.org/jira/browse/HBASE-14420
>             Project: HBase
>          Issue Type: Umbrella
>          Components: test
>            Reporter: stack
>            Assignee: stack
>            Priority: Critical
>         Attachments: hangers.txt, none_fix (1).txt, none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt
>
> Patch builds are now failing most of the time because we are dropping zombies. I confirm
> we are doing this on non-apache build boxes too.
> Left-over zombies consume resources on build boxes (OOME cannot create native threads).
> Having to do multiple test runs in the hope that we can get a non-zombie-making build, or making
> (arbitrary) rulings that the zombies are 'not related', is a productivity sink. And so on...
> This is an umbrella issue for a zombie stomping session that started earlier this week.
> Will hang sub-issues of this one. Am running builds back-to-back on a little cluster to turn
> out the monsters.
