hbase-dev mailing list archives

From "stack (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-1569) rare race condition can take down a regionserver.
Date Tue, 23 Jun 2009 23:33:07 GMT

    [ https://issues.apache.org/jira/browse/HBASE-1569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12723361#action_12723361 ]

stack commented on HBASE-1569:
------------------------------

At first I thought the use of ConcurrentSkipListSet was the problem, but thinking on it more,
we instead need to make the code tolerate the fact that a file has been moved or removed.  The
alternative is syncing around file operations until they complete, which is too much to ask.
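
To make the idea concrete, here is a minimal sketch of a metrics pass that tolerates a file
disappearing underneath it. Everything in it is hypothetical and is not the HBase code or the
eventual patch: FileStub stands in for a store file, readIndexSize() and totalIndexSize() are
invented names, and the stub throws IllegalStateException where the trace in this issue shows
IllegalAccessError.

```java
public final class TolerantMetricsSketch {

    /** Hypothetical stand-in for a store file; NOT the HBase StoreFile class. */
    static final class FileStub {
        private final long indexSize;
        private volatile boolean open = true;

        FileStub(long indexSize) {
            this.indexSize = indexSize;
        }

        /** Simulates another thread (e.g. a compaction) closing or removing the file. */
        void close() {
            open = false;
        }

        /** Fails when the file is no longer open, like the "Call open first" error. */
        long readIndexSize() {
            if (!open) {
                throw new IllegalStateException("Call open first");
            }
            return indexSize;
        }
    }

    /**
     * Sums index sizes for metrics, skipping any file that was closed or
     * removed mid-iteration instead of letting the exception escape.
     */
    static long totalIndexSize(Iterable<FileStub> files) {
        long total = 0;
        for (FileStub f : files) {
            try {
                total += f.readIndexSize();
            } catch (IllegalStateException e) {
                // The file went away under us; the metric is approximate anyway,
                // so skip this file rather than take down the caller.
            }
        }
        return total;
    }

    private TolerantMetricsSketch() {
    }
}
```

The metric may be slightly low for one reporting period, which seems an acceptable trade
against locking around every file operation.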

A good while ago, an issue in metrics got the HRS stuck in an infinite loop.

Let me try to hack up a patch.

> rare race condition can take down a regionserver. 
> --------------------------------------------------
>
>                 Key: HBASE-1569
>                 URL: https://issues.apache.org/jira/browse/HBASE-1569
>             Project: Hadoop HBase
>          Issue Type: Bug
>    Affects Versions: 0.20.0
>            Reporter: ryan rawson
>            Priority: Critical
>             Fix For: 0.20.0
>
>
> This happened after more than 24 hours of heavy import load on my cluster.  Luckily the
> shutdown seemed to be clean:
> java.lang.IllegalAccessError: Call open first
>         at org.apache.hadoop.hbase.regionserver.StoreFile.getReader(StoreFile.java:356)
>         at org.apache.hadoop.hbase.regionserver.Store.getStorefilesIndexSize(Store.java:1378)
>         at org.apache.hadoop.hbase.regionserver.HRegionServer.doMetrics(HRegionServer.java:1075)
>         at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:454)
>         at java.lang.Thread.run(Thread.java:619)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

