hbase-issues mailing list archives

From "Andrew Purtell (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HBASE-11142) Taking snapshots can leave sockets on the master stuck in CLOSE_WAIT state
Date Sat, 10 May 2014 22:14:28 GMT
Andrew Purtell created HBASE-11142:
--------------------------------------

             Summary: Taking snapshots can leave sockets on the master stuck in CLOSE_WAIT state
                 Key: HBASE-11142
                 URL: https://issues.apache.org/jira/browse/HBASE-11142
             Project: HBase
          Issue Type: Bug
    Affects Versions: 0.94.2
            Reporter: Andrew Purtell


As reported by Hansi Klose on user@. 
{quote}
We use a script to take snapshots on a regular basis and delete old ones.
We noticed that the web interface of the HBase master was no longer working because of too many open files.
The master had reached its open file limit of 32768.
When I ran lsof, I saw a lot of TCP handles in CLOSE_WAIT with the regionserver as the target.
On the regionserver there is just one connection to the HBase master.
I can see that the count of CLOSE_WAIT handles grows each time I take a snapshot. When I delete one, nothing changes.
Each time I take a snapshot there are 20 to 30 new CLOSE_WAIT handles.
{quote}
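
To check the growth per snapshot, the CLOSE_WAIT handles on the master can be counted before and after taking one. The following is only a minimal sketch, not part of the original report: it assumes the text comes from something like `lsof -n -P -i TCP -a -p <master pid>`, where each connection line carries a `local->remote` field followed by the state in parentheses. The function name `count_close_wait` is hypothetical.

```python
from collections import Counter


def count_close_wait(lsof_output: str) -> Counter:
    """Count CLOSE_WAIT sockets per remote host in lsof-style text.

    Assumes each connection line has a field of the form
    local_host:port->remote_host:port and ends with the TCP state
    in parentheses, e.g. "(CLOSE_WAIT)".
    """
    counts = Counter()
    for line in lsof_output.splitlines():
        fields = line.split()
        # Skip the header, blank lines, and sockets in any other state.
        if not fields or fields[-1] != "(CLOSE_WAIT)":
            continue
        # Locate the local->remote field; skip the line if it is missing.
        conn = next((f for f in fields if "->" in f), None)
        if conn is None:
            continue
        remote = conn.split("->", 1)[1]
        # Drop the port, keeping only the remote host.
        host = remote.rsplit(":", 1)[0]
        counts[host] += 1
    return counts
```

Running this on output captured before and after a snapshot and comparing the per-host totals should show whether the 20 to 30 new handles all point at regionservers.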



--
This message was sent by Atlassian JIRA
(v6.2#6252)
