hbase-issues mailing list archives

From "Lars Hofhansl (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-8525) Use sleep multiplier when choosing sinks in ReplicationSource
Date Tue, 21 May 2013 23:32:20 GMT

    [ https://issues.apache.org/jira/browse/HBASE-8525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13663588#comment-13663588
] 

Lars Hofhansl commented on HBASE-8525:
--------------------------------------

Tracked down the crazy logging scenario. We've been playing around with replication, and in
the process a single region server took over most of the other servers' queues. Since the slave
cluster is down (by design in this test), we get 3 log messages per second (one for each quorum
peer in the slave) for every queue it manages, which adds up to a crazy amount of logging.

I now want to introduce a mechanism that can wait even longer if the slave system is not
available (up to a few minutes should be fine if the slave is truly down).
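
A rough sketch of the kind of backoff meant here, not the actual patch. The class name,
the 1 second base sleep, and the cap of 300 (about five minutes) are all hypothetical
values picked for illustration; the idea is only that each failed attempt to reach the
slave raises a multiplier, and the wait is the base sleep times that multiplier, up to a cap.

```java
// Hypothetical sketch: names and constants are assumptions, not taken from HBase.
class BackoffSketch {
    // Assumed base sleep between retries, in milliseconds.
    static final long BASE_SLEEP_MS = 1000L;
    // Assumed cap on the multiplier: 300 * 1s = 5 minutes at most.
    static final int MAX_MULTIPLIER = 300;

    // Wait time for a given multiplier, clamped to [1, MAX_MULTIPLIER].
    static long sleepMillis(int multiplier) {
        int capped = Math.min(Math.max(multiplier, 1), MAX_MULTIPLIER);
        return BASE_SLEEP_MS * capped;
    }

    // Multiplier to use after another failed attempt; never exceeds the cap.
    static int nextMultiplier(int multiplier) {
        return Math.min(multiplier + 1, MAX_MULTIPLIER);
    }
}
```

A caller would reset the multiplier to 1 once the slave is reachable again, so a healthy
cluster keeps the short retry interval and only a slave that is truly down gets the long waits.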
                
> Use sleep multiplier when choosing sinks in ReplicationSource
> -------------------------------------------------------------
>
>                 Key: HBASE-8525
>                 URL: https://issues.apache.org/jira/browse/HBASE-8525
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Lars Hofhansl
>            Priority: Trivial
>             Fix For: 0.98.0, 0.95.1, 0.94.9
>
>         Attachments: 8525-0.94.txt
>
>
> Currently we see this every second, filling up the log:
> {code}
> 2013-05-10 18:36:00,766 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection
to server ist6-mnds1-2-sfm.ops.sfdc.net/10.224.156.197:2181. Will not attempt to authenticate
using SASL (Unable to locate a login configuration)
> 2013-05-10 18:36:00,767 WARN org.apache.zookeeper.ClientCnxn: Session 0x0 for server
null, unexpected error, closing socket connection and attempting reconnect
> java.net.ConnectException: Connection refused
> 	at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
> 	at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:599)
> 	at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
> 	at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
> 2013-05-10 18:36:01,868 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection
to server ist6-mnds1-4-sfm.ops.sfdc.net/10.224.156.199:2181. Will not attempt to authenticate
using SASL (Unable to locate a login configuration)
> 2013-05-10 18:36:01,870 WARN org.apache.zookeeper.ClientCnxn: Session 0x0 for server
null, unexpected error, closing socket connection and attempting reconnect
> java.net.ConnectException: Connection refused
> 	at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
> 	at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:599)
> 	at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
> 	at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
> 2013-05-10 18:36:01,971 INFO org.apache.zookeeper.ClientCnxn: Opening socket connection
to server ist6-mnds1-3-sfm.ops.sfdc.net/10.224.156.198:2181. Will not attempt to authenticate
using SASL (Unable to locate a login configuration)
> {code}
> Patch is trivial.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
