Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9ABF810EA9 for ; Wed, 29 Jan 2014 17:40:15 +0000 (UTC) Received: (qmail 51322 invoked by uid 500); 29 Jan 2014 17:40:13 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 51238 invoked by uid 500); 29 Jan 2014 17:40:13 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 51109 invoked by uid 99); 29 Jan 2014 17:40:10 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 29 Jan 2014 17:40:10 +0000 Date: Wed, 29 Jan 2014 17:40:10 +0000 (UTC) From: "Jean-Daniel Cryans (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HBASE-9501) Provide throttling for replication MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-9501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13885575#comment-13885575 ] Jean-Daniel Cryans commented on HBASE-9501: ------------------------------------------- I think the patch should be refactored so that the bandwidth limiter is just a tool you use and that tells you how long you should sleep given some information you provide. This way you don't have to start tracking how much time you've actually slept in the unit tests since this tends to be very unreliable, especially on the build machines. It will also make the code more readable in ReplicationSource and won't add a lot of runtime to TestReplicationSmallTests. > Provide throttling for replication > ---------------------------------- > > Key: HBASE-9501 > URL: https://issues.apache.org/jira/browse/HBASE-9501 > Project: HBase > Issue Type: Improvement > Components: Replication > Reporter: Feng Honghua > Assignee: Feng Honghua > Attachments: HBASE-9501-trunk_v0.patch, HBASE-9501-trunk_v1.patch > > > When we disable a peer for a time of period, and then enable it, the ReplicationSource in master cluster will push the accumulated hlog entries during the disabled interval to the re-enabled peer cluster at full speed. > If the bandwidth of the two clusters is shared by different applications, the push at full speed for replication can use all the bandwidth and severely influence other applications. > Though there are two config replication.source.size.capacity and replication.source.nb.capacity to tweak the batch size each time a push delivers, but if decrease these two configs, the number of pushes increase, and all these pushes proceed continuously without pause. And no obvious help for the bandwidth throttling. > From bandwidth-sharing and push-speed perspective, it's more reasonable to provide a bandwidth up limit for each peer push channel, and within that limit, peer can choose a big batch size for each push for bandwidth efficiency. > Any opinion? -- This message was sent by Atlassian JIRA (v6.1.5#6160)