Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5E875182A4 for ; Tue, 10 Nov 2015 17:54:12 +0000 (UTC) Received: (qmail 75331 invoked by uid 500); 10 Nov 2015 17:54:11 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 75172 invoked by uid 500); 10 Nov 2015 17:54:11 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 75141 invoked by uid 99); 10 Nov 2015 17:54:11 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 10 Nov 2015 17:54:11 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 18FA92C1F58 for ; Tue, 10 Nov 2015 17:54:11 +0000 (UTC) Date: Tue, 10 Nov 2015 17:54:11 +0000 (UTC) From: "Lars Hofhansl (JIRA)" To: issues@hbase.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (HBASE-14791) [0.98] CopyTable is extremely slow when moving delete markers MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Lars Hofhansl created HBASE-14791: ------------------------------------- Summary: [0.98] CopyTable is extremely slow when moving delete markers Key: HBASE-14791 URL: https://issues.apache.org/jira/browse/HBASE-14791 Project: HBase Issue Type: Bug Affects Versions: 0.98.16 Reporter: Lars Hofhansl We found that some of our copy table job run for many hours, even when there isn't that much data to copy. [~vik.karma] did his magic and found that the issue with copying delete markers (we use raw mode to also move deletes across). Looking at the code in 0.98 it's immediately obvious that deletes (unlike puts) are not batched and hence sent to the other side one by one, cause a network RTT for each delete marker. Looks like in trunk it's doing the right thing (using BufferedMutators for all mutations in TableOutputFormat). So likely only a 0.98 (and 1.0, 1.1, 1.2?) issue. -- This message was sent by Atlassian JIRA (v6.3.4#6332)