Return-Path: Delivered-To: apmail-hadoop-hbase-user-archive@minotaur.apache.org Received: (qmail 48038 invoked from network); 4 Feb 2009 16:29:11 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 4 Feb 2009 16:29:11 -0000 Received: (qmail 73109 invoked by uid 500); 4 Feb 2009 16:29:09 -0000 Delivered-To: apmail-hadoop-hbase-user-archive@hadoop.apache.org Received: (qmail 73090 invoked by uid 500); 4 Feb 2009 16:29:09 -0000 Mailing-List: contact hbase-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-user@hadoop.apache.org Delivered-To: mailing list hbase-user@hadoop.apache.org Received: (qmail 73079 invoked by uid 99); 4 Feb 2009 16:29:09 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Feb 2009 08:29:09 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_HELO_PASS,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [216.86.168.183] (HELO mxout-08.mxes.net) (216.86.168.183) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Feb 2009 16:28:59 +0000 Received: from [192.168.10.105] (unknown [24.6.146.30]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) by smtp.mxes.net (Postfix) with ESMTPSA id 5253BD0B9E for ; Wed, 4 Feb 2009 11:28:38 -0500 (EST) Message-Id: <83CA1AC8-BB99-4C65-BD46-FBA21A5A99C6@wensel.net> From: Chris K Wensel To: hbase-user@hadoop.apache.org In-Reply-To: <2f81e00902040821k328e1e81g2eaa48f99a425545@mail.gmail.com> Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Mime-Version: 1.0 (Apple Message framework v930.3) Subject: Re: Backup Again Date: Wed, 4 Feb 2009 08:28:37 -0800 References: <2f81e00902040821k328e1e81g2eaa48f99a425545@mail.gmail.com> X-Mailer: Apple Mail (2.930.3) X-Virus-Checked: Checked by ClamAV on apache.org Hey Michael You could probably use Cascading to migrate data between HBase clusters. http://wiki.apache.org/hadoop/Hbase/Cascading But the code currently doesn't support multiple HBase cluster clients in a single JVM, but I'm sure it can be coded in quickly. (the code is hosted at github, so is easily cloned and patched). A benefit of using Cascading would be the ability to put in quality checks, or filter data very easily. I've already heard of users starting to migrate from HBase to a RDBMS using Cascading, and also between Hypertable and Aster Data. Hopefully those adapters will leak out for the rest of us to use. ckw On Feb 4, 2009, at 8:21 AM, Michael Dagaev wrote: > Hi, all > > I read HBASE-974 and HBASE-643 mentioned on the list > but what do you think about copying tables from the production > to a backup Hbase cluster ? I guess we do need a big iron > for such a backup cluster. > > I understand that the copy can be implemented with MR > but for now we can implement it just as a simple sequential script, > which scans the tables of the production Hbase and writes the data > to the backup Hbase. > > Does it make sense? > > Thank you for your cooperation, > M. -- Chris K Wensel chris@wensel.net http://www.cascading.org/ http://www.scaleunlimited.com/