Return-Path: Delivered-To: apmail-hadoop-hbase-user-archive@minotaur.apache.org Received: (qmail 52965 invoked from network); 10 Feb 2010 03:08:19 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 10 Feb 2010 03:08:19 -0000 Received: (qmail 46851 invoked by uid 500); 10 Feb 2010 03:08:18 -0000 Delivered-To: apmail-hadoop-hbase-user-archive@hadoop.apache.org Received: (qmail 46718 invoked by uid 500); 10 Feb 2010 03:08:17 -0000 Mailing-List: contact hbase-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-user@hadoop.apache.org Delivered-To: mailing list hbase-user@hadoop.apache.org Received: (qmail 46708 invoked by uid 99); 10 Feb 2010 03:08:17 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 10 Feb 2010 03:08:17 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of ryanobjc@gmail.com designates 209.85.223.189 as permitted sender) Received: from [209.85.223.189] (HELO mail-iw0-f189.google.com) (209.85.223.189) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 10 Feb 2010 03:08:10 +0000 Received: by iwn27 with SMTP id 27so1121611iwn.20 for ; Tue, 09 Feb 2010 19:07:48 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=Xsj4B8iXGQYobwbsPKxBb9abe7dYTeqT4addwXxxFuY=; b=cANQUkRtcjaZkaLLgcjPl3yNw5AodmAuDuTMh9N5YTGMkcJC006pvoKyl5RgKdwo9/ HBYRl72EMI7VkFC71Csu/m0QXwfwU/mhD9yMcWt5c65w/80559k51fpqMaHHEYwstRgV uPF2tp7ouKLcaFdgTXcqatMguiVozrWpRW+fg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=MEsNBXGskAEQZYJ/WpshbjLbhLHP6kh09Dv+Q/yMZGd1DoULm+uoUIT9lWISRaUDgH WfxB29pPfIvMRw64v0ApuxzXOsNCWwW6Cyr9CMMfqtZy5b02ni1TVBrLw+1zNFHLeqjR XU968x4jx5rB3XqN3DwBr3JqTfrUKtGpAEwhc= MIME-Version: 1.0 Received: by 10.231.191.135 with SMTP id dm7mr4125664ibb.46.1265771268262; Tue, 09 Feb 2010 19:07:48 -0800 (PST) In-Reply-To: <7c457ebe1002091902l707cb116g1079257ec7292105@mail.gmail.com> References: <1265770076.28098.299.camel@puma> <78568af11002091850p494d0815gef5f475f4d0c4479@mail.gmail.com> <7c457ebe1002091902l707cb116g1079257ec7292105@mail.gmail.com> Date: Tue, 9 Feb 2010 19:07:48 -0800 Message-ID: <78568af11002091907o9dd4f45rb265e9c6330ccb46@mail.gmail.com> Subject: Re: HBase export/import From: Ryan Rawson To: hbase-user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable You can also use distcp instead of the copyToLocal && scp && copyFromLocal chain you have there. On Tue, Feb 9, 2010 at 7:02 PM, Dan Washusen wrote: > +1 > > I use this method when performance testing on different data sets. =A0I h= ave > several datasets to test on (varying sizes, etc). =A0 When I want to swit= ch > datasets I just shut down hbase and rename the /hbase directory... > > e.g. (assuming hbase is not running) > hadoop/bin/hadoop fs -mv /hbase /hbase.small > hadoop/bin/hadoop fs -mv /hbase.large /hbase > > When I want to move my data between clusters I use: > hadoop/bin/hadoop fs -copyToLocal /hbase.large /tmp/hbase.large > scp -r /tmp/hbase.large user@host:/tmp > ssh user@host > hadoop/bin/hadoop fs -put /tmp/hbase.large /hbase > > > Very handy :) > > > On 10 February 2010 13:50, Ryan Rawson wrote: > >> If you stop the source cluster then you can distcp the /hbase to the >> other cluster. Done. A perfect copy. >> >> That is probably the most efficient/highest performing way. >> >> On Tue, Feb 9, 2010 at 6:47 PM, James Baldassari wrot= e: >> > Hi, >> > >> > I'm wondering if it's possible to export all data from one HBase clust= er >> > and import it into another. =A0We have a lot of data that we've import= ed >> > into our staging HBase environment, and rather than repeating the >> > lengthy import process in our production environment we would prefer t= o >> > just copy all the data directly from HBase/HDFS in staging into >> > production. =A0Is there an easy way to do this? =A0I know Hadoop has s= ome >> > distributed copy functionality, but I don't know if this will work wit= h >> > HBase. =A0The number of region servers and the replication factor will= be >> > the same in the source and destination environments, but the >> > hostnames/IPs will be different. =A0The production environment is >> > completely empty right now, so we don't need to worry about overwritin= g >> > data. >> > >> > I came across these links while searching for information HBase >> > export/import: >> > >> > http://issues.apache.org/jira/browse/HBASE-897 >> > http://issues.apache.org/jira/browse/HBASE-1684 >> > >> http://hadoop.apache.org/hbase/docs/current/api/org/apache/hadoop/hbase/= mapreduce/Export.html >> > >> > Has anyone used these tools? =A0Is there a better way? >> > >> > Thanks, >> > James >> > >> > >> > >> >