From: lars hofhansl
To: "user@hbase.apache.org"
Subject: Re: Recommended backup/restore solution for hbase
Date: Wed, 28 Sep 2011 14:13:40 -0700 (PDT)
Message-ID: <1317244420.85268.YahooMailNeo@web121705.mail.ne1.yahoo.com>
In-Reply-To: <2D6136772A13B84E95DF6DA79E85A9F0016B66255696@NSPEXMBX-A.the-lab.llnl.gov>

You can now (0.92+) set the minimum number of versions you want to always keep around together with TTL.
See HBASE-4071

-- Lars

________________________________
From: "Buttler, David"
To: "user@hbase.apache.org"
Sent: Wednesday, September 28, 2011 2:10 PM
Subject: RE: Recommended backup/restore solution for hbase

Wouldn't using a TTL on your data automatically delete data that is older than X months? Of course, major compactions have to occur for the data to actually disappear.

See:
http://hbase.apache.org/book.html#ttl
http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/HColumnDescriptor.html#HColumnDescriptor(byte[], int, java.lang.String, boolean, boolean, int, int, java.lang.String, int)

Dave

-----Original Message-----
From: tvinod@socialyantra.com [mailto:tvinod@socialyantra.com] On Behalf Of Vinod Gupta Tankala
Sent: Wednesday, September 28, 2011 12:12 PM
To: user@hbase.apache.org
Subject: Re: Recommended backup/restore solution for hbase

Thanks, Li. I didn't know about using S3 as a datastore. Will look into this
more.

I understand that HDFS replication will help with partial hardware failure. I
wanted to protect myself against inconsistencies, as I have been bitten in
the past. That happened due to HBase fatal exceptions. One of the
reasons for that could have been standalone mode, as that is not
production ready, based on my reading of the HBase documentation.
Another use case I have: I would be writing sweeper jobs to delete user
data that is more than x months old. So in case we need to retrieve old
user data, I would like to have the ability to get old data back from
exported tables. Of course, I understand that to do so for selective user
accounts, I have to write custom jobs.

thanks
vinod

On Wed, Sep 28, 2011 at 11:49 AM, Li Pi wrote:

> What kind of situations are you looking to guard against? Partial
> hardware failure, full hardware failure (of the live cluster),
> accidentally deleting all data?
>
> HDFS provides replication that already guards against partial hardware
> failure - if this is all you need, an ephemeral store should be fine.
>
> Also, HBase can use S3 directly as a datastore. You can choose the raw
> mode, in which HBase treats S3 as a disk. There used to be a block
> based mode as well, but now that S3 has increased the object size limit
> to 5 TB, this isn't needed anymore. (Somebody correct me if I'm wrong.)
>
> On Wed, Sep 28, 2011 at 9:15 AM, Vinod Gupta Tankala wrote:
> > Hi,
> > Can someone answer these basic but important questions for me?
> > We are using HBase for our datastore and want to safeguard ourselves from
> > data corruption/data loss. Also, we are hosted on AWS EC2. Currently, I only
> > have a single node but want to prepare for scale right away, as things are
> > going to change starting in the next couple of weeks. Also, I am currently
> > using ephemeral store for HBase data.
> >
> > 1) What is the recommended AWS data store method for HBase? Should you use
> > ephemeral store and do S3 backups, or use EBS? I read and heard that EBS can
> > be expensive and also unreliable in terms of read/write latency. Of course,
> > it provides data replication and protection, so you don't have to worry
> > about that.
> >
> > 2) What is the recommended backup/restore method for HBase? I would like to
> > take periodic data snapshots and then have an import utility that will
> > incrementally import data in case I lose some regions due to corruption or
> > table inconsistencies. Also, if something catastrophic happens, I can
> > restore the whole data.
> >
> > 3) While we are at it, what are the recommended EC2 instance types for
> > running master/zookeeper/region servers? I get conflicting answers from
> > Google search - ranging from c1.xlarge to m1.xlarge.
> >
> > I would really appreciate it if someone could help me.
> >
> > thanks
> > vinod
> >
>
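
To make the "TTL together with a minimum number of versions" rule from Lars's reply at the top of this thread concrete, here is a small, self-contained Python model of it. This is only a sketch and not HBase code: the function name surviving_versions and its parameters are hypothetical, it assumes a single time unit for timestamps and TTL, and it assumes a cell counts as expired once its age is strictly greater than the TTL. In HBase itself these settings live on the column family, and, as Dave points out, expired data only actually disappears when a major compaction runs.

```python
def surviving_versions(timestamps, now, ttl, min_versions=0, max_versions=3):
    """Simplified model of which versions of ONE cell survive cleanup.

    Hypothetical helper, not an HBase API. All arguments share one time unit.
    - max_versions: keep at most this many of the newest versions.
    - ttl:          versions older than this are candidates for removal.
    - min_versions: the newest `min_versions` versions are kept even if expired.
    Returns the surviving timestamps, newest first.
    """
    # Newest first, capped at the maximum number of versions.
    newest_first = sorted(timestamps, reverse=True)[:max_versions]

    kept = []
    for rank, ts in enumerate(newest_first):
        expired = (now - ts) > ttl
        # Keep unexpired versions, plus the newest `min_versions` regardless of age.
        if not expired or rank < min_versions:
            kept.append(ts)
    return kept


if __name__ == "__main__":
    now = 1000
    cells = [100, 200, 950]  # write timestamps of three versions of one cell

    # TTL only: everything older than 100 time units is dropped.
    print(surviving_versions(cells, now, ttl=100))

    # TTL plus min_versions=2: the two newest versions stay even though one expired.
    print(surviving_versions(cells, now, ttl=100, min_versions=2))
```

With a TTL alone, a row that has not been written for a long time eventually loses all of its versions; with a minimum number of versions set, the most recent ones remain readable. That is the behavior relevant to the sweeper-job use case discussed in the thread.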