hbase-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From R W <aijw...@gmail.com>
Subject Re: What is the best practice for backup HBase data?
Date Wed, 02 Apr 2014 14:00:31 GMT
Hi JM

I see your point, actually we are trying to backup the hbase data to AWS S3
on a daily/weekly basis, that's why in my first mail i think maybe
org.apache.hadoop.hbase.mapreduce.Export /
org.apache.hadoop.hbase.mapreduce.Import
would be the best choice for us, any idea?

Thanks
aij


On Wed, Apr 2, 2014 at 9:51 PM, Jean-Marc Spaggiari <jean-marc@spaggiari.org
> wrote:

> Hi,
>
> For incremental, you might want to look at replication....
>
> JM
>
>
> 2014-04-02 9:49 GMT-04:00 R W <aijwjia@gmail.com>:
>
> > Hi JM
> >
> > Here is the problem if we use the ExportSnapshot feature to export data
> to
> > the dest cluster, it seems we cannot do incremental backup.
> >
> > Thanks
> > aij
> >
> >
> > On Wed, Apr 2, 2014 at 7:54 PM, Jean-Marc Spaggiari <
> > jean-marc@spaggiari.org
> > > wrote:
> >
> > > Hi,
> > >
> > > You can take a look at replication. Activate replication from a date X
> > and
> > > then copy table from date 0 to date x from the origin to the dest
> > cluster.
> > >
> > > You can also export the snapshot. Take a look at 15.8.8 here:
> > > http://hbase.apache.org/book/ops.snapshots.html
> > >
> > > JM
> > >
> > >
> > > 2014-04-02 7:23 GMT-04:00 R W <aijwjia@gmail.com>:
> > >
> > > > Hi Esteban
> > > >
> > > > I checked the snapshot feature and tried myself, it's very good, one
> of
> > > the
> > > > introduction
> > > >
> > > >
> > >
> >
> http://blog.cloudera.com/blog/2013/03/introduction-to-apache-hbase-snapshots/mentioned
> > > > about:
> > > >
> > > > Clone a snapshot: This operation creates a new table using the same
> > > schema
> > > > > and with the same data present in the specified snapshot. The
> result
> > of
> > > > > this operation is a new fully functional table that can can be
> > modified
> > > > > with no impact on the original table or the snapshot.
> > > >
> > > >
> > > > I think this clone operation will make a copy of the snapshot, then
> > > create
> > > > the new table from the copy of the snapshot, am i correct? Otherwise,
> > > > modification to the new table will change the snapshot, right?
> > > >
> > > > Another question, if we want to backup hbase data somewhere else, it
> > > seems
> > > > we cannot go with snapshot feature, we want the data to be backup
> even
> > > > after the whole Hadoop cluster down, any idea?
> > > >
> > > > Thanks
> > > > aij
> > > >
> > > >
> > > > On Wed, Apr 2, 2014 at 2:12 PM, Esteban Gutierrez <
> > esteban@cloudera.com
> > > > >wrote:
> > > >
> > > > > Hello Aij,
> > > > >
> > > > > Snapshots are the suggested method since HBase 0.94.6, they provide
> > > > better
> > > > > consistency for backing up data in HBase. You can find more
> > information
> > > > in
> > > > > the HBase Book here:
> > > > >
> > > > > https://hbase.apache.org/book.html#ops.snapshots
> > > > >
> > > > > Depending on your use case and resources you might want to consider
> > > > > replication as well:
> > > > >
> > > > > http://hbase.apache.org/replication.html
> > > > >
> > > > > cheers,
> > > > > esteban.
> > > > >
> > > > >
> > > > > --
> > > > > Cloudera, Inc.
> > > > >
> > > > >
> > > > >
> > > > > On Tue, Apr 1, 2014 at 10:56 PM, R W <aijwjia@gmail.com> wrote:
> > > > >
> > > > > > Hi Guys
> > > > > >
> > > > > > I'm using hbase org.apache.hadoop.hbase.mapreduce.Export
> > > > > > / org.apache.hadoop.hbase.mapreduce.Import to backup and restore
> > > HBase
> > > > > > data, at least it's good to me, i would like to know if there
are
> > any
> > > > > > better solutions or practices on how to backup HBase data, that
> > will
> > > be
> > > > > > really helpful for us, thanks.
> > > > > >
> > > > > > Cheers
> > > > > > aij
> > > > > >
> > > > >
> > > >
> > >
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message