hbase-user mailing list archives

From Jacques <whs...@gmail.com>
Subject Re: Quick Question about Bulk loading of HFiles & Timestamps
Date Fri, 05 Aug 2011 23:24:04 GMT
Perfect.

thanks,
Jacques

On Fri, Aug 5, 2011 at 3:53 PM, Todd Lipcon <todd@cloudera.com> wrote:

> Hi Jacques,
>
> Yes, the timestamps are set at the time the MR job runs, not the time
> they're loaded. So, you'll see the values from the job that wrote its
> output most recently.
>
> You can also specify timestamps explicitly for each KeyValue, if you
> prefer.
>
> -Todd
>
> On Fri, Aug 5, 2011 at 2:10 PM, Jacques <whshub@gmail.com> wrote:
> > Can someone confirm that bulk loading HFiles preserves cell timestamps, so
> > that cells from different loads do not overwrite each other?
> >
> > For example:
> > I run MapReduce job A on Monday.
> > I run MapReduce job B on Tuesday.
> >
> > I then run LoadIncrementalHFiles on the output of job B first, followed by
> > the output of job A.
> >
> > Please confirm that, where the outputs of A and B intersect, the values
> > returned will be those from B.
> >
> > Thanks,
> > Jacques
> >
>
>
>
> --
> Todd Lipcon
> Software Engineer, Cloudera
>
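
The behaviour Todd describes can be illustrated with a small toy model. This is only a sketch under stated assumptions: it does not use the HBase API, and ToyTable, bulk_load, get, and the MONDAY/TUESDAY timestamp values are hypothetical names invented for illustration. It assumes each cell carries the timestamp assigned when its MapReduce job ran, and that a default read returns the version with the highest timestamp.

```python
from collections import defaultdict


class ToyTable:
    """Toy model of a table that keeps every timestamped version of a cell."""

    def __init__(self):
        # (row, column) -> {timestamp: value}
        self._cells = defaultdict(dict)

    def bulk_load(self, entries):
        """Merge (row, column, timestamp, value) entries; timestamps are kept as written."""
        for row, column, timestamp, value in entries:
            self._cells[(row, column)][timestamp] = value

    def get(self, row, column):
        """Return the value with the highest timestamp, or None if the cell is absent."""
        versions = self._cells.get((row, column))
        if not versions:
            return None
        return versions[max(versions)]


# Hypothetical timestamps standing in for "when each job ran".
MONDAY, TUESDAY = 1_000, 2_000

job_a = [("row1", "cf:q", MONDAY, "from-A"), ("row2", "cf:q", MONDAY, "only-A")]
job_b = [("row1", "cf:q", TUESDAY, "from-B")]

table = ToyTable()
table.bulk_load(job_b)  # load Tuesday's output first
table.bulk_load(job_a)  # then Monday's output

print(table.get("row1", "cf:q"))  # job B's value, since it has the later timestamp
print(table.get("row2", "cf:q"))  # job A's value, since only job A wrote this cell
```

In this model the load order has no effect on the result, because the comparison is on the timestamp stored with each cell. Writing an explicit timestamp per cell, as Todd suggests is possible per KeyValue, would correspond here to choosing the third tuple field yourself rather than using the job's run time.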
