hbase-user mailing list archives

From Ted Yu <yuzhih...@gmail.com>
Subject Re: Mapping Over Cells
Date Mon, 11 May 2015 17:27:15 GMT
Please note that a Filter which actively implements filterRowCells(List) or
filterRow() cannot be combined with setBatch() on the same Scan object: such
a Filter needs to see the whole row, while setBatch() returns partial rows.

FYI
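
To make the two points in this thread concrete, here is a minimal plain-Java
sketch. It is a model, not HBase code: BatchSketch, ScanModel, RowAwareFilter
and partialResults are hypothetical names invented for illustration. It
assumes that a batch size of N caps each returned Result at N cells of one
row, and that setting a batch is rejected when the filter reports that it
needs whole rows. In HBase itself the corresponding calls are
Scan.setBatch(int) and Filter.hasFilterRow(), and the rejection should
surface as an IncompatibleFilterException; check the exact behavior against
your version.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class BatchSketch {

    /** Hypothetical stand-in for a filter; models only whether it needs whole rows. */
    interface RowAwareFilter {
        boolean hasFilterRow();
    }

    /** Hypothetical stand-in for a scan: an optional filter plus a batch size. */
    static class ScanModel {
        private final RowAwareFilter filter; // may be null (no filter set)
        private int batch = -1;              // -1 models "return whole rows"

        ScanModel(RowAwareFilter filter) {
            this.filter = filter;
        }

        /** Rejects batching when the filter must see complete rows. */
        void setBatch(int batch) {
            if (filter != null && filter.hasFilterRow()) {
                throw new IllegalStateException(
                    "cannot batch a scan whose filter needs whole rows");
            }
            this.batch = batch;
        }

        int getBatch() {
            return batch;
        }
    }

    /** Splits one row's cells into partial results of at most 'batch' cells each. */
    static <T> List<List<T>> partialResults(List<T> rowCells, int batch) {
        List<List<T>> out = new ArrayList<>();
        if (batch <= 0) {
            // No limit: the whole row comes back as a single result.
            out.add(new ArrayList<>(rowCells));
            return out;
        }
        for (int i = 0; i < rowCells.size(); i += batch) {
            int end = Math.min(i + batch, rowCells.size());
            out.add(new ArrayList<>(rowCells.subList(i, end)));
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> row = Arrays.asList("c1", "c2", "c3", "c4", "c5");
        // A five-cell row with batch = 2 arrives as three partial results.
        System.out.println(partialResults(row, 2)); // prints [[c1, c2], [c3, c4], [c5]]
    }
}
```

The consequence for a map task is that the same row key can reach the mapper
several times, once per partial result, so any per-row state in the mapper
has to tolerate that.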

On Mon, May 11, 2015 at 10:12 AM, Mike Axiak <mike@axiak.net> wrote:

> Hi Ryan,
>
> You can call setBatch() on the scan to get partial rows returned in your
> map task. (The Result object won't be guaranteed to have all cells for a
> given row.)
>
> On Mon, May 11, 2015 at 6:24 PM, Webb, Ryan L. <Ryan.Webb@jhuapl.edu>
> wrote:
>
> > Looks like the duplicate made it through.
> >
> > Sorry about that!
> >
> > -----Original Message-----
> > From: Webb, Ryan L. [mailto:Ryan.Webb@jhuapl.edu]
> > Sent: Monday, May 11, 2015 11:15 AM
> > To: user@hbase.apache.org
> > Subject: Mapping Over Cells
> >
> > Hello,
> >
> > We have a table in HBase with very large rows, and the job runs out of
> > memory (OOM) when the table mapper attempts to read an entire row into a
> > Result.
> >
> > As a solution, we would like to map over each Cell in the table
> > individually, which is what we are doing in the map anyway.
> > Is this possible, as in the default behavior for Accumulo?
> >
> > We looked at the settings on Scan and didn't really see anything, and the
> > source code of Result looks like it wraps an array of cells, so the data
> > is already loaded at that point.
> > We are using HBase 0.98.1 and the Hadoop 2 APIs.
> >
> > Thanks
> > Ryan Webb
> >
>
