hbase-user mailing list archives

From Mike Axiak <m...@axiak.net>
Subject Re: Mapping Over Cells
Date Mon, 11 May 2015 17:12:00 GMT
Hi Ryan,

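You can call setBatch() on the Scan to cap the number of cells returned in
each Result, so your map task receives partial rows. (A single Result is no
longer guaranteed to hold all cells for a given row, so map() may be called
several times with the same row key.)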
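
To make the partial-row behaviour concrete, here is a minimal plain-Java
model of it. It uses no HBase classes: partialResults() and the "cf:qN" cell
names are hypothetical stand-ins for illustration, and the chunking only
approximates what a batched scan hands to the mapper. In a real job the
batch size would be set with setBatch() on the Scan passed to the table
mapper.

```java
import java.util.ArrayList;
import java.util.List;

// Plain-Java model only; no HBase classes are used here.
class BatchSketch {

    // Split one row's cells into chunks of at most batchSize cells,
    // mimicking how a batched scan returns a wide row in pieces.
    static List<List<String>> partialResults(List<String> rowCells, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<List<String>> results = new ArrayList<>();
        for (int start = 0; start < rowCells.size(); start += batchSize) {
            int end = Math.min(start + batchSize, rowCells.size());
            // Copy the slice so each partial result is independent of the source list.
            results.add(new ArrayList<>(rowCells.subList(start, end)));
        }
        return results;
    }

    public static void main(String[] args) {
        // A hypothetical row with 7 cells.
        List<String> row = new ArrayList<>();
        for (int i = 0; i < 7; i++) {
            row.add("cf:q" + i);
        }
        // With a batch size of 3, the mapper would see this row three times.
        for (List<String> partial : partialResults(row, 3)) {
            System.out.println("row1 -> " + partial);
        }
    }
}
```

The point of the model is that only one chunk needs to be in memory at a
time, which is what avoids the OOM on very wide rows. Any per-row state in
the mapper has to tolerate seeing the same row key more than once.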

On Mon, May 11, 2015 at 6:24 PM, Webb, Ryan L. <Ryan.Webb@jhuapl.edu> wrote:

> Looks like the duplicate made it through.
>
> Sorry about that!
>
> -----Original Message-----
> From: Webb, Ryan L. [mailto:Ryan.Webb@jhuapl.edu]
> Sent: Monday, May 11, 2015 11:15 AM
> To: user@hbase.apache.org
> Subject: Mapping Over Cells
>
> Hello,
>
> We have a table in HBase that has very large rows and it goes OOM when the
> table mapper attempts to read the entire row into a result.
>
> We would like to be able to map over each Cell in the table as a solution
> and it is what we are doing in the map anyway.
> Is this possible? Like the default behavior for Accumulo?
>
> We looked at the settings on Scan and didn't really see anything and the
> source code of Result looks like it wraps an array of cells so the data is
> already loaded at that point.
> We are using HBase 0.98.1 and the Hadoop 2 APIs.
>
> Thanks
> Ryan Webb
>
