hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-3603) Enable client-side caching for scans on HBase
Date Tue, 16 Jul 2013 09:12:49 GMT

    [ https://issues.apache.org/jira/browse/HIVE-3603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13709624#comment-13709624
] 

Hudson commented on HIVE-3603:
------------------------------

FAILURE: Integrated in Hive-trunk-hadoop2-ptest #16 (See [https://builds.apache.org/job/Hive-trunk-hadoop2-ptest/16/])
HIVE-3603 Enable client-side caching for scans on HBase (Navis Ryu via EGC)

Submitted by:	Navis Ryu
Reviewed by:	Edward Capriolo (ecapriolo: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1503544)
* /hive/trunk/hbase-handler/src/java/org/apache/hadoop/hive/hbase/HBaseSerDe.java
* /hive/trunk/hbase-handler/src/java/org/apache/hadoop/hive/hbase/HBaseStorageHandler.java
* /hive/trunk/hbase-handler/src/java/org/apache/hadoop/hive/hbase/HiveHBaseTableInputFormat.java
* /hive/trunk/hbase-handler/src/test/queries/positive/hbase_scan_params.q
* /hive/trunk/hbase-handler/src/test/results/positive/hbase_scan_params.q.out

                
> Enable client-side caching for scans on HBase
> ---------------------------------------------
>
>                 Key: HIVE-3603
>                 URL: https://issues.apache.org/jira/browse/HIVE-3603
>             Project: Hive
>          Issue Type: Improvement
>          Components: HBase Handler
>            Reporter: Karthik Ranganathan
>            Assignee: Navis
>            Priority: Minor
>             Fix For: 0.12.0
>
>         Attachments: HIVE-3603.D7761.1.patch
>
>
> HBaseHandler sets up a TableInputFormat MR job against HBase to read data in. The underlying
implementation (in HBaseHandler.java) makes an RPC call per row-key, which makes it very inefficient.
Need to specify a client side cache size on the scan.
> Note that HBase currently only supports num-rows based caching (no way to specify a memory
limit). Created HBASE-6770 to address this.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message