hbase-dev mailing list archives

From "Andrew Purtell (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-1576) TIF needs to be able to set scanner caching size for smaller row tables & performance
Date Tue, 23 Jun 2009 22:03:07 GMT

    [ https://issues.apache.org/jira/browse/HBASE-1576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12723327#action_12723327 ]

Andrew Purtell commented on HBASE-1576:
---------------------------------------

The HBaseConfiguration object is created from the JobConf (TableInputFormat.java, line ~58),
so isn't this sufficient?

{code}
JobConf job = new JobConf();
// ...
// Scanner caching size, picked up through the HBaseConfiguration built from this JobConf
job.set("hbase.client.scanner.caching", "100");
// ...
{code}

I have no objection to adding a convenience method, though.
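
For illustration, such a convenience method might look roughly like the sketch below. The class and method names are hypothetical and not part of HBase. So that the sketch compiles without Hadoop on the classpath, java.util.Properties stands in for JobConf. The default of 1 is taken from the issue description.

```java
import java.util.Properties;

// Hypothetical helper, not an existing HBase class.
// Properties is a stand-in for Hadoop's JobConf.
final class ScannerCachingConf {
    // Property name as used in the comment above.
    static final String SCANNER_CACHING_KEY = "hbase.client.scanner.caching";

    private ScannerCachingConf() {}

    // Stores the scanner caching size on the job configuration.
    static void setScannerCaching(Properties job, int rows) {
        if (rows < 1) {
            throw new IllegalArgumentException("scanner caching must be >= 1, got " + rows);
        }
        job.setProperty(SCANNER_CACHING_KEY, Integer.toString(rows));
    }

    // Reads the value back, falling back to 1 (the default named in the issue).
    static int getScannerCaching(Properties job) {
        return Integer.parseInt(job.getProperty(SCANNER_CACHING_KEY, "1"));
    }
}
```

Against a real JobConf, the body of the setter would reduce to job.setInt(SCANNER_CACHING_KEY, rows). The benefit over a bare job.set(...) call is the range check and a single place for the property name.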


> TIF needs to be able to set scanner caching size for smaller row tables & performance
> -------------------------------------------------------------------------------------
>
>                 Key: HBASE-1576
>                 URL: https://issues.apache.org/jira/browse/HBASE-1576
>             Project: Hadoop HBase
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.20.0
>            Reporter: ryan rawson
>            Priority: Critical
>             Fix For: 0.20.0
>
>
> TIF goes with the default scanner caching size (1).  When each row is small and processed
> very quickly, this limits overall performance.  By setting a higher scanner caching level
> you can achieve 100x+ the performance with the exact same map-reduce job and table.
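
To make the arithmetic behind that description concrete, here is a simplified model of how many fetch round trips a scan needs at different caching sizes. It assumes each fetch returns up to `caching` rows and ignores scanner open/close calls, region boundaries and row size, so it is a rough estimate and not a measurement. The class name and row count are made up for the example.

```java
// Hypothetical model class, not HBase code.
final class ScannerRoundTrips {
    private ScannerRoundTrips() {}

    // Number of fetches needed to read `rows` rows when each fetch
    // returns at most `caching` rows (ceiling division).
    static long fetches(long rows, int caching) {
        if (rows < 0 || caching < 1) {
            throw new IllegalArgumentException("rows must be >= 0 and caching >= 1");
        }
        return (rows + caching - 1) / caching;
    }

    public static void main(String[] args) {
        long rows = 1_000_000L; // illustrative table size
        for (int caching : new int[] {1, 10, 100, 1000}) {
            System.out.printf("caching=%d -> %d fetches%n", caching, fetches(rows, caching));
        }
    }
}
```

In this model the fetch count drops by the caching factor. How much of that shows up as wall-clock speedup depends on how much of the per-row time is round-trip latency, which is highest when rows are small and cheap to process.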


