hadoop-common-dev mailing list archives

From "Clint Morgan (JIRA)" <j...@apache.org>
Subject [jira] Created: (HADOOP-2161) getRow() is orders of magnitudes slower than get(), even on rows with one column
Date Tue, 06 Nov 2007 17:34:51 GMT
getRow() is orders of magnitudes slower than get(), even on rows with one column
--------------------------------------------------------------------------------

                 Key: HADOOP-2161
                 URL: https://issues.apache.org/jira/browse/HADOOP-2161
             Project: Hadoop
          Issue Type: Bug
          Components: contrib/hbase
    Affects Versions: 0.16.0
         Environment: latest from trunk
            Reporter: Clint Morgan


HTable.getRow(Text) is several orders of magnitude slower than
HTable.get(Text, Text), even on rows with a single column.

This problem can be reproduced with the attached patch to
PerformanceEvaluation.java, which changes SequentialRead to use getRow
and prints the time for each read.

The test can then be run with:

bin/hbase org.apache.hadoop.hbase.PerformanceEvaluation sequentialRead 1

On my laptop, reads in the original test (using get()) take on the order of
5-20 ms. With getRow(), reads take 50-2000 ms.
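
For anyone who wants to take the same kind of measurement without the
patch, here is a minimal sketch of per-read timing. It is not the attached
patch and does not call HBase: the class name ReadTimingSketch, the helpers
timeReads and report, the row key format, and the two stand-in readers are
all hypothetical. A real comparison would replace the stand-ins with the
actual get() and getRow() calls.

```java
import java.util.Collections;
import java.util.function.Function;

public class ReadTimingSketch {

    /** Calls reader once per row key "0".."count-1" and returns elapsed ms per call. */
    static long[] timeReads(Function<String, Object> reader, int count) {
        long[] elapsedMs = new long[count];
        for (int i = 0; i < count; i++) {
            String row = String.valueOf(i); // hypothetical row key format
            long start = System.nanoTime();
            reader.apply(row);
            elapsedMs[i] = (System.nanoTime() - start) / 1_000_000;
        }
        return elapsedMs;
    }

    /** Prints one line per read so slow outliers are visible, not averaged away. */
    static void report(String label, long[] elapsedMs) {
        for (int i = 0; i < elapsedMs.length; i++) {
            System.out.println(label + " row " + i + ": " + elapsedMs[i] + " ms");
        }
    }

    public static void main(String[] args) {
        // Stand-ins only (assumption): swap in the real single-cell and
        // whole-row client calls to reproduce the comparison.
        Function<String, Object> singleCellRead = row -> row.hashCode();
        Function<String, Object> wholeRowRead = row -> Collections.singletonMap("col", row);

        report("single-cell read", timeReads(singleCellRead, 10));
        report("whole-row read", timeReads(wholeRowRead, 10));
    }
}
```

Printing each read separately, rather than a total, matches the wide
50-2000 ms spread reported above, which a single average would hide.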
 


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

