hadoop-common-dev mailing list archives

From "Clint Morgan (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-2161) getRow() is orders of magnitudes slower than get(), even on rows with one column
Date Tue, 06 Nov 2007 17:36:55 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-2161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Clint Morgan updated HADOOP-2161:
---------------------------------

    Attachment: PerformanceEvaluation-patch.txt

Modifies SequentialReadTest to use getRow, and print the read time to standard out.

> getRow() is orders of magnitudes slower than get(), even on rows with one column
> --------------------------------------------------------------------------------
>
>                 Key: HADOOP-2161
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2161
>             Project: Hadoop
>          Issue Type: Bug
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>         Environment: latest from trunk
>            Reporter: Clint Morgan
>         Attachments: PerformanceEvaluation-patch.txt
>
>
> HTable.getRow(Text) is several orders of magnitude slower than
> HTable.get(Text, Text), even on rows with a single column.
> This problem can be observed with the attached patch to
> PerformanceEvaluation.java, which changes SequentialRead to use getRow
> and prints the time for each read.
> The test can then be run with:
> bin/hbase org.apache.hadoop.hbase.PerformanceEvaluation sequentialRead 1
> On my laptop, the original test (using get()) produces reads that take
> roughly 5-20 ms each. Using getRow(), the reads take 50-2000 ms.
>  
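
For readers without the patch at hand, the per-read timing described above could be sketched roughly as follows. This is not the attached patch: the class name ReadTimer, the method timeReads, and the two no-op readers are hypothetical stand-ins, and the real test would call HTable.get(Text, Text) and HTable.getRow(Text) where the placeholders are.

```java
import java.util.function.IntConsumer;

// Hypothetical sketch of a per-read timer; not the code in PerformanceEvaluation-patch.txt.
public class ReadTimer {

    // Calls `read` once per row index, prints how long each call took,
    // and returns the per-read times in milliseconds.
    static long[] timeReads(String label, int rows, IntConsumer read) {
        long[] millis = new long[rows];
        for (int i = 0; i < rows; i++) {
            long start = System.nanoTime();
            read.accept(i);
            millis[i] = (System.nanoTime() - start) / 1_000_000L;
            System.out.println(label + " row " + i + ": " + millis[i] + " ms");
        }
        return millis;
    }

    public static void main(String[] args) {
        // Placeholders (assumption): in the real test these would wrap
        // HTable.get(Text, Text) and HTable.getRow(Text) respectively.
        IntConsumer viaGet = row -> { };
        IntConsumer viaGetRow = row -> { };

        timeReads("get", 5, viaGet);
        timeReads("getRow", 5, viaGetRow);
    }
}
```

Timing each read separately, rather than the whole run, makes it easier to see whether every getRow call is slow or only a few outliers are.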

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

