hbase-issues mailing list archives

From "Jimmy Xiang (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-9915) Severe performance bug: isSeeked() in EncodedScannerV2 is always false
Date Thu, 07 Nov 2013 19:51:17 GMT

    [ https://issues.apache.org/jira/browse/HBASE-9915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13816348#comment-13816348 ]

Jimmy Xiang commented on HBASE-9915:
------------------------------------

Great. +1.

There is another performance-related issue with the encoded scanner.  DataBlockEncoding#isCorrectEncoder
checks the wrong class, so it always returns false and a new encoder is created for
each encoded data block. I will fix this in HBASE-9870.
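
For illustration only, here is a minimal sketch of how a check like this can end up always returning false. The comment above does not say which class is compared, so the sketch assumes the enum constant's own class is compared with the encoder's class. All names (EncoderSketch, EncodingSketch, isCorrectEncoderBroken, isCorrectEncoderFixed) and the ids are hypothetical and are not the HBase API.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-ins for encoder implementations; not HBase classes.
interface EncoderSketch {}
class PrefixEncoderSketch implements EncoderSketch {}
class DiffEncoderSketch implements EncoderSketch {}

// Hypothetical enum that maps an id to an encoder instance.
enum EncodingSketch {
    PREFIX((short) 2, new PrefixEncoderSketch()),
    DIFF((short) 3, new DiffEncoderSketch());

    private final short id;
    private final EncoderSketch encoder;
    private static final Map<Short, EncodingSketch> BY_ID = new HashMap<>();

    static {
        for (EncodingSketch e : values()) {
            BY_ID.put(e.id, e);
        }
    }

    EncodingSketch(short id, EncoderSketch encoder) {
        this.id = id;
        this.encoder = encoder;
    }

    private static EncodingSketch lookup(short id) {
        EncodingSketch encoding = BY_ID.get(id);
        if (encoding == null) {
            throw new IllegalArgumentException("Unknown encoder id: " + id);
        }
        return encoding;
    }

    // Assumed bug: compares the enum's class with the encoder's class.
    // An enum class is never equal to an encoder class, so this is always false.
    static boolean isCorrectEncoderBroken(EncoderSketch candidate, short id) {
        return lookup(id).getClass().equals(candidate.getClass());
    }

    // Possible correction: compare against the class of the encoder the enum holds.
    static boolean isCorrectEncoderFixed(EncoderSketch candidate, short id) {
        return lookup(id).encoder.getClass().equals(candidate.getClass());
    }
}

class IsCorrectEncoderDemo {
    public static void main(String[] args) {
        EncoderSketch cached = new PrefixEncoderSketch();
        System.out.println(EncodingSketch.isCorrectEncoderBroken(cached, (short) 2));
        System.out.println(EncodingSketch.isCorrectEncoderFixed(cached, (short) 2));
    }
}
```

With a check like the broken variant, a caller that tests whether its cached encoder can be reused never sees true, so it allocates a new encoder for every block.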

> Severe performance bug: isSeeked() in EncodedScannerV2 is always false
> ----------------------------------------------------------------------
>
>                 Key: HBASE-9915
>                 URL: https://issues.apache.org/jira/browse/HBASE-9915
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Lars Hofhansl
>            Assignee: Lars Hofhansl
>             Fix For: 0.98.0, 0.96.1, 0.94.14
>
>         Attachments: 9915-0.94-v2.txt, 9915-0.94.txt, profile.png
>
>
> While debugging why reseek is so slow, I found that it is quite broken for encoded scanners.
> The problem is this:
> AbstractScannerV2.reseekTo(...) calls isSeeked() to check whether the scanner has been
> seeked. If it has, it checks whether the KV we want to seek to is in the current block;
> otherwise it always consults the index blocks again.
> isSeeked() checks the blockBuffer member, which is not used by EncodedScannerV2, and thus
> always returns false, which in turn causes an index lookup for each reseek.
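
The mechanism described in the quoted report can be sketched in a few lines. This is a simplified model, not the HBase source: the class names, the integer keys, the fixed block size of 10, and the indexLookups counter are all assumptions made for illustration. Only the shape of the problem follows the report: the base class answers isSeeked() from a blockBuffer field that the encoded subclass never sets.

```java
import java.nio.ByteBuffer;

// Simplified stand-in for the base scanner; not the real HBase class.
abstract class BaseScannerSketch {
    protected ByteBuffer blockBuffer; // only a non-encoded scanner would set this
    int indexLookups = 0;             // counts simulated index consultations

    boolean isSeeked() {
        return blockBuffer != null;
    }

    abstract boolean currentBlockContains(int key);
    abstract void loadBlockFor(int key);

    void reseekTo(int key) {
        if (isSeeked() && currentBlockContains(key)) {
            return; // fast path: stay inside the current block
        }
        indexLookups++; // slow path: consult the index again
        loadBlockFor(key);
    }
}

// Encoded scanner that tracks its position in its own field and never
// touches blockBuffer, so the inherited isSeeked() is always false.
class BrokenEncodedScannerSketch extends BaseScannerSketch {
    protected int currentBlock = -1; // hypothetical: block n holds keys 10n..10n+9

    @Override
    boolean currentBlockContains(int key) {
        return currentBlock >= 0 && key / 10 == currentBlock;
    }

    @Override
    void loadBlockFor(int key) {
        currentBlock = key / 10;
    }
}

// One possible correction: answer isSeeked() from the state the
// encoded scanner actually maintains.
class FixedEncodedScannerSketch extends BrokenEncodedScannerSketch {
    @Override
    boolean isSeeked() {
        return currentBlock >= 0;
    }
}

class ReseekDemo {
    public static void main(String[] args) {
        BaseScannerSketch broken = new BrokenEncodedScannerSketch();
        BaseScannerSketch fixed = new FixedEncodedScannerSketch();
        for (int key = 0; key < 30; key++) {
            broken.reseekTo(key);
            fixed.reseekTo(key);
        }
        System.out.println("broken index lookups: " + broken.indexLookups);
        System.out.println("fixed index lookups:  " + fixed.indexLookups);
    }
}
```

In this model, 30 sequential reseeks over three blocks cost 30 index lookups with the inherited isSeeked() and 3 with the overridden one, which is the per-reseek index lookup the report describes.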



--
This message was sent by Atlassian JIRA
(v6.1#6144)
