hbase-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Vikas Vishwakarma (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-17877) Replace/improve HBase's byte[] comparator
Date Sat, 08 Apr 2017 11:39:41 GMT

    [ https://issues.apache.org/jira/browse/HBASE-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15961794#comment-15961794
] 

Vikas Vishwakarma commented on HBASE-17877:
-------------------------------------------

I did some tests with JMH. So overall it looks like the hadoop version gives better throughput
with random byte arrays, but it hits a worst case when two the two byte arrays are almost
similar and have just 1 byte diff like say last byte in which case it gives 30-40% lower throughput.
I will do some more experiments with the guava version also. The below result table is with
Random byte arrays. I have used 20 warmup and 20 measurement cycles of 1 sec each. I will
leave a longer running test for the weekend. 

|Byte Array Size|byte array diff index|Mean HBase (ops/ms)|Hadoop (ops/ms) |%diff|
|----|----|----|----|----|
|Random|4|440977.678|439558.547|0|
|Random|8|437781.625|439089.09|0|
|Random|16|441573.466|439052.733|-1 |
|Random|20|433731.957|449067.897|4 |
|Random|32|441486.237|448821.457|2 |
|Random|50|432720.686|447027.454|3 |
|Random|64|436713.574|440450.684|1 |
|Random|100|430155.538|439399.306|2 |
|Random|128|441413.074|440321.381|0 |
|Random|200|450140.537|448164.678|0 | 
|Random|256|376015.788|457719.989|22 |
|Random|512|437394.161|438975.893|0 | 
|Random|1024|442672.023|440269.773|-1 |
|Random|2048|441500.328|439325.611|0 |
|Random|4096|428583.365|439881.571|3 |
|Random|8192|449889.732|439456.918|-2 |
|Random|16384|439970.375|439271.189|0 |

> Replace/improve HBase's byte[] comparator
> -----------------------------------------
>
>                 Key: HBASE-17877
>                 URL: https://issues.apache.org/jira/browse/HBASE-17877
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Lars Hofhansl
>            Assignee: Vikas Vishwakarma
>         Attachments: 17877-1.2.patch, 17877-v2-1.3.patch, ByteComparatorJiraHBASE-17877.pdf
>
>
> [~vik.karma] did some extensive tests and found that Hadoop's version is faster - dramatically
faster in some cases.
> Patch forthcoming.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message