hbase-dev mailing list archives

From "stack (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HBASE-875) Use MurmurHash instead of JenkinsHash
Date Mon, 08 Sep 2008 17:51:44 GMT

    [ https://issues.apache.org/jira/browse/HBASE-875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12629229#action_12629229 ]

stack commented on HBASE-875:
-----------------------------

Sounds great, Andrzej.

I'd suggest not replacing Jenkins outright; could MurmurHash be offered as an option instead?
We can't simply swap out the Jenkins hashing, since the hash is used in filenames out on the
filesystem; we'd have to run a migration to move from one hash type to the other.
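
A rough sketch of what "offered as an option" could look like, with Jenkins kept as the
default so that filenames already on the filesystem stay valid. Every name here
(HashSelector, HashFunction, the "jenkins" and "murmur" type strings) is made up for
illustration, and the two hash bodies are placeholders, not the real algorithms:

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Sketch only: HashSelector, HashFunction and the type names are hypothetical,
// not existing HBase classes or configuration values.
final class HashSelector {

    /** Minimal hash abstraction so callers do not depend on one algorithm. */
    interface HashFunction {
        int hash(byte[] bytes, int seed);
    }

    /** Jenkins stays the default, so nothing changes unless a type is requested. */
    static final String DEFAULT_TYPE = "jenkins";

    private static final Map<String, HashFunction> REGISTRY = new HashMap<>();

    static {
        // PLACEHOLDERS: these are NOT JenkinsHash or MurmurHash. They only stand in
        // for the real implementations so the selection logic can be run.
        REGISTRY.put("jenkins", (bytes, seed) -> placeholder(bytes, seed, 31));
        REGISTRY.put("murmur", (bytes, seed) -> placeholder(bytes, seed, 0x5bd1e995));
    }

    /** Simple multiplicative placeholder hash; see the note above. */
    static int placeholder(byte[] bytes, int seed, int multiplier) {
        int h = seed;
        for (byte b : bytes) {
            h = h * multiplier + (b & 0xff);
        }
        return h;
    }

    /** Returns the hash for the given type name; null selects the default. */
    static HashFunction forType(String type) {
        String key = (type == null) ? DEFAULT_TYPE : type.toLowerCase(Locale.ROOT);
        HashFunction f = REGISTRY.get(key);
        if (f == null) {
            throw new IllegalArgumentException("Unknown hash type: " + type);
        }
        return f;
    }

    public static void main(String[] args) {
        // With no type given, the default (Jenkins placeholder) is used.
        HashFunction h = forType(args.length > 0 ? args[0] : null);
        System.out.println(h.hash("row-key".getBytes(), 0));
    }
}
```

Anything that bakes the hash into a filename would keep using whichever type it was written
with; switching the type for existing data is still the migration described above.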

> Use MurmurHash instead of JenkinsHash
> -------------------------------------
>
>                 Key: HBASE-875
>                 URL: https://issues.apache.org/jira/browse/HBASE-875
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: util
>    Affects Versions: 0.19.0
>            Reporter: Andrzej Bialecki 
>
> I recently ported the MurmurHash (http://murmurhash.googlepages.com/) to Java, and according
> to my tests it's roughly 5 times faster than the current version of JenkinsHash in the trunk/.
> According to the author (and other analysts at comp.sci.crypt) this hash has an excellent
> avalanche behavior, and low collision rate. I propose to either replace the JenkinsHash or
> add this hash as an option to be used in BloomFilter-s and related classes.
> If your opinion is positive, I'll prepare a patch. The Java implementation of the hash
> can be found here: http://www.getopt.org/murmur/MurmurHash.java

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

