jackrabbit-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Amit Jain (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (JCR-3730) Use object keys to create partitions in S3 automatically
Date Fri, 21 Feb 2014 05:32:19 GMT

    [ https://issues.apache.org/jira/browse/JCR-3730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13907977#comment-13907977

Amit Jain commented on JCR-3730:

Hi Shashank,

Removing just the prefix 'dataStore_' from the key should work just fine. 
According to the recommendations from Amazon, the first 3-4 characters should be uniformly
distributed. So, I don't think AmazonS3 is '-' aware when distributing keys. 

> Use object keys to create partitions in S3 automatically
> --------------------------------------------------------
>                 Key: JCR-3730
>                 URL: https://issues.apache.org/jira/browse/JCR-3730
>             Project: Jackrabbit Content Repository
>          Issue Type: Sub-task
>          Components: jackrabbit-core
>    Affects Versions: 2.7.4
>            Reporter: Shashank Gupta
>             Fix For: 2.7.5
> To improve performance of S3, it is recommended to use object keys which enables to save
data in multiple partitions. [1]
> The current key format dataStore_SHA1_HASH put all data in single partition. It is recommended
to remove dataStore_ prefix and split SHA1_HASH to enable randomness in prefix. 
> for e.g. if older key format is dataStore_004cb70c8f87d78f04da41e7547cb434094089ea. change
this key format to 004c-b70c8f87d78f04da41e7547cb434094089ea
> *Also consider upgrade scenario to migrate older key format data
> http://docs.aws.amazon.com/AmazonS3/latest/dev/request-rate-perf-considerations.html

This message was sent by Atlassian JIRA

View raw message