hbase-issues mailing list archives

From "Lars Hofhansl (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-12988) [Replication]Parallel apply edits on row-level
Date Tue, 16 Jun 2015 03:35:01 GMT

    [ https://issues.apache.org/jira/browse/HBASE-12988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14587403#comment-14587403 ]

Lars Hofhansl commented on HBASE-12988:
---------------------------------------

Thanks [~apurtell]!

There's also another idea I had in the meantime: the sink logic I implemented to deal
with cluster IDs (to avoid replication cycles) already groups edits by table and list<clusterid>
and then applies them in batches. We could just use the same logic on the sink, with the addition
of also grouping by row. Maybe it's OK, performance-wise, to break up the WALEdits and
reassemble them. That would certainly make it easier later to stream per-row edits to sinks.
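
To make the grouping idea concrete, here is a minimal sketch in plain Java. It is not the actual HBase sink code: the `Edit` class, its fields, and `groupByRow` are hypothetical stand-ins, and real WALEdits, cells, and cluster IDs are reduced to strings. It only illustrates breaking edits apart and reassembling them into batches keyed by table, cluster ID list, and row.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: none of these types exist in HBase.
final class RowGroupingSketch {

    /** Hypothetical stand-in for one cell-level change taken out of a WALEdit. */
    static final class Edit {
        final String table;
        final List<String> clusterIds; // clusters that have already seen this edit
        final String row;
        final String value;

        Edit(String table, List<String> clusterIds, String row, String value) {
            this.table = table;
            this.clusterIds = clusterIds;
            this.row = row;
            this.value = value;
        }
    }

    /**
     * Groups edits by (table, clusterIds, row). A LinkedHashMap keeps groups in
     * first-seen order, and each group's list keeps the edits in arrival order,
     * so the relative order of edits to the same row is preserved.
     */
    static Map<List<Object>, List<Edit>> groupByRow(List<Edit> edits) {
        Map<List<Object>, List<Edit>> batches = new LinkedHashMap<>();
        for (Edit e : edits) {
            // A List is used as the composite key because List.equals/hashCode
            // compare element by element.
            List<Object> key = Arrays.<Object>asList(e.table, e.clusterIds, e.row);
            batches.computeIfAbsent(key, k -> new ArrayList<>()).add(e);
        }
        return batches;
    }
}
```

Each resulting batch touches a single row, so under this sketch's assumptions the batches could be handed to separate workers without reordering edits within a row.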

> [Replication]Parallel apply edits on row-level
> ----------------------------------------------
>
>                 Key: HBASE-12988
>                 URL: https://issues.apache.org/jira/browse/HBASE-12988
>             Project: HBase
>          Issue Type: Improvement
>          Components: Replication
>            Reporter: hongyu bi
>            Assignee: Lars Hofhansl
>         Attachments: 12988.txt, HBASE-12988-0.98.patch, ParallelReplication-v2.txt
>
>
> We can apply edits to the slave cluster in parallel at the table level to speed up replication.
> Update: per the conversation below, it's better to apply edits in parallel at the row level.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
