cassandra-commits mailing list archives

From "Anuj Wadehra (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (CASSANDRA-10446) Run repair with down replicas
Date Wed, 20 Jan 2016 18:06:39 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-10446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15109045#comment-15109045 ]

Anuj Wadehra commented on CASSANDRA-10446:
------------------------------------------

I don't think this option will do the job. Take the scenario where one node fails in a 20-node
cluster: which nodes would you pass to -hosts, and how would you ensure that the entire ring
is repaired?

Suppose host20 failed. You would run a full repair with the option "-hosts host1,host2,...,host19"
on all 19 healthy nodes. This is unrealistic. Clusters generally use repair -pr to repair the
ring. With RF=5, running a full repair on each of the 19 nodes would take roughly 5 times
longer, because each node repairs every range it replicates, not just its primary range.
Moreover, it requires special planning and manual intervention for a single node failure,
which is undesirable in a distributed, fault-tolerant system.
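
To make the 5x estimate concrete, here is a rough sketch that counts how many range repairs
each approach triggers. It is a simplified model, not Cassandra's actual behaviour: it assumes
one token per node, SimpleStrategy-like placement (each range is replicated on its primary
owner and the next RF-1 nodes clockwise), and nodes numbered 0..19 with index 19 standing in
for host20. The function names are made up for illustration.

```python
from collections import Counter

def replicas(primary, num_nodes, rf):
    # Assumption: a range lives on its primary owner plus the next rf-1 nodes clockwise.
    return [(primary + i) % num_nodes for i in range(rf)]

def repair_counts(num_nodes, rf, down, primary_only):
    # Count how often each range (identified by its primary owner) gets repaired
    # when repair is started on every node that is still up.
    counts = Counter()
    for node in range(num_nodes):
        if node in down:
            continue  # repair cannot be started on a dead node
        for primary in range(num_nodes):
            if node not in replicas(primary, num_nodes, rf):
                continue  # this node holds no copy of the range
            if primary_only and primary != node:
                continue  # -pr style: only the node's own primary range
            counts[primary] += 1
    return counts

NUM_NODES, RF, DOWN = 20, 5, {19}  # index 19 plays the role of host20

full = repair_counts(NUM_NODES, RF, DOWN, primary_only=False)
pr = repair_counts(NUM_NODES, RF, DOWN, primary_only=True)

total_full = sum(full.values())  # range repairs with full repair on 19 nodes
total_pr = sum(pr.values())      # range repairs with -pr on 19 nodes
missed_by_pr = [r for r in range(NUM_NODES) if pr[r] == 0]

print(total_full, total_pr, missed_by_pr)
```

Under these assumptions the full-repair approach does five times as many range repairs as
-pr, while -pr alone never touches the primary range of the failed node.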

Another option would be to run repair -pr on the 19 healthy nodes and then separately repair
the ranges for which the failed node was responsible. But that won't work, because the -pr
and -hosts options don't work together.
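
For reference, the range that -pr on the 19 healthy nodes would leave out is the failed node's
primary range, which runs from the previous node's token (exclusive) up to its own token
(inclusive). A minimal sketch of that lookup follows; the tokens are invented and one token
per node is assumed, so this is only an illustration of the idea.

```python
def primary_range(token_by_host, host):
    # Sort hosts by token to get their order around the ring.
    ring = sorted(token_by_host.items(), key=lambda kv: kv[1])
    idx = [h for h, _ in ring].index(host)
    # The primary range is (previous token, own token]; index -1 wraps around
    # the ring for the host with the lowest token.
    start = ring[idx - 1][1]
    end = ring[idx][1]
    return start, end

# Hypothetical tokens: host1 -> 100, host2 -> 200, ..., host20 -> 2000.
tokens = {f"host{i}": i * 100 for i in range(1, 21)}

failed_range = primary_range(tokens, "host20")
wrapped_range = primary_range(tokens, "host1")
print(failed_range, wrapped_range)
```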

Can you suggest a better way to use the -hosts option to address this issue?


> Run repair with down replicas
> -----------------------------
>
>                 Key: CASSANDRA-10446
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-10446
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: sankalp kohli
>            Priority: Minor
>             Fix For: 3.x
>
>
> We should have an option of running repair when replicas are down. We can call it -force.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
