cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anuj Wadehra (JIRA)" <>
Subject [jira] [Commented] (CASSANDRA-10446) Run repair with down replicas
Date Wed, 20 Jan 2016 18:06:39 GMT


Anuj Wadehra commented on CASSANDRA-10446:

I think this option won't do the job. Referring to scenario, when a node failed in 20 node
cluster, what nodes will you set in -hosts and how will you ensure that the entire ring is

Suppose host20 failed, you would run "full repair with -hosts hosts1,host2...host19 option"
on all 19 healthy nodes.This option is unrealistic. Clusters generally use repair -pr option
to repair the cluster. With RF=5, Repair time would be 5 times more for 19 nodes. Moreover,
it requires special planning and manual intervention with just one node failure which should
be undesirable in a distributed fault tolerant system.

Another option would be to run repair -pr on 19 nodes and run repair separately on the ranges
for which the failed node was responsible. But that wont work because -pr and -hosts options
don't work together. 

Can you provide a better way to use -hosts option for addressing the issue?

> Run repair with down replicas
> -----------------------------
>                 Key: CASSANDRA-10446
>                 URL:
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: sankalp kohli
>            Priority: Minor
>             Fix For: 3.x
> We should have an option of running repair when replicas are down. We can call it -force.

This message was sent by Atlassian JIRA

View raw message