Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 50213 invoked from network); 21 May 2010 17:43:27 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 21 May 2010 17:43:27 -0000 Received: (qmail 32555 invoked by uid 500); 21 May 2010 17:43:26 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 32527 invoked by uid 500); 21 May 2010 17:43:26 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 32519 invoked by uid 99); 21 May 2010 17:43:26 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 21 May 2010 17:43:26 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of rcoli@digg.com designates 209.85.222.174 as permitted sender) Received: from [209.85.222.174] (HELO mail-pz0-f174.google.com) (209.85.222.174) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 21 May 2010 17:43:18 +0000 Received: by pzk4 with SMTP id 4so687163pzk.7 for ; Fri, 21 May 2010 10:42:57 -0700 (PDT) Received: by 10.143.24.39 with SMTP id b39mr1264201wfj.184.1274463776888; Fri, 21 May 2010 10:42:56 -0700 (PDT) Received: from Robert-Colis-MacBook-Pro.local (64-71-7-198.static.wiline.com [64.71.7.198]) by mx.google.com with ESMTPS id 23sm1116137pzk.6.2010.05.21.10.42.54 (version=TLSv1/SSLv3 cipher=RC4-MD5); Fri, 21 May 2010 10:42:55 -0700 (PDT) Message-ID: <4BF6C61E.3080903@digg.com> Date: Fri, 21 May 2010 10:42:54 -0700 From: Rob Coli User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.1.9) Gecko/20100317 Thunderbird/3.0.4 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: Re: Multiple hard disks configuration References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org > On Mon, May 17, 2010 at 10:39 PM, Ma Xiao wrote: >> Hi all, >> Recently we have a 5 nodes running cassandra, 4 X 1.5TB drives for each, >> ...then I put 4 paths with DataFileDirectory, >> my question is what's going to happen when one of the disk fail, especialy >> the one has os installed which also holds the commit log? can we simply >> replace the disk, and cassandra will get the replica write back? or it >> should be deal with as entire node faile(wipe all date on the node and >> rejoin the ring), that will involve mass data copy from other node but we >> have just one disk fail. Note, we dont have hardware level raid installed. >> Any suggestion? On 5/18/10 11:23 PM, Jonathan Ellis wrote: > Yes, you can rely on replication for this (run nodetool repair). @Ma Xiao : Nodes which have lost data due to hardware failure and are "repair"ing can serve "no value" reads for data they think they have but do not actually currently have, when read with ConsistencyLevel.ONE. Each one of these no value reads triggers a read repair, however, which should result in your hottest data being repaired relatively quickly. As I understand it, using your 750gb of data case as an example, you are likely to serve a non-trivial number of these no value reads if you read with CL.ONE in a disk failure-then-repair scenario. Reading with CL.QUORUM avoids this risk. =Rob