Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 22147 invoked from network); 4 Apr 2011 12:47:16 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 4 Apr 2011 12:47:16 -0000 Received: (qmail 74845 invoked by uid 500); 4 Apr 2011 12:47:13 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 74821 invoked by uid 500); 4 Apr 2011 12:47:13 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 74799 invoked by uid 99); 4 Apr 2011 12:47:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 Apr 2011 12:47:13 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [208.113.200.5] (HELO homiemail-a79.g.dreamhost.com) (208.113.200.5) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 Apr 2011 12:47:05 +0000 Received: from homiemail-a79.g.dreamhost.com (localhost [127.0.0.1]) by homiemail-a79.g.dreamhost.com (Postfix) with ESMTP id 7321B7D4070 for ; Mon, 4 Apr 2011 05:46:38 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; c=nofws; d=thelastpickle.com; h=content-type :mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; q=dns; s= thelastpickle.com; b=omHK3h8edgRf+fT08hTtHl4BeokYBM54B2k+8ZukrCB C4g/euZu0mVP5EFd6a6XkJ+DK8FKd6veKsFdPbuv0UYIAnnO7kd5OD409RWI0KjM ZeJLCZlgyNNc9gICZlO6o5QQjRQ/IwK8i45udUac9+o2Q3Wy+STDw3+UzV06RqjA = DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=thelastpickle.com; h= content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; s= thelastpickle.com; bh=Y2lGlnGjDDPcitgz0/7X04I6DOg=; b=nrE27d5/Bn sclMaeTGl9myf00/TsE0F1R3nQXzUaIr/okIzcSegYfXUN+AbrLRF44XcGJpZxPI fdqSaRGcObWwBQijj1NcpgXop4Kr6F/B6KopxHW1xABoeHXcbKIxV8NsKddzFBhg gKpsFE5w9U20mPiph2eWCbeI7fHJI+v9Q= Received: from [192.168.0.104] (CPE-58-175-240-72.hdcz1.win.bigpond.net.au [58.175.240.72]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: aaron@thelastpickle.com) by homiemail-a79.g.dreamhost.com (Postfix) with ESMTPSA id C6D5B7D406C for ; Mon, 4 Apr 2011 05:46:37 -0700 (PDT) Content-Type: text/plain; charset=iso-8859-1 Mime-Version: 1.0 (Apple Message framework v1082.1) Subject: Re: AW: Strange nodetool repair behaviour From: aaron morton In-Reply-To: <120CB7532EA53A4D8CA6B63F94B4ADB351E6D62840@IE2RD2XVS021.red002.local> Date: Mon, 4 Apr 2011 22:46:32 +1000 Content-Transfer-Encoding: quoted-printable Message-Id: References: <4D999CBB.4070603@trioptima.com> <120CB7532EA53A4D8CA6B63F94B4ADB351E6D62840@IE2RD2XVS021.red002.local> To: user@cassandra.apache.org X-Mailer: Apple Mail (2.1082.1) X-Virus-Checked: Checked by ClamAV on apache.org Jonas, AFAIK if repair completed successfully there should be no = streaming the next time round. This sounds odd please look into it if = you can.=20 Can you run at DEBUG logging, there will be some messages about = receiving streams from files and which ranges are being requested.=20 I would be interested to know if the repair is completing successfully. = You should see messages such as "Repair session blah completed = successfully" if it is. It is possible repair to hang if one of the = neighbours goes away or fails to send the data. In this case the repair = session will timeout after 48 hours.=20 Aaron On 4 Apr 2011, at 20:39, Roland Gude wrote: > I am experiencing the same behavior but had it on previous versions of = 0.7 as well. >=20 >=20 > -----Urspr=FCngliche Nachricht----- > Von: Jonas Borgstr=F6m [mailto:jonas.borgstrom@trioptima.com]=20 > Gesendet: Montag, 4. April 2011 12:26 > An: user@cassandra.apache.org > Betreff: Strange nodetool repair behaviour >=20 > Hi, >=20 > I have a 6 node 0.7.4 cluster with replication_factor=3D3 where = "nodetool > repair keyspace" behaves really strange. >=20 > The keyspace contains three column families and about 60GB data in = total > (i.e 30GB on each node). >=20 > Even though no data has been added or deleted since the last repair, a > repair takes hours and the repairing node seems to receive 100+GB = worth > of sstable data from its neighbourhood nodes, i.e several times the > actual data size. >=20 > The log says things like: >=20 > "Performing streaming repair of 27 ranges" >=20 > And a bunch of: >=20 > "Compacted to 22,208,983,964 to 4,816,514,033 (~21% of = original)" >=20 > In the end the repair finishes without any error after a few hours but > even then the active sstables seems to contain lots of redundant data > since the disk usage can be sliced in half by triggering a major = compaction. >=20 > All this leads me to believe that something stops the AES from = correctly > figuring out what data is already on the repairing node and what needs > to be streamed from the neighbours. >=20 > The only thing I can think of right now is that one of the column > families contains a lot of large rows that are larger than > memtable_throughput and that's perhaps what's confusing the merkle = tree. >=20 > Anyway, is this a known problem of perhaps expected behaviour? > Otherwise I'll try to create a more reproducible test case. >=20 > Regards, > Jonas >=20 >=20