incubator-cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jonathan Colby <jonathan.co...@gmail.com>
Subject average repair/bootstrap durations
Date Fri, 27 May 2011 13:08:10 GMT
Hi -

Operations  like repair and bootstrap on nodes in our cluster (average
load 150GB each) take a very long time.

By long I mean 1-2 days.   With nodetool "netstats" I can see the
progress % very slowly progressing.

I guess there are some throttling mechanisms built into cassandra.
And yes there is also production load on these nodes so it is somewhat
understandable. Also some of out compacted data files are as 50-60 GB
each.

I was just wondering if these times are similar to what other people
are experiencing or if there is a serious configuration problem with
our setup.

So what have you guys seen with operations like loadbalance,repair,
cleanup, bootstrap on nodes with large amounts of data??

I'm not seeing too many full garbage collections.  Other minor GCs are
well under a second.

Setup info:
0.7.4
5 GB heap
8 GB  ram
64 bit linux os
AMD quad core HP blades
CMS Garbage collector with default cassandra settings
1 TB raid 0 sata disks
across 2 datacenters, but operations within the same dc take very long too.


This is a netstat output of a bootstrap that has been going on for 3+ hours:

Mode: Normal
Streaming to: /10.47.108.103
   /var/lib/cassandra/data/DFS/main-f-1541-Data.db/(0,32842490722),(32842490722,139556639427),(139556639427,161075890783)
	 progress=94624588642/161075890783 - 58%
   /var/lib/cassandra/data/DFS/main-f-1455-Data.db/(0,660743002)
	 progress=0/660743002 - 0%
   /var/lib/cassandra/data/DFS/main-f-1444-Data.db/(0,32816130132),(32816130132,71465138397),(71465138397,90968640033)
	 progress=0/90968640033 - 0%
   /var/lib/cassandra/data/DFS/main-f-1540-Data.db/(0,931632934),(931632934,2621052149),(2621052149,3236107041)
	 progress=0/3236107041 - 0%
   /var/lib/cassandra/data/DFS/main-f-1488-Data.db/(0,33428780851),(33428780851,110546591227),(110546591227,110851587206)
	 progress=0/110851587206 - 0%
   /var/lib/cassandra/data/DFS/main-f-1542-Data.db/(0,24091168),(24091168,97485080),(97485080,108233211)
	 progress=0/108233211 - 0%
   /var/lib/cassandra/data/DFS/main-f-1544-Data.db/(0,3646406),(3646406,18065308),(18065308,25776551)
	 progress=0/25776551 - 0%
   /var/lib/cassandra/data/DFS/main-f-1452-Data.db/(0,676616940)
	 progress=0/676616940 - 0%
   /var/lib/cassandra/data/DFS/main-f-1548-Data.db/(0,6957269),(6957269,48966550),(48966550,51499779)
	 progress=0/51499779 - 0%
   /var/lib/cassandra/data/DFS/main-f-1552-Data.db/(0,237153399),(237153399,750466875),(750466875,898056853)
	 progress=0/898056853 - 0%
   /var/lib/cassandra/data/DFS/main-f-1554-Data.db/(0,45155582),(45155582,195640768),(195640768,247592141)
	 progress=0/247592141 - 0%
   /var/lib/cassandra/data/DFS/main-f-1449-Data.db/(0,2812483216)
	 progress=0/2812483216 - 0%
   /var/lib/cassandra/data/DFS/main-f-1545-Data.db/(0,107648943),(107648943,434575065),(434575065,436667186)
	 progress=0/436667186 - 0%
Not receiving any streams.
Pool Name                    Active   Pending      Completed
Commands                        n/a         0         134283
Responses                       n/a         0         192438

Mime
View raw message