From: aaron morton <aaron@thelastpickle.com>
To: user@cassandra.apache.org
Subject: Re: node restart taking too long
Date: Mon, 22 Aug 2011 09:42:48 +1200

The "cf already exists" error is not the same problem. I would need the call stack.

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 22/08/2011, at 1:03 AM, Yan Chunlu wrote:

> Does that mean I could just wait and it will be okay eventually?
>
> I also saw a "column family already exists" exception (not the exact wording, something like that), also after I deleted the migration and schema sstables. But I cannot reproduce it; is it a similar problem?
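(For reference, the "delete the migration and schema sstables" procedure Yan refers to, from the wiki FAQ discussed later in this thread, amounts to roughly the sketch below. The `/cassandra/data` path is an assumption taken from the log excerpts further down; check `data_file_directories` in your cassandra.yaml.)

```shell
# Sketch of the wiki FAQ schema_disagreement recovery for ONE disagreeing node.
# Paths are illustrative, not authoritative.

DATA_DIR=/cassandra/data   # assumed layout from the logs in this thread

# 1. Stop Cassandra on the disagreeing node (kill its JVM process).

# 2. Remove only the persisted schema state; application sstables are untouched.
rm -f "$DATA_DIR"/system/Schema*
rm -f "$DATA_DIR"/system/Migrations*

# 3. Restart the node. It re-requests migrations from the live nodes, and
#    until they are all applied, writes for not-yet-known CFs can surface as
#    transient "Couldn't find cfId=..." errors in the log.
```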
>
> On Sun, Aug 21, 2011 at 7:57 PM, aaron morton <aaron@thelastpickle.com> wrote:
> I've seen "Couldn't find cfId=1000" in the mutation stage happen when a node joins a cluster with existing data after having its schema cleared.
>
> The migrations received from another node are applied one CF at a time. When each CF is added, the node opens the existing data files, which can take a while. In the meantime it has joined gossip and is receiving mutations from other nodes that have all the CFs. Once the returning node has finished applying the migrations, the errors should stop.
>
> Reads are a similar story.
>
> Cheers
>
> -----------------
> Aaron Morton
> Freelance Cassandra Developer
> @aaronmorton
> http://www.thelastpickle.com
>
> On 21/08/2011, at 8:58 PM, Yan Chunlu wrote:
>
>> Actually I didn't drop any CF; maybe my understanding was totally wrong. Here is what I thought:
>>
>> I thought "deleted CFs" meant sstables that are useless (since "nodetool repair" can copy data to another node, the original sstables might be due for deletion but not yet removed). When I deleted all the migration and schema sstables, the node somehow "forgot" which files should be deleted, so it read them and "can not find cfId"...
>>
>> I got into this situation by the following steps: first I ran "nodetool repair" on node2, which failed in the middle (node3 was down) and left the Load at 170GB while the average is 30GB.
>>
>> After I brought node3 back up, node2 started up very slowly; 4 days passed and it was still starting. It seemed to be loading the row cache and key cache, so I disabled those caches by setting their sizes to 0 via cassandra-cli. During this procedure node2 was of course not reachable, so it could not update the schema.
>>
>> After that node2 could start very quickly, but "describe cluster" showed it as "UNREACHABLE", so I did as the FAQ says: deleted the schema and migration sstables and restarted node2.
>>
>> Then the "Couldn't find cfId=1000" error started showing up.
>>
>> I have just moved those migration and schema sstables back and started cassandra. It still showed "UNREACHABLE", but after waiting a couple of hours, "describe cluster" shows they are all on the same version now.
>>
>> Even though the problem is solved, I am not sure HOW... Really curious why just removing the "Migrations*" and "Schema*" sstables could cause the "Couldn't find cfId=1000" error.
>>
>> On Sun, Aug 21, 2011 at 12:24 PM, Jonathan Ellis <jbellis@gmail.com> wrote:
>> I'm not sure what problem you're trying to solve. The exception you
>> pasted should stop once your clients are no longer trying to use the
>> dropped CF.
>>
>> On Sat, Aug 20, 2011 at 10:09 PM, Yan Chunlu <springrider@gmail.com> wrote:
>> > That could be the reason. I did nodetool repair (unfinished; the data
>> > size increased 6 times, 30G vs 170G) and there should be some unclean
>> > sstables on that node.
>> > However, upgrading is tough work for me right now. Could nodetool scrub
>> > help? Or decommissioning the node and joining it again?
>> >
>> > On Sun, Aug 21, 2011 at 5:56 AM, Jonathan Ellis <jbellis@gmail.com> wrote:
>> >>
>> >> This means you should upgrade, because we've fixed bugs about ignoring
>> >> deleted CFs since 0.7.4.
>> >>
>> >> On Fri, Aug 19, 2011 at 9:26 AM, Yan Chunlu <springrider@gmail.com> wrote:
>> >> > The log file shows as follows; not sure what 'Couldn't find cfId=1000'
>> >> > means (Google just returned useless results):
>> >> >
>> >> > INFO [main] 2011-08-18 07:23:17,688 DatabaseDescriptor.java (line 453) Found table data in data directories. Consider using JMX to call org.apache.cassandra.service.StorageService.loadSchemaFromYaml().
>> >> > INFO [main] 2011-08-18 07:23:17,705 CommitLogSegment.java (line 50) Creating new commitlog segment /cassandra/commitlog/CommitLog-1313670197705.log
>> >> > INFO [main] 2011-08-18 07:23:17,716 CommitLog.java (line 155) Replaying /cassandra/commitlog/CommitLog-1313670030512.log
>> >> > INFO [main] 2011-08-18 07:23:17,734 CommitLog.java (line 314) Finished reading /cassandra/commitlog/CommitLog-1313670030512.log
>> >> > INFO [main] 2011-08-18 07:23:17,744 CommitLog.java (line 163) Log replay complete
>> >> > INFO [main] 2011-08-18 07:23:17,756 StorageService.java (line 364) Cassandra version: 0.7.4
>> >> > INFO [main] 2011-08-18 07:23:17,756 StorageService.java (line 365) Thrift API version: 19.4.0
>> >> > INFO [main] 2011-08-18 07:23:17,756 StorageService.java (line 378) Loading persisted ring state
>> >> > INFO [main] 2011-08-18 07:23:17,766 StorageService.java (line 414) Starting up server gossip
>> >> > INFO [main] 2011-08-18 07:23:17,771 ColumnFamilyStore.java (line 1048) Enqueuing flush of Memtable-LocationInfo@832310230(29 bytes, 1 operations)
>> >> > INFO [FlushWriter:1] 2011-08-18 07:23:17,772 Memtable.java (line 157) Writing Memtable-LocationInfo@832310230(29 bytes, 1 operations)
>> >> > INFO [FlushWriter:1] 2011-08-18 07:23:17,822 Memtable.java (line 164) Completed flushing /cassandra/data/system/LocationInfo-f-66-Data.db (80 bytes)
>> >> > INFO [CompactionExecutor:1] 2011-08-18 07:23:17,823 CompactionManager.java (line 396) Compacting
>> >> > [SSTableReader(path='/cassandra/data/system/LocationInfo-f-63-Data.db'),SSTableReader(path='/cassandra/data/system/LocationInfo-f-64-Data.db'),SSTableReader(path='/cassandra/data/system/LocationInfo-f-65-Data.db'),SSTableReader(path='/cassandra/data/system/LocationInfo-f-66-Data.db')]
>> >> > INFO [main] 2011-08-18 07:23:17,853 StorageService.java (line 478) Using saved token 113427455640312821154458202477256070484
>> >> > INFO [main] 2011-08-18 07:23:17,854 ColumnFamilyStore.java (line 1048) Enqueuing flush of Memtable-LocationInfo@18895884(53 bytes, 2 operations)
>> >> > INFO [FlushWriter:1] 2011-08-18 07:23:17,854 Memtable.java (line 157) Writing Memtable-LocationInfo@18895884(53 bytes, 2 operations)
>> >> > ERROR [MutationStage:28] 2011-08-18 07:23:18,246 RowMutationVerbHandler.java (line 86) Error in row mutation
>> >> > org.apache.cassandra.db.UnserializableColumnFamilyException: Couldn't find cfId=1000
>> >> >     at org.apache.cassandra.db.ColumnFamilySerializer.deserialize(ColumnFamilySerializer.java:117)
>> >> >     at org.apache.cassandra.db.RowMutation$RowMutationSerializer.deserialize(RowMutation.java:380)
>> >> >     at org.apache.cassandra.db.RowMutationVerbHandler.doVerb(RowMutationVerbHandler.java:50)
>> >> >     at org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:72)
>> >> >     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
>> >> >     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
>> >> >     at java.lang.Thread.run(Thread.java:636)
>> >> > INFO [GossipStage:1] 2011-08-18 07:23:18,255 Gossiper.java (line 623) Node /node1 has restarted, now UP again
>> >> > ERROR [ReadStage:1] 2011-08-18 07:23:18,254 DebuggableThreadPoolExecutor.java (line 103) Error in ThreadPoolExecutor
>> >> > java.lang.IllegalArgumentException: Unknown ColumnFamily prjcache in keyspace prjkeyspace
>> >> >     at org.apache.cassandra.config.DatabaseDescriptor.getComparator(DatabaseDescriptor.java:966)
>> >> >     at org.apache.cassandra.db.ColumnFamily.getComparatorFor(ColumnFamily.java:388)
>> >> >     at org.apache.cassandra.db.ReadCommand.getComparator(ReadCommand.java:93)
>> >> >     at org.apache.cassandra.db.SliceByNamesReadCommand.<init>(SliceByNamesReadCommand.java:44)
>> >> >     at org.apache.cassandra.db.SliceByNamesReadCommandSerializer.deserialize(SliceByNamesReadCommand.java:110)
>> >> >     at org.apache.cassandra.db.ReadCommandSerializer.deserialize(ReadCommand.java:122)
>> >> >     at org.apache.cassandra.db.ReadVerbHandler.doVerb(ReadVerbHandler.java:67)
>> >> >
>> >> > On Fri, Aug 19, 2011 at 5:44 AM, aaron morton <aaron@thelastpickle.com> wrote:
>> >> >>
>> >> >> Look in the logs to find out why the migration did not get to node2.
>> >> >> Otherwise yes, you can drop those files.
>> >> >> Cheers
>> >> >>
>> >> >> -----------------
>> >> >> Aaron Morton
>> >> >> Freelance Cassandra Developer
>> >> >> @aaronmorton
>> >> >> http://www.thelastpickle.com
>> >> >>
>> >> >> On 18/08/2011, at 11:25 PM, Yan Chunlu wrote:
>> >> >>
>> >> >> Just found out that for changes made via cassandra-cli, the schema change didn't reach node2, and node2 became unreachable...
>> >> >> I did as this document says: http://wiki.apache.org/cassandra/FAQ#schema_disagreement
>> >> >> but after that I still have two schema versions:
>> >> >>
>> >> >> ddcada52-c96a-11e0-99af-3bd951658d61: [node1, node3]
>> >> >> 2127b2ef-6998-11e0-b45b-3bd951658d61: [node2]
>> >> >>
>> >> >> Is it enough to delete the Schema* and Migrations* sstables and restart the node?
>> >> >>
>> >> >> On Thu, Aug 18, 2011 at 5:13 PM, Yan Chunlu <springrider@gmail.com> wrote:
>> >> >>>
>> >> >>> Thanks a lot for all the help!
>> >>> I have gone through the steps and successfully brought up node2 :)
>> >>>
>> >>> On Thu, Aug 18, 2011 at 10:51 AM, Boris Yen wrote:
>> >>> > Because the file only preserves the "keys" of the records, not the whole
>> >>> > records. Records for the saved keys are loaded back into cassandra during
>> >>> > its startup.
>> >>> >
>> >>> > On Wed, Aug 17, 2011 at 5:52 PM, Yan Chunlu wrote:
>> >>> >>
>> >>> >> But the data sizes in saved_caches are relatively small;
>> >>> >> will that cause the load problem?
>> >>> >>
>> >>> >> ls -lh /cassandra/saved_caches/
>> >>> >> total 32M
>> >>> >> -rw-r--r-- 1 cass cass 2.9M 2011-08-12 19:53 cass-CommentSortsCache-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 2.9M 2011-08-17 04:29 cass-CommentSortsCache-RowCache
>> >>> >> -rw-r--r-- 1 cass cass 2.7M 2011-08-12 18:50 cass-CommentVote-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 140K 2011-08-12 19:53 cass-device_images-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass  33K 2011-08-12 18:51 cass-Hide-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 4.6M 2011-08-12 19:53 cass-images-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 2.6M 2011-08-12 19:53 cass-LinksByUrl-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 2.5M 2011-08-12 18:50 cass-LinkVote-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 7.5M 2011-08-12 18:50 cass-cache-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 3.7M 2011-08-12 21:51 cass-cache-RowCache
>> >>> >> -rw-r--r-- 1 cass cass 1.8M 2011-08-12 18:51 cass-Save-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 111K 2011-08-12 19:50 cass-SavesByAccount-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass  864 2011-08-12 19:49 cass-VotesByDay-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 249K 2011-08-12 19:49 cass-VotesByLink-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass   28 2011-08-14 12:50 system-HintsColumnFamily-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass    5 2011-08-14 12:50 system-LocationInfo-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass   54 2011-08-13 13:30 system-Migrations-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass   76 2011-08-13 13:30 system-Schema-KeyCache
>> >>> >>
>> >>> >> On Wed, Aug 17, 2011 at 4:31 PM, aaron morton wrote:
>> >>> >> > If you have a node that cannot start up due to issues loading the saved
>> >>> >> > cache, delete the files in the saved_caches directory before starting it.
>> >>> >> >
>> >>> >> > The settings to save the row and key cache are per CF. You can change
>> >>> >> > them with an "update column family" statement via the CLI when attached
>> >>> >> > to any node. You may then want to check the saved_caches directory and
>> >>> >> > delete any files that are left (not sure if they are automatically deleted).
>> >>> >> >
>> >>> >> > I would recommend:
>> >>> >> > - stop node 2
>> >>> >> > - delete its saved_caches
>> >>> >> > - make the schema change via another node
>> >>> >> > - start up node 2
>> >>> >> >
>> >>> >> > Cheers
>> >>> >> >
>> >>> >> > -----------------
>> >>> >> > Aaron Morton
>> >>> >> > Freelance Cassandra Developer
>> >>> >> > @aaronmorton
>> >>> >> > http://www.thelastpickle.com
>> >>> >> >
>> >>> >> > On 17/08/2011, at 2:59 PM, Yan Chunlu wrote:
>> >>> >> >
>> >>> >> >> Does this need to be cluster-wide? Or could I just modify the caches
>> >>> >> >> on one node?
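(Concretely, Aaron's per-CF suggestion would look something like the sketch below on a 0.7-era cluster. The keyspace/CF names are examples taken from this thread, and `rows_cached`/`keys_cached` are my recollection of the 0.7 cassandra-cli attribute names, so treat this as illustrative rather than authoritative.)

```shell
# Sketch: disable row/key caches per CF from any reachable node, then clear
# the persisted caches on the stuck node. Names are examples from this thread.

# From a reachable node. The cli statements are shown as comments because
# they run inside cassandra-cli, not the shell:
#   cassandra-cli -host node1 -port 9160
#   [default@unknown] use prjkeyspace;
#   [default@prjkeyspace] update column family COMMENT with rows_cached=0 and keys_cached=0;

# On the node that cannot start: stop Cassandra, then remove the saved caches
# (path assumed from the ls listing above).
rm -f /cassandra/saved_caches/*-RowCache /cassandra/saved_caches/*-KeyCache

# Start Cassandra again; startup then skips the slow saved-row-cache reload.
```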
>> >>> >> >> Since I could not connect to the node with cassandra-cli (it says "Connection refused"):
>> >>> >> >>
>> >>> >> >> [default@unknown] connect node2/9160;
>> >>> >> >> Exception connecting to node2/9160. Reason: Connection refused.
>> >>> >> >>
>> >>> >> >> So if I change the cache sizes via the other nodes, how would node2 be
>> >>> >> >> notified of the change? Would killing cassandra and starting it again
>> >>> >> >> make it pick up the schema update?
>> >>> >> >>
>> >>> >> >> On Wed, Aug 17, 2011 at 5:59 AM, Teijo Holzer wrote:
>> >>> >> >>> Hi,
>> >>> >> >>>
>> >>> >> >>> yes, we saw exactly the same messages. We got rid of these by doing the
>> >>> >> >>> following:
>> >>> >> >>>
>> >>> >> >>> * Set all row & key caches in your CFs to 0 via cassandra-cli
>> >>> >> >>> * Kill Cassandra
>> >>> >> >>> * Remove all files in the saved_caches directory
>> >>> >> >>> * Start Cassandra
>> >>> >> >>> * Slowly bring back row & key caches (if desired; we left them off)
>> >>> >> >>>
>> >>> >> >>> Cheers,
>> >>> >> >>>
>> >>> >> >>> T.
>> >>> >> >>>
>> >>> >> >>> On 16/08/11 23:35, Yan Chunlu wrote:
>> >>> >> >>>>
>> >>> >> >>>> I saw a lot of SliceQueryFilter entries after changing the log level
>> >>> >> >>>> to DEBUG. I just thought even bringing up a new node would be faster
>> >>> >> >>>> than starting the old one...
>> >>> >> >>>> It is weird:
>> >>> >> >>>>
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:49,213 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:225@1313068845474382
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:49,245 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:453@1310999270198313
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:49,251 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:26@1313199902088827
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:49,576 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:157@1313097239332314
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,674 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:41729@1313190821826229
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,811 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:6@1313174157301203
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,867 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:98@1312011362250907
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,881 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:42@1313201711997005
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,910 SliceQueryFilter.java (line 123)
>> >>> >> >>>> collecting 0 of 2147483647: 76616c7565:false:96@1312939986190155
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,954 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:621@1313192538616112
>> >>> >> >>>>
>> >>> >> >>>> On Tue, Aug 16, 2011 at 7:32 PM, Yan Chunlu wrote:
>> >>> >> >>>>
>> >>> >> >>>>     But it seems the row cache setting is cluster-wide; how will
>> >>> >> >>>>     changing the row cache affect the read speed?
>> >>> >> >>>>
>> >>> >> >>>>     On Mon, Aug 15, 2011 at 7:33 AM, Jonathan Ellis wrote:
>> >>> >> >>>>
>> >>> >> >>>>         Or leave the row cache enabled but disable cache saving (and
>> >>> >> >>>>         remove the one already on disk).
>> >>> >> >>>>
>> >>> >> >>>>         On Sun, Aug 14, 2011 at 5:05 PM, aaron morton wrote:
>> >>> >> >>>>         > INFO [main] 2011-08-14 09:24:52,198 ColumnFamilyStore.java (line 547)
>> >>> >> >>>>         > completed loading (1744370 ms; 200000 keys) row cache for COMMENT
>> >>> >> >>>>
>> >>> >> >>>>         It's taking 29 minutes to load 200,000 rows into the row cache.
>> >>> >> >>>>         That's a pretty big row cache; I would suggest reducing or
>> >>> >> >>>>         disabling it. Background:
>> >>> >> >>>>         http://www.datastax.com/dev/blog/maximizing-cache-benefit-with-cassandra
>> >>> >> >>>>
>> >>> >> >>>>         > and the server could not handle the load and crashed.
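(As an aside for readers: the first field in those SliceQueryFilter DEBUG lines is the hex-encoded column name; `76616c7565` is just ASCII "value". A quick sketch of decoding one of those tokens, assuming the 0.7-era `name_hex:deleted:length@timestamp` layout shown above; the helper name is mine, not Cassandra's.)

```python
# Decode a column token from a SliceQueryFilter DEBUG line such as:
#   collecting 0 of 2147483647: 76616c7565:false:225@1313068845474382
# Assumed field layout: column_name_hex:deleted:value_length@timestamp.

def parse_column(token: str) -> dict:
    """Split one 'hexname:deleted:length@timestamp' token into its parts."""
    name_hex, deleted, rest = token.split(":")
    length, timestamp = rest.split("@")
    return {
        "name": bytes.fromhex(name_hex).decode("ascii"),
        "deleted": deleted == "true",
        "length": int(length),
        "timestamp": int(timestamp),
    }

col = parse_column("76616c7565:false:225@1313068845474382")
print(col["name"])       # -> value
print(col["timestamp"])  # -> 1313068845474382
```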
>> >>> >> >>>>         > After it came back, node 3 could not return for more than 96 hours.
>> >>> >> >>>>
>> >>> >> >>>>         Crashed how?
>> >>> >> >>>>         You may be seeing https://issues.apache.org/jira/browse/CASSANDRA-2280
>> >>> >> >>>>         Watch nodetool compactionstats to see when the Merkle tree build
>> >>> >> >>>>         finishes, and nodetool netstats to see which CFs are streaming.
>> >>> >> >>>>         Cheers
>> >>> >> >>>>
>> >>> >> >>>>         -----------------
>> >>> >> >>>>         Aaron Morton
>> >>> >> >>>>         Freelance Cassandra Developer
>> >>> >> >>>>         @aaronmorton
>> >>> >> >>>>         http://www.thelastpickle.com
>> >>> >> >>>>
>> >>> >> >>>>         On 15 Aug 2011, at 04:23, Yan Chunlu wrote:
>> >>> >> >>>>
>> >>> >> >>>>         I have 3 nodes with RF=3. When I was repairing node3, it seems
>> >>> >> >>>>         a lot of data was generated, and the server could not handle
>> >>> >> >>>>         the load and crashed. After it came back, node 3 could not
>> >>> >> >>>>         return for more than 96 hours.
>> >>> >> >>>>
>> >>> >> >>>>         For 34GB of data, node 2 could restart and be back online within 1 hour.
>> >>> >> >>>>
>> >>> >> >>>>         I am not sure what's wrong with node3; should I restart node 3 again?
>> >>> >> >>>>         Thanks!
>> >>> >> >>>>         Address   Status  State   Load       Owns    Token
>> >>> >> >>>>                                                      113427455640312821154458202477256070484
>> >>> >> >>>>         node1     Up      Normal  34.11 GB   33.33%  0
>> >>> >> >>>>         node2     Up      Normal  31.44 GB   33.33%  56713727820156410577229101238628035242
>> >>> >> >>>>         node3     Down    Normal  177.55 GB  33.33%  113427455640312821154458202477256070484
>> >>> >> >>>>
>> >>> >> >>>>         The log shows it is still going on; not sure why it is so slow:
>> >>> >> >>>>
>> >>> >> >>>>         INFO [main] 2011-08-14 08:55:47,734 SSTableReader.java (line 154) Opening /cassandra/data/COMMENT
>> >>> >> >>>>         INFO [main] 2011-08-14 08:55:47,828 ColumnFamilyStore.java (line 275) reading saved cache /cassandra/saved_caches/COMMENT-RowCache
>> >>> >> >>>>         INFO [main] 2011-08-14 09:24:52,198 ColumnFamilyStore.java (line 547) completed loading (1744370 ms; 200000 keys) row cache for COMMENT
>> >>> >> >>>>         INFO [main] 2011-08-14 09:24:52,299 ColumnFamilyStore.java (line 275) reading saved cache /cassandra/saved_caches/COMMENT-RowCache
>> >>> >> >>>>         INFO [CompactionExecutor:1] 2011-08-14 10:24:55,480 CacheWriter.java (line 96) Saved COMMENT-RowCache (200000 items) in 2535 ms
>> >>> >> >>>>
>> >>> >> >>>> --
>> >>> >> >>>> Jonathan Ellis
>> >>> >> >>>> Project Chair, Apache Cassandra
>> >>> >> >>>> co-founder of DataStax, the source for professional Cassandra support
>> >>> >> >>>> http://www.datastax.com
>>
>> --
>> Jonathan Ellis
>> Project Chair, Apache Cassandra
>> co-founder of DataStax, the source for professional Cassandra support
>> http://www.datastax.com
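(One more aside: the tokens in the nodetool ring output quoted above are the standard balanced assignment for the RandomPartitioner, whose token space is [0, 2**127). A quick sketch reproducing them; the function name is mine, but the arithmetic matches the tokens in the thread.)

```python
# Reproduce the balanced 3-node tokens from the ring output in this thread.
# Token generators for the RandomPartitioner commonly use a fixed step of
# 2**127 // N and give node i the token i * step.

def balanced_tokens(node_count: int) -> list:
    step = 2**127 // node_count
    return [i * step for i in range(node_count)]

tokens = balanced_tokens(3)
print(tokens[0])  # 0                                        (node1)
print(tokens[1])  # 56713727820156410577229101238628035242   (node2)
print(tokens[2])  # 113427455640312821154458202477256070484  (node3)
```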

Would need the = call stack. 

Cheers

http://www.thelastpickle.com

On 22/08/2011, at 1:03 AM, Yan Chunlu wrote:

is that = means I could just wait and it will be okay = eventually?

I also saw the "column family already = exists"(not accurate, something like that) Exception, also caused after = I delete the migration and schema sstables.   but I can not = reproduce it, is that a similar problem?

On Sun, Aug 21, 2011 at 7:57 PM, aaron = morton <aaron@thelastpickle.com> wrote:
I've seen "Couldn't find cfId=3D1000" = in a mutation stage happen when a node joins a cluster with existing = data after having it's schema cleared. 

The migrations received from another node are applied one CF at a time, = when each CF is added the node will open the existing data files which = can take a while. In the mean time it's joined on gossip and is = receiving mutations from other nodes that have all the CF's. One the = returning node gets through applying the migration the errors should = stop. 

Read is a similar = story.

Cheers
 


-----------------
Aaron Morton
Freelance = Cassandra Developer
@aaronmorton

On 21/08/2011, at = 8:58 PM, Yan Chunlu wrote:

actually I didn't dropped any CF,  maybe my = understanding was totally wrong, I just describe what I thought as = belows: 

I thought by "deleted = CFs" means the sstable that useless(since "node repair" and could copy = data to another node,  the original sstable might be deleted but = not yet).  when I deleted all migration and schema sstables, it = somehow "forgot" those files should be deleted, so it read the file and = "can not find cfId"...


I got to this situation by the = following steps: at first I did "node repair" on node2 which failed in = the middle(node3 down), and leave the Load as 170GB while average is = 30GB.

after I brought up node3,  the node2 start up very = slow, 4 days past it stil starting.  it seems loading row cache and = key cache.  so I disabled those cache by set the value to 0 via = cassandra-cli. during this procedure, of course node2 was not reachable = so it can not update the schema.

after that node2 could be start very quickly, but = the "describe cluster" shows it was "UNREACHABLE", so I did as the FAQ = says, delete schema, migration sstables and restart node2. 

then the "Couldn't = find cfId=3D1000'" error start showing up.





<= /div>
I have just moved those migration && schema sstables = back and start cassandra, it still shows "UNREACHABLE", after wait for = couple of hours, the "describe cluster" shows they are the same version = now.


even this problem solved, I am not = sure HOW....... really curious that why just remove "migration* and = schema*" sstables could cause  "Couldn't = find cfId=3D1000'"  error.

On Sun, Aug 21, 2011 at 12:24 PM, = Jonathan Ellis <jbellis@gmail.com> wrote:
I'm not sure what problem you're trying to solve.  The exception = you
pasted should stop once your clients are no longer trying to use the
dropped CF.

On Sat, Aug 20, 2011 at 10:09 PM, Yan Chunlu <springrider@gmail.com> wrote:
> that could be the reason, I did nodetool repair(unfinished, = data size
> increased 6 times bigger 30G vs 170G) and there should be some = unclean
> sstables on that node.
> however upgrade it a tough work for me right now.  could the = nodetool scrub
> help?  or decommission the node and join it again?
>
> On Sun, Aug 21, 2011 at 5:56 AM, Jonathan Ellis <jbellis@gmail.com> wrote:
>>
>> This means you should upgrade, because we've fixed bugs about = ignoring
>> deleted CFs since 0.7.4.
>>
>> On Fri, Aug 19, 2011 at 9:26 AM, Yan Chunlu <springrider@gmail.com> wrote:
>> > the log file shows as follows, not sure what does = 'Couldn't find
>> > cfId=3D1000'
>> > means(google just returned useless results):
>> >
>> > INFO [main] 2011-08-18 07:23:17,688 DatabaseDescriptor.java (line 453) Found table data in data directories. Consider using JMX to call org.apache.cassandra.service.StorageService.loadSchemaFromYaml().
>> > INFO [main] 2011-08-18 07:23:17,705 CommitLogSegment.java (line 50) Creating new commitlog segment /cassandra/commitlog/CommitLog-1313670197705.log
>> > INFO [main] 2011-08-18 07:23:17,716 CommitLog.java (line 155) Replaying /cassandra/commitlog/CommitLog-1313670030512.log
>> > INFO [main] 2011-08-18 07:23:17,734 CommitLog.java (line 314) Finished reading /cassandra/commitlog/CommitLog-1313670030512.log
>> > INFO [main] 2011-08-18 07:23:17,744 CommitLog.java (line 163) Log replay complete
>> > INFO [main] 2011-08-18 07:23:17,756 StorageService.java (line 364) Cassandra version: 0.7.4
>> > INFO [main] 2011-08-18 07:23:17,756 StorageService.java (line 365) Thrift API version: 19.4.0
>> > INFO [main] 2011-08-18 07:23:17,756 StorageService.java (line 378) Loading persisted ring state
>> > INFO [main] 2011-08-18 07:23:17,766 StorageService.java (line 414) Starting up server gossip
>> > INFO [main] 2011-08-18 07:23:17,771 ColumnFamilyStore.java (line 1048) Enqueuing flush of Memtable-LocationInfo@832310230(29 bytes, 1 operations)
>> > INFO [FlushWriter:1] 2011-08-18 07:23:17,772 Memtable.java (line 157) Writing Memtable-LocationInfo@832310230(29 bytes, 1 operations)
>> > INFO [FlushWriter:1] 2011-08-18 07:23:17,822 Memtable.java (line 164) Completed flushing /cassandra/data/system/LocationInfo-f-66-Data.db (80 bytes)
>> > INFO [CompactionExecutor:1] 2011-08-18 07:23:17,823 CompactionManager.java (line 396) Compacting [SSTableReader(path='/cassandra/data/system/LocationInfo-f-63-Data.db'),SSTableReader(path='/cassandra/data/system/LocationInfo-f-64-Data.db'),SSTableReader(path='/cassandra/data/system/LocationInfo-f-65-Data.db'),SSTableReader(path='/cassandra/data/system/LocationInfo-f-66-Data.db')]
>> > INFO [main] 2011-08-18 07:23:17,853 StorageService.java (line 478) Using saved token 113427455640312821154458202477256070484
>> > INFO [main] 2011-08-18 07:23:17,854 ColumnFamilyStore.java (line 1048) Enqueuing flush of Memtable-LocationInfo@18895884(53 bytes, 2 operations)
>> > INFO [FlushWriter:1] 2011-08-18 07:23:17,854 Memtable.java (line 157) Writing Memtable-LocationInfo@18895884(53 bytes, 2 operations)
>> > ERROR [MutationStage:28] 2011-08-18 07:23:18,246 RowMutationVerbHandler.java (line 86) Error in row mutation
>> > org.apache.cassandra.db.UnserializableColumnFamilyException: Couldn't find cfId=1000
>> >     at org.apache.cassandra.db.ColumnFamilySerializer.deserialize(ColumnFamilySerializer.java:117)
>> >     at org.apache.cassandra.db.RowMutation$RowMutationSerializer.deserialize(RowMutation.java:380)
>> >     at org.apache.cassandra.db.RowMutationVerbHandler.doVerb(RowMutationVerbHandler.java:50)
>> >     at org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:72)
>> >     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
>> >     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
>> >     at java.lang.Thread.run(Thread.java:636)
>> > INFO [GossipStage:1] 2011-08-18 07:23:18,255 Gossiper.java (line 623) Node /node1 has restarted, now UP again
>> > ERROR [ReadStage:1] 2011-08-18 07:23:18,254 DebuggableThreadPoolExecutor.java (line 103) Error in ThreadPoolExecutor
>> > java.lang.IllegalArgumentException: Unknown ColumnFamily prjcache in keyspace prjkeyspace
>> >     at org.apache.cassandra.config.DatabaseDescriptor.getComparator(DatabaseDescriptor.java:966)
>> >     at org.apache.cassandra.db.ColumnFamily.getComparatorFor(ColumnFamily.java:388)
>> >     at org.apache.cassandra.db.ReadCommand.getComparator(ReadCommand.java:93)
>> >     at org.apache.cassandra.db.SliceByNamesReadCommand.<init>(SliceByNamesReadCommand.java:44)
>> >     at org.apache.cassandra.db.SliceByNamesReadCommandSerializer.deserialize(SliceByNamesReadCommand.java:110)
>> >     at org.apache.cassandra.db.ReadCommandSerializer.deserialize(ReadCommand.java:122)
>> >     at org.apache.cassandra.db.ReadVerbHandler.doVerb(ReadVerbHandler.java:67)
>> >
>> > On Fri, Aug 19, 2011 at 5:44 AM, aaron morton <aaron@thelastpickle.com>
>> > wrote:
>> >>
>> >> Look in the logs to find out why the migration did not get to node2.
>> >> Otherwise, yes, you can drop those files.
>> >> Cheers
>> >> -----------------
>> >> Aaron Morton
>> >> Freelance Cassandra Developer
>> >> @aaronmorton
>> >> http://www.thelastpickle.com
>> >> On 18/08/2011, at 11:25 PM, Yan Chunlu wrote:
>> >>
>> >> just found out that after making changes via cassandra-cli, the schema
>> >> change didn't reach node2, and node2 became unreachable...
>> >> I followed this document:
>> >> http://wiki.apache.org/cassandra/FAQ#schema_disagreement
>> >> but after that I still have two schema versions:
>> >>
>> >>
>> >> ddcada52-c96a-11e0-99af-3bd951658d61: [node1, = node3]
>> >> 2127b2ef-6998-11e0-b45b-3bd951658d61: [node2]
>> >>
>> >> is it enough to delete the Schema* and Migrations* sstables and restart
>> >> the node?
>> >>
>> >>
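For reference, the FAQ procedure linked above amounts to the following on the out-of-sync node. A sketch only: the /cassandra/data path is the one this cluster's logs show, and would differ elsewhere.

```shell
# on node2, with Cassandra stopped:
rm /cassandra/data/system/Schema*      # serialized schema definitions
rm /cassandra/data/system/Migrations*  # schema migration history
# restart Cassandra; the node should pull the current schema from its peers
```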
>> >> On Thu, Aug 18, 2011 at 5:13 PM, Yan Chunlu <springrider@gmail.com>
>> >> wrote:
>> >>>
>> >>> thanks a lot for all the help! I have gone through the steps and
>> >>> successfully brought up node2 :)
>> >>>
>> >>> On Thu, Aug 18, 2011 at 10:51 AM, Boris Yen <yulinyen@gmail.com>
>> >>> wrote:
>> >>> > Because the file only preserves the "keys" of records, not the whole
>> >>> > records. Records for those saved keys will be loaded into Cassandra
>> >>> > during startup.
>> >>> >
>> >>> > On Wed, Aug 17, 2011 at 5:52 PM, Yan Chunlu = <springrider@gmail.com>
>> >>> > wrote:
>> >>> >>
>> >>> >> but the data sizes in the saved_caches are relatively small;
>> >>> >> will that cause the load problem?
>> >>> >>
>> >>> >> ls -lh /cassandra/saved_caches/
>> >>> >> total 32M
>> >>> >> -rw-r--r-- 1 cass cass 2.9M 2011-08-12 19:53 cass-CommentSortsCache-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 2.9M 2011-08-17 04:29 cass-CommentSortsCache-RowCache
>> >>> >> -rw-r--r-- 1 cass cass 2.7M 2011-08-12 18:50 cass-CommentVote-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 140K 2011-08-12 19:53 cass-device_images-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass  33K 2011-08-12 18:51 cass-Hide-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 4.6M 2011-08-12 19:53 cass-images-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 2.6M 2011-08-12 19:53 cass-LinksByUrl-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 2.5M 2011-08-12 18:50 cass-LinkVote-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 7.5M 2011-08-12 18:50 cass-cache-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 3.7M 2011-08-12 21:51 cass-cache-RowCache
>> >>> >> -rw-r--r-- 1 cass cass 1.8M 2011-08-12 18:51 cass-Save-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 111K 2011-08-12 19:50 cass-SavesByAccount-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass  864 2011-08-12 19:49 cass-VotesByDay-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass 249K 2011-08-12 19:49 cass-VotesByLink-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass   28 2011-08-14 12:50 system-HintsColumnFamily-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass    5 2011-08-14 12:50 system-LocationInfo-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass   54 2011-08-13 13:30 system-Migrations-KeyCache
>> >>> >> -rw-r--r-- 1 cass cass   76 2011-08-13 13:30 system-Schema-KeyCache
>> >>> >>
>> >>> >> On Wed, Aug 17, 2011 at 4:31 PM, aaron = morton
>> >>> >> <aaron@thelastpickle.com>
>> >>> >> wrote:
>> >>> >> > If you have a node that cannot start up due to issues loading the
>> >>> >> > saved cache, delete the files in the saved_caches directory before
>> >>> >> > starting it.
>> >>> >> >
>> >>> >> > The settings to save the row and key cache are per CF. You can
>> >>> >> > change them with an update column family statement via the CLI when
>> >>> >> > attached to any node. You may then want to check the saved_caches
>> >>> >> > directory and delete any files that are left (not sure if they are
>> >>> >> > automatically deleted).
>> >>> >> >
>> >>> >> > I would recommend:
>> >>> >> > - stop node 2
>> >>> >> > - delete its saved_cache
>> >>> >> > - make the schema change via another node
>> >>> >> > - start up node 2
>> >>> >> >
>> >>> >> > Cheers
>> >>> >> >
>> >>> >> > -----------------
>> >>> >> > Aaron Morton
>> >>> >> > Freelance Cassandra Developer
>> >>> >> > @aaronmorton
>> >>> >> > http://www.thelastpickle.com
>> >>> >> >
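Aaron's four steps above, as a rough shell transcript. A sketch: the saved_caches path and node names are this thread's, and the stop/start steps are placeholders for however Cassandra is managed on these hosts.

```shell
# 1. stop node 2 (via your init script or by killing the JVM)
# 2. delete its saved caches
rm /cassandra/saved_caches/*
# 3. make the schema change from the CLI attached to any OTHER live node
cassandra-cli -h node1 -p 9160
# 4. start node 2 again; it picks up the schema change on startup
```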
>> >>> >> > On 17/08/2011, at 2:59 PM, Yan = Chunlu wrote:
>> >>> >> >
>> >>> >> >> does this need to be cluster wide, or could I just modify the
>> >>> >> >> caches on one node? I could not connect to that node with
>> >>> >> >> cassandra-cli; it says "connection refused":
>> >>> >> >>
>> >>> >> >>
>> >>> >> >> [default@unknown] connect node2/9160;
>> >>> >> >> Exception connecting to node2/9160. Reason: Connection refused.
>> >>> >> >>
>> >>> >> >>
>> >>> >> >> so if I change the cache size via other nodes, how would node2 be
>> >>> >> >> notified of the change? Would killing Cassandra and starting it
>> >>> >> >> again make it pick up the schema?
>> >>> >> >>
>> >>> >> >>
>> >>> >> >>
>> >>> >> >> On Wed, Aug 17, 2011 at 5:59 AM, = Teijo Holzer
>> >>> >> >> <tholzer@wetafx.co.nz>
>> >>> >> >> wrote:
>> >>> >> >>> Hi,
>> >>> >> >>>
>> >>> >> >>> yes, we saw exactly the same messages. We got rid of these by
>> >>> >> >>> doing the following:
>> >>> >> >>>
>> >>> >> >>> * Set all row & key caches in your CFs to 0 via cassandra-cli
>> >>> >> >>> * Kill Cassandra
>> >>> >> >>> * Remove all files in the saved_caches directory
>> >>> >> >>> * Start Cassandra
>> >>> >> >>> * Slowly bring back row & key caches (if desired, we left them off)
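The first bullet, sketched in 0.7-era cassandra-cli syntax. The keyspace and column family names here are just examples taken from this thread; repeat the statement once per column family.

```
[default@prjkeyspace] update column family cache with rows_cached=0 and keys_cached=0;
```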
>> >>> >> >>>
>> >>> >> >>> Cheers,
>> >>> >> >>>
>> >>> >> >>>        T.
>> >>> >> >>>
>> >>> >> >>> On 16/08/11 23:35, Yan = Chunlu wrote:
>> >>> >> >>>>
>> >>> >> >>>> I saw a lot of SliceQueryFilter entries after changing the log
>> >>> >> >>>> level to DEBUG. I just thought even bringing up a new node would
>> >>> >> >>>> be faster than starting the old one... it is weird
>> >>> >> >>>>
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:49,213 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:225@1313068845474382
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:49,245 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:453@1310999270198313
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:49,251 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:26@1313199902088827
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:49,576 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:157@1313097239332314
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,674 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:41729@1313190821826229
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,811 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:6@1313174157301203
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,867 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:98@1312011362250907
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,881 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:42@1313201711997005
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,910 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:96@1312939986190155
>> >>> >> >>>> DEBUG [main] 2011-08-16 06:32:50,954 SliceQueryFilter.java (line 123) collecting 0 of 2147483647: 76616c7565:false:621@1313192538616112
>> >>> >> >>>>
>> >>> >> >>>>
>> >>> >> >>>>
>> >>> >> >>>> On Tue, Aug 16, 2011 at = 7:32 PM, Yan Chunlu
>> >>> >> >>>> <springrider@gmail.com
>> >>> >> >>>> <mailto:springrider@gmail.com>> wrote:
>> >>> >> >>>>
>> >>> >> >>>>    but it seems the row cache is cluster wide; how will the
>> >>> >> >>>>    change of row cache affect the read speed?
>> >>> >> >>>>
>> >>> >> >>>>
>> >>> >> >>>>    On Mon, Aug 15, 2011 at 7:33 AM, Jonathan Ellis
>> >>> >> >>>>    <jbellis@gmail.com> wrote:
>> >>> >> >>>>
>> >>> >> >>>>        Or leave row cache enabled but disable cache saving (and
>> >>> >> >>>>        remove the one already on disk).
>> >>> >> >>>>
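Jonathan's alternative above, sketched as a CLI statement. The attribute name is my assumption of the 0.7 cassandra-cli syntax, and the CF name is just this thread's example, so verify with `help update column family;` first:

```
update column family COMMENT with row_cache_save_period=0;
```

Then delete the existing saved cache file (COMMENT-RowCache in the saved_caches directory) so it is not loaded on the next restart.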
>> >>> >> >>>>        On Sun, Aug 14, 2011 at 5:05 PM, aaron morton
>> >>> >> >>>>        <aaron@thelastpickle.com> wrote:
>> >>> >> >>>>        > INFO [main] 2011-08-14 09:24:52,198 ColumnFamilyStore.java (line 547) completed loading (1744370 ms; 200000 keys) row cache for COMMENT
>> >>> >> >>>>        >
>> >>> >> >>>>        > It's taking 29 minutes to load 200,000 rows in the row cache. That's a pretty big row cache; I would suggest reducing or disabling it. Background:
>> >>> >> >>>>        > http://www.datastax.com/dev/blog/maximizing-cache-benefit-with-cassandra
>> >>> >> >>>>        >
>> >>> >> >>>>        > > and server can not afford the load then crashed. after come back, node 3 can not return for more than 96 hours
>> >>> >> >>>>        >
>> >>> >> >>>>        > Crashed how?
>> >>> >> >>>>        > You may be seeing https://issues.apache.org/jira/browse/CASSANDRA-2280
>> >>> >> >>>>        > Watch nodetool compactionstats to see when the Merkle tree build finishes, and nodetool netstats to see which CFs are streaming.
>> >>> >> >>>>        > Cheers
>> >>> >> >>>>        > -----------------
>> >>> >> >>>>        > Aaron Morton
>> >>> >> >>>>        > Freelance Cassandra Developer
>> >>> >> >>>>        > @aaronmorton
>> >>> >> >>>>        > http://www.thelastpickle.com
>> >>> >> >>>>        > On 15 Aug 2011, at 04:23, Yan Chunlu wrote:
>> >>> >> >>>>        >
>> >>> >> >>>>        > I got 3 nodes and RF=3. When I was repairing node3, it seems a lot of data was generated, and the server could not afford the load and crashed. After it came back, node 3 could not return for more than 96 hours.
>> >>> >> >>>>        >
>> >>> >> >>>>        > For 34GB of data, node 2 could restart and come back online within 1 hour.
>> >>> >> >>>>        >
>> >>> >> >>>>        > I am not sure what's wrong with node3; should I restart node 3 again? thanks!
>> >>> >> >>>>        >
>> >>> >> >>>>        > Address   Status State   Load       Owns    Token
>> >>> >> >>>>        >                                             113427455640312821154458202477256070484
>> >>> >> >>>>        > node1     Up     Normal  34.11 GB   33.33%  0
>> >>> >> >>>>        > node2     Up     Normal  31.44 GB   33.33%  56713727820156410577229101238628035242
>> >>> >> >>>>        > node3     Down   Normal  177.55 GB  33.33%  113427455640312821154458202477256070484
>> >>> >> >>>>        >
>> >>> >> >>>>        > the log shows it is still going on; not sure why it is so slow:
>> >>> >> >>>>        >
>> >>> >> >>>>        > INFO [main] 2011-08-14 08:55:47,734 SSTableReader.java (line 154) Opening /cassandra/data/COMMENT
>> >>> >> >>>>        > INFO [main] 2011-08-14 08:55:47,828 ColumnFamilyStore.java (line 275) reading saved cache /cassandra/saved_caches/COMMENT-RowCache
>> >>> >> >>>>        > INFO [main] 2011-08-14 09:24:52,198 ColumnFamilyStore.java (line 547) completed loading (1744370 ms; 200000 keys) row cache for COMMENT
>> >>> >> >>>>        > INFO [main] 2011-08-14 09:24:52,299 ColumnFamilyStore.java (line 275) reading saved cache /cassandra/saved_caches/COMMENT-RowCache
>> >>> >> >>>>        > INFO [CompactionExecutor:1] 2011-08-14 10:24:55,480 CacheWriter.java (line 96) Saved COMMENT-RowCache (200000 items) in 2535 ms
>> >>> >> >>>>
>> >>> >> >>>>
>> >>> >> >>>>
>> >>> >> >>>>        --
>> >>> >> >>>>        Jonathan Ellis
>> >>> >> >>>>        Project Chair, Apache Cassandra
>> >>> >> >>>>        co-founder of DataStax, the source for professional Cassandra support
>> >>> >> >>>>        http://www.datastax.com
>> >>> >> >>>>
>> >>> >> >>>>
>> >>> >> >>>>
>> >>> >> >>>
>> >>> >> >>>
>> >>> >> >
>> >>> >> >
>> >>> >
>> >>> >
>> >>>
>> >>
>> >>
>> >
>> >
>>
>>
>>
>> --
>> Jonathan Ellis
>> Project Chair, Apache Cassandra
>> co-founder of DataStax, the source for professional Cassandra support
>> http://www.datastax.com
>
>



--
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com
