Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id EB8579EE8 for ; Thu, 16 Aug 2012 08:52:04 +0000 (UTC) Received: (qmail 36286 invoked by uid 500); 16 Aug 2012 08:52:02 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 35952 invoked by uid 500); 16 Aug 2012 08:51:58 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 35926 invoked by uid 99); 16 Aug 2012 08:51:57 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Aug 2012 08:51:57 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of filippo@ntoklo.com designates 74.125.83.44 as permitted sender) Received: from [74.125.83.44] (HELO mail-ee0-f44.google.com) (74.125.83.44) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Aug 2012 08:51:51 +0000 Received: by eekb45 with SMTP id b45so701115eek.31 for ; Thu, 16 Aug 2012 01:51:30 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=date:from:to:message-id:in-reply-to:references:subject:x-mailer :mime-version:content-type:content-transfer-encoding :content-disposition:x-gm-message-state; bh=TcP2HMlMmMh4OVYVe8lwHsW5cK5DcozEXPlBACSktqI=; b=lHrYcG0bwmPEeg17jgPmQsi10CcOBamcP8505HQzfiNSH5LOTrPBqnkZx2ID6y3zhX uPCfS7hGSiUDoPADsu8pZlw33Qym7rI2J6+klBcjr+4ZEQmsYq2bxjvpkNgPIWtUo7w0 l8wTywWaukZNJehmDYUz0vdyrI9txw2UB1AUPsK9liIQErNw4yKOH1YCzMNwN19BCUhX J6+we0XsGhfdr3D80pV6b4qYseEcHnKBpwyNYbZwHqYJk9da8qa5i1SK1fboO1MQ7wXz trcRyhxXfq7ZPogEiNLobMPFAXdIMsyVXBXmV6EJWQP4ttvjdNSVlbF+oAvXRyBBHFkm l/iQ== Received: by 10.14.177.193 with SMTP id d41mr566926eem.19.1345107090689; Thu, 16 Aug 2012 01:51:30 -0700 (PDT) Received: from Filippos-MacBook.local ([80.71.29.65]) by mx.google.com with ESMTPS id u47sm10172670eeo.9.2012.08.16.01.51.29 (version=SSLv3 cipher=OTHER); Thu, 16 Aug 2012 01:51:29 -0700 (PDT) Date: Thu, 16 Aug 2012 09:51:27 +0100 From: Filippo Diotalevi To: user@cassandra.apache.org Message-ID: <95ECC1ABC58041C8A94503285E016C97@ntoklo.com> In-Reply-To: References: <0E1DF749F2C142B38EA75A47216970A9@ntoklo.com> Subject: Re: Migrating to a new cluster (using SSTableLoader or other approaches) X-Mailer: sparrow 1.5 (build 1043.1) MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline X-Gm-Message-State: ALoCoQmD8j6/Z7cafS4FGWeFpYFA1HMpTaJ6n9WxcXP85cOp8C9t4O5UZoGs7leagyx7g1TE8yzp > > ERROR 09:02:38,614 Error in ThreadPoolExecutor > > java.lang.RuntimeException: java.io.EO=46Exception: unable to seek to= position 93069003 in /opt/analytics/analytics/chart-hd-104-Data.db (6573= 7276 bytes) in read-only mode > =20 > =20 > This one looks like an error. > =20 > Can you run nodetool with DEBUG level logging and post the logs =3F =20 Thank Aaron. Which nodetool command are you referring to=3F (info, cfstats, ring,=E2=80= =A6.) Do I modify the log4j-tools.properties in =24CASSANDRA=5FHOME/conf to set= the nodetool logs to DEBUG=3F Thanks, -- =20 =46ilippo Diotalevi On Wednesday, 15 August 2012 at 22:53, aaron morton wrote: > > WARN 09:02:38,534 Unable to instantiate cache provider org.apache.cas= sandra.cache.SerializingCacheProvider; using default org.apache.cassandra= .cache.ConcurrentLinkedHashCacheProvider=405d59054d instead > =20 > Happens when JNA is not in the path. Nothing to worry about when using = the sstableloader. =20 > =20 > > ERROR 09:02:38,614 Error in ThreadPoolExecutor > > java.lang.RuntimeException: java.io.EO=46Exception: unable to seek to= position 93069003 in /opt/analytics/analytics/chart-hd-104-Data.db (6573= 7276 bytes) in read-only mode > =20 > This one looks like an error. =20 > =20 > Can you run nodetool with DEBUG level logging and post the logs =3F =20 > =20 > Cheers > =20 > ----------------- > Aaron Morton > =46reelance Developer > =40aaronmorton > http://www.thelastpickle.com > =20 > =20 > =20 > =20 > =20 > =20 > On 15/08/2012, at 9:32 PM, =46ilippo Diotalevi wrote: > > Hi, =20 > > we are trying to use SSTableLoader to bootstrap a new 7-node cassandr= a (v. 1.0.10) cluster with the snapshots taken from a 3-node cassandra cl= uster. The new cluster is in a different data centre. =20 > > =20 > > After reading the articles at > > =5B1=5D http://www.datastax.com/dev/blog/bulk-loading > > =5B2=5D http://geekswithblogs.net/johnsPerfBlog/archive/2011/07/26/ho= w-to-use-cassandrs-sstableloader.aspx > > =20 > > we are tried to follow this procedure =20 > > 1) we took a snapshot of our keyspaces in the old cluster and moved t= hem to the data folder of 3 of the new machines > > 2) started cassandra in the new cluster > > but we noticed that some column families were missing, other had miss= ing data. > > =20 > > After that we tried to use sstableloader > > 1) we reinstalled cassandra in the new cluster > > 2) run sstableloader (as explained in =5B2=5D) to load the keyspaces > > =20 > > SSTableLoader starts, but the progress is always 0 and the transfer r= ate is 0MB/s. Some warning and exceptions are present in the logs > > =20 > > ./sstableloader /opt/analytics/analytics/ > > Starting client (and waiting 30 seconds for gossip) ... > > Streaming revelant part of /opt/analytics/analytics/chart-hd-104-Data= .db /opt/analytics/analytics/chart-hd-105-Data.db /opt/analytics/analytic= s/chart-hd-106-Data.db /opt/analytics/analytics/chart-hd-107-Data.db /opt= /analytics/analytics/chart-hd-108-Data.db to =5B/1x.xx.xx.xx5, /1x.xx.xx.= xx7, /1x.xx.xx.xx0, /1x.xx.xx.xx7, /1x.xx.xx.xx3, /1x.xx.xx.xx8, /1x.xx.x= x.xx7=5D > > WARN 09:02:38,534 Unable to instantiate cache provider org.apache.cas= sandra.cache.SerializingCacheProvider; using default org.apache.cassandra= .cache.ConcurrentLinkedHashCacheProvider=405d59054d instead > > WARN 09:02:38,549 Unable to instantiate cache provider org.apache.cas= sandra.cache.SerializingCacheProvider; using default org.apache.cassandra= .cache.ConcurrentLinkedHashCacheProvider=405d59054d instead > > =20 > > =20 > > =5B=E2=80=A6.=5D > > ERROR 09:02:38,614 Error in ThreadPoolExecutor > > java.lang.RuntimeException: java.io.EO=46Exception: unable to seek to= position 93069003 in /opt/analytics/analytics/chart-hd-104-Data.db (6573= 7276 bytes) in read-only mode > > at org.apache.cassandra.utils.=46BUtilities.unchecked(=46BUtilities.j= ava:689) > > at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.jav= a:34) > > at java.util.concurrent.ThreadPoolExecutor=24Worker.runTask(ThreadPoo= lExecutor.java:886) > > at java.util.concurrent.ThreadPoolExecutor=24Worker.run(ThreadPoolExe= cutor.java:908) > > at java.lang.Thread.run(Thread.java:619) > > Caused by: java.io.EO=46Exception: unable to seek to position 9306900= 3 in /opt/analytics/analytics/chart-hd-104-Data.db (65737276 bytes) in re= ad-only mode > > at org.apache.cassandra.io.util.RandomAccessReader.seek(RandomAccessR= eader.java:253) > > at org.apache.cassandra.streaming.=46ileStreamTask.stream(=46ileStrea= mTask.java:136) > > at org.apache.cassandra.streaming.=46ileStreamTask.runMayThrow(=46ile= StreamTask.java:91) > > at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.jav= a:30) > > ... 3 more > > Exception in thread =22Streaming:1=22 java.lang.RuntimeException: jav= a.io.EO=46Exception: unable to seek to position 93069003 in /opt/analytic= s/analytics/chart-hd-104-Data.db (65737276 bytes) in read-only mode > > at org.apache.cassandra.utils.=46BUtilities.unchecked(=46BUtilities.j= ava:689) > > at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.jav= a:34) > > at java.util.concurrent.ThreadPoolExecutor=24Worker.runTask(ThreadPoo= lExecutor.java:886) > > at java.util.concurrent.ThreadPoolExecutor=24Worker.run(ThreadPoolExe= cutor.java:908) > > at java.lang.Thread.run(Thread.java:619) > > Caused by: java.io.EO=46Exception: unable to seek to position 9306900= 3 in /opt/analytics/analytics/chart-hd-104-Data.db (65737276 bytes) in re= ad-only mode > > at org.apache.cassandra.io.util.RandomAccessReader.seek(RandomAccessR= eader.java:253) > > at org.apache.cassandra.streaming.=46ileStreamTask.stream(=46ileStrea= mTask.java:136) > > at org.apache.cassandra.streaming.=46ileStreamTask.runMayThrow(=46ile= StreamTask.java:91) > > at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.jav= a:30) > > ... 3 more > > =20 > > =20 > > =20 > > =20 > > =20 > > --------------- > > =20 > > =20 > > What's the correct approach to migrate data from one cluster to anoth= er=3F How can I troubleshoot the problem with sstableloader=3F =20 > > =20 > > Thanks, > > -- =20 > > =46ilippo Diotalevi > =20