Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 9FFBDD93F for ; Wed, 15 Aug 2012 09:32:44 +0000 (UTC) Received: (qmail 68825 invoked by uid 500); 15 Aug 2012 09:32:42 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 68546 invoked by uid 500); 15 Aug 2012 09:32:41 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 68517 invoked by uid 99); 15 Aug 2012 09:32:40 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Aug 2012 09:32:40 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=FSL_RCVD_USER,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of filippo@ntoklo.com designates 209.85.215.172 as permitted sender) Received: from [209.85.215.172] (HELO mail-ey0-f172.google.com) (209.85.215.172) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 15 Aug 2012 09:32:35 +0000 Received: by eaai11 with SMTP id i11so402277eaa.31 for ; Wed, 15 Aug 2012 02:32:13 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=date:from:to:message-id:subject:x-mailer:mime-version:content-type :content-transfer-encoding:content-disposition:x-gm-message-state; bh=dn6bZS79oIsr5pOTzCasxumN10p7VVHdacSAFYRrYfI=; b=ntOMszJb105Zf2jTLhFHPwiYd7DzpCCicgd+HcJIxMYPpE/4KoKn5QMGCTF83vDB78 yhfCZ6Ey/vKf0saUkjuTEZQ2LhRyMtSAAm582pjk1BU8GExHYwF1GFhyYoslXJ97gRLv ZqrRAoXE6cNIO1VoAaJBWLIB5Iy+sdR1kb5xoSUjw9IsBOoHdefFut9a4J+uZSOc3ix8 kaueFWAz6TMEiggWFtNGXApPGmy1ah3tyAmsy/egoy9aLxI6fjbrb35+yr5fTxOwzitu WtgOX3Hbl7V7pzbv7KVsIBeTKSdjddzoD33gHLrGZQFNmw5tDw/fsdawVes2uL3DZ/wc kGCw== Received: by 10.14.215.193 with SMTP id e41mr14395388eep.44.1345023133713; Wed, 15 Aug 2012 02:32:13 -0700 (PDT) Received: from Filippos-MacBook.local ([80.71.29.65]) by mx.google.com with ESMTPS id y1sm2595503eel.0.2012.08.15.02.32.10 (version=SSLv3 cipher=OTHER); Wed, 15 Aug 2012 02:32:11 -0700 (PDT) Date: Wed, 15 Aug 2012 10:32:09 +0100 From: Filippo Diotalevi To: user@cassandra.apache.org Message-ID: <0E1DF749F2C142B38EA75A47216970A9@ntoklo.com> Subject: Migrating to a new cluster (using SSTableLoader or other approaches) X-Mailer: sparrow 1.5 (build 1043.1) MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline X-Gm-Message-State: ALoCoQnbCCOZnSKNHAUB0ziKJaDr5Kvuo273yceZvaUAoGXKOaVz5M6QEsCD1NUYezrz0u2kYdHY X-Virus-Checked: Checked by ClamAV on apache.org Hi, =20 we are trying to use SSTableLoader to bootstrap a new 7-node cassandra (v= . 1.0.10) cluster with the snapshots taken from a 3-node cassandra cluste= r. The new cluster is in a different data centre. =20 After reading the articles at =5B1=5D http://www.datastax.com/dev/blog/bulk-loading =5B2=5D http://geekswithblogs.net/johnsPerfBlog/archive/2011/07/26/how-to= -use-cassandrs-sstableloader.aspx we are tried to follow this procedure =20 1) we took a snapshot of our keyspaces in the old cluster and moved them = to the data folder of 3 of the new machines 2) started cassandra in the new cluster but we noticed that some column families were missing, other had missing = data. After that we tried to use sstableloader 1) we reinstalled cassandra in the new cluster 2) run sstableloader (as explained in =5B2=5D) to load the keyspaces SSTableLoader starts, but the progress is always 0 and the transfer rate = is 0MB/s. Some warning and exceptions are present in the logs ./sstableloader /opt/analytics/analytics/ Starting client (and waiting 30 seconds for gossip) ... Streaming revelant part of /opt/analytics/analytics/chart-hd-104-Data.db = /opt/analytics/analytics/chart-hd-105-Data.db /opt/analytics/analytics/ch= art-hd-106-Data.db /opt/analytics/analytics/chart-hd-107-Data.db /opt/ana= lytics/analytics/chart-hd-108-Data.db to =5B/1x.xx.xx.xx5, /1x.xx.xx.xx7,= /1x.xx.xx.xx0, /1x.xx.xx.xx7, /1x.xx.xx.xx3, /1x.xx.xx.xx8, /1x.xx.xx.xx= 7=5D WARN 09:02:38,534 Unable to instantiate cache provider org.apache.cassand= ra.cache.SerializingCacheProvider; using default org.apache.cassandra.cac= he.ConcurrentLinkedHashCacheProvider=405d59054d instead WARN 09:02:38,549 Unable to instantiate cache provider org.apache.cassand= ra.cache.SerializingCacheProvider; using default org.apache.cassandra.cac= he.ConcurrentLinkedHashCacheProvider=405d59054d instead =5B=E2=80=A6.=5D ERROR 09:02:38,614 Error in ThreadPoolExecutor java.lang.RuntimeException: java.io.EO=46Exception: unable to seek to pos= ition 93069003 in /opt/analytics/analytics/chart-hd-104-Data.db (65737276= bytes) in read-only mode at org.apache.cassandra.utils.=46BUtilities.unchecked(=46BUtilities.java:= 689) at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:34= ) at java.util.concurrent.ThreadPoolExecutor=24Worker.runTask(ThreadPoolExe= cutor.java:886) at java.util.concurrent.ThreadPoolExecutor=24Worker.run(ThreadPoolExecuto= r.java:908) at java.lang.Thread.run(Thread.java:619) Caused by: java.io.EO=46Exception: unable to seek to position 93069003 in= /opt/analytics/analytics/chart-hd-104-Data.db (65737276 bytes) in read-o= nly mode at org.apache.cassandra.io.util.RandomAccessReader.seek(RandomAccessReade= r.java:253) at org.apache.cassandra.streaming.=46ileStreamTask.stream(=46ileStreamTas= k.java:136) at org.apache.cassandra.streaming.=46ileStreamTask.runMayThrow(=46ileStre= amTask.java:91) at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30= ) ... 3 more Exception in thread =22Streaming:1=22 java.lang.RuntimeException: java.io= .EO=46Exception: unable to seek to position 93069003 in /opt/analytics/an= alytics/chart-hd-104-Data.db (65737276 bytes) in read-only mode at org.apache.cassandra.utils.=46BUtilities.unchecked(=46BUtilities.java:= 689) at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:34= ) at java.util.concurrent.ThreadPoolExecutor=24Worker.runTask(ThreadPoolExe= cutor.java:886) at java.util.concurrent.ThreadPoolExecutor=24Worker.run(ThreadPoolExecuto= r.java:908) at java.lang.Thread.run(Thread.java:619) Caused by: java.io.EO=46Exception: unable to seek to position 93069003 in= /opt/analytics/analytics/chart-hd-104-Data.db (65737276 bytes) in read-o= nly mode at org.apache.cassandra.io.util.RandomAccessReader.seek(RandomAccessReade= r.java:253) at org.apache.cassandra.streaming.=46ileStreamTask.stream(=46ileStreamTas= k.java:136) at org.apache.cassandra.streaming.=46ileStreamTask.runMayThrow(=46ileStre= amTask.java:91) at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:30= ) ... 3 more --------------- What's the correct approach to migrate data from one cluster to another=3F= How can I troubleshoot the problem with sstableloader=3F =20 Thanks, -- =20 =46ilippo Diotalevi