Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ADEE010084 for ; Wed, 25 Sep 2013 11:07:00 +0000 (UTC) Received: (qmail 49518 invoked by uid 500); 25 Sep 2013 11:01:11 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 49443 invoked by uid 500); 25 Sep 2013 11:00:51 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 49238 invoked by uid 99); 25 Sep 2013 11:00:01 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Sep 2013 11:00:01 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of chris.wirt@struq.com designates 209.85.215.180 as permitted sender) Received: from [209.85.215.180] (HELO mail-ea0-f180.google.com) (209.85.215.180) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 25 Sep 2013 10:59:54 +0000 Received: by mail-ea0-f180.google.com with SMTP id h10so3117516eaj.11 for ; Wed, 25 Sep 2013 03:59:34 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:to:references:in-reply-to:subject:date :message-id:mime-version:content-type:thread-index:content-language; bh=pNeBvTX8IrGi1U9zISq4CpBZTZ1fvroUs9uyF3ujqMc=; b=TlPX2CEYdW8pbHt/vin+mEp7G0L2uRAqPwYoK6qVCJaHahTeJkQn4MBJCrFRA1HVYd u3ERB69kuwqfPGtwu9Rooloz01s32psjCkUORRaws5Ri0hCo8GI+YevDZb5H90F8xQuN GQNfMIklGTajzLAeSn51fgDR4h2EBPBSUD75LM1KVvr256tx+5y6c3U8SPGGwgufErhq lObXGmBQJBw0vFP0YhGEHR6PkBwJOVp4fVmJ2UdyZhSZF0yUkMkiD78Bq6Ph6yFQb8SJ 76bcvh2RCwRkwMJWKuVmfmrtxUwQewFDMakHCQndzb8SXTObSyxIBGPmVG8UYiRL8Zl+ /kOQ== X-Gm-Message-State: ALoCoQmzUzBBGq/Mo6ThJkF5HlAGiMZsMIPQRwL7G1fVoHfypUlT1f3sUZXImBRMGdo+a2ySFa0X X-Received: by 10.14.109.201 with SMTP id s49mr2966160eeg.54.1380106773851; Wed, 25 Sep 2013 03:59:33 -0700 (PDT) Received: from StevePereiraPC (host81-133-200-21.in-addr.btopenworld.com. [81.133.200.21]) by mx.google.com with ESMTPSA id v8sm14057194eeo.12.1969.12.31.16.00.00 (version=TLSv1 cipher=RC4-SHA bits=128/128); Wed, 25 Sep 2013 03:59:33 -0700 (PDT) From: "Christopher Wirt" To: References: <002401ceb980$85a26d10$90e74730$@struq.com> In-Reply-To: Subject: RE: 1.2.10 -> 2.0.1 migration issue Date: Wed, 25 Sep 2013 11:59:32 +0100 Message-ID: <009601ceb9de$50fa43e0$f2eecba0$@struq.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_NextPart_000_0097_01CEB9E6.B2C40310" X-Mailer: Microsoft Outlook 14.0 Thread-Index: AQKNSJKDQv/mVsrXuuRf3wXL9wFexAKNaK5xAl5wT7yYMaatkA== Content-Language: en-gb X-Virus-Checked: Checked by ClamAV on apache.org This is a multipart message in MIME format. ------=_NextPart_000_0097_01CEB9E6.B2C40310 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Hi Marcus, Thanks for having a look at this. Just noticed this in the NEWS.txt For leveled compaction users, 2.0 must be atleast started before upgrading to 2.1 due to the fact that the old JSON leveled manifest is migrated into the sstable metadata files on startup in 2.0 and this code is gone from 2.1. Basically, my fault for skimming over this too quickly. We will move from 1.2.10 -> 2.0 -> 2.1 Thanks, Chris From: Marcus Eriksson [mailto:krummas@gmail.com] Sent: 25 September 2013 09:37 To: user@cassandra.apache.org Subject: Re: 1.2.10 -> 2.0.1 migration issue cant really reproduce, could you update the ticket with a bit more info about your setup? do you have multiple .json files in your data dirs? On Wed, Sep 25, 2013 at 10:07 AM, Marcus Eriksson wrote: this is most likely a bug, filed https://issues.apache.org/jira/browse/CASSANDRA-6093 and will try to have a look today. On Wed, Sep 25, 2013 at 1:48 AM, Christopher Wirt wrote: Hi, Just had a go at upgrading a node to the latest stable c* 2 release and think I ran into some issues with manifest migration. On initial start up I hit this error as it starts to load the first of my CF. INFO [main] 2013-09-24 22:56:01,018 LegacyLeveledManifest.java (line 89) Migrating manifest for struqrealtime/impressionstorev2 INFO [main] 2013-09-24 22:56:01,019 LegacyLeveledManifest.java (line 119) Snapshotting struqrealtime, impressionstorev2 to pre-sstablemetamigration ERROR [main] 2013-09-24 22:56:01,030 CassandraDaemon.java (line 459) Exception encountered during startup FSWriteError in /disk1/cassandra/data/struqrealtime/impressionstorev2/snapshots/pre-sstablem etamigration/impressionstorev2.json at org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:83) at org.apache.cassandra.db.compaction.LegacyLeveledManifest.snapshotWithoutCFS( LegacyLeveledManifest.java:138) at org.apache.cassandra.db.compaction.LegacyLeveledManifest.migrateManifests(Le gacyLeveledManifest.java:91) at org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:246) at org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:4 42) at org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:485) Caused by: java.nio.file.NoSuchFileException: /disk1/cassandra/data/struqrealtime/impressionstorev2/snapshots/pre-sstablem etamigration/impressionstorev2.json -> /disk1/cassandra/data/struqrealtime/impressionstorev2/impressionstorev2.json at sun.nio.fs.UnixException.translateToIOException(UnixException.java:86) at sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:102) at sun.nio.fs.UnixFileSystemProvider.createLink(UnixFileSystemProvider.java:474 ) at java.nio.file.Files.createLink(Files.java:1037) at org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:79) ... 5 more I had already successful run a test migration on our dev server. Only real difference I can see if the number of data directories defined and the amount of data being held. I've run upgradesstables under 1.2.10. I have always been using vnodes and CQL3. I recently moved to using LZ4 instead of Snappy.. I tried to startup again and it gave me a slightly different error INFO [main] 2013-09-24 22:58:28,218 LegacyLeveledManifest.java (line 89) Migrating manifest for struqrealtime/impressionstorev2 INFO [main] 2013-09-24 22:58:28,218 LegacyLeveledManifest.java (line 119) Snapshotting struqrealtime, impressionstorev2 to pre-sstablemetamigration ERROR [main] 2013-09-24 22:58:28,222 CassandraDaemon.java (line 459) Exception encountered during startup java.lang.RuntimeException: Tried to create duplicate hard link to /disk3/cassandra/data/struqrealtime/impressionstorev2/snapshots/pre-sstablem etamigration/struqrealtime-impressionstorev2-ic-1030-TOC.txt at org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:71) at org.apache.cassandra.db.compaction.LegacyLeveledManifest.snapshotWithoutCFS( LegacyLeveledManifest.java:129) at org.apache.cassandra.db.compaction.LegacyLeveledManifest.migrateManifests(Le gacyLeveledManifest.java:91) at org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:246) at org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:4 42) at org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:485) Will have a go recreating this tomorrow. Any insight or guesses at what the issue might be are always welcome. Thanks, Chris ------=_NextPart_000_0097_01CEB9E6.B2C40310 Content-Type: text/html; charset="us-ascii" Content-Transfer-Encoding: quoted-printable

Hi Marcus,

Thanks for having a look at this.

 

Just noticed this in the NEWS.txt

 

For leveled compaction = users, 2.0 must be atleast = started before

     upgrading to 2.1 due to the fact that the old = JSON leveled

     manifest is migrated into the sstable metadata files on = startup

     in 2.0 and this code is gone from = 2.1.

 

Basically, my fault for skimming over this too quickly. =

 

We will move from 1.2.10 -> 2.0 -> 2.1

 

Thanks,

Chris

 

 

From:= Marcus = Eriksson [mailto:krummas@gmail.com]
Sent: 25 September 2013 = 09:37
To: user@cassandra.apache.org
Subject: Re: = 1.2.10 -> 2.0.1 migration issue

 

cant = really reproduce, could you update the ticket with a bit more info about = your setup?

 

do you have multiple .json files in your data = dirs?

 

On Wed, Sep 25, 2013 at 10:07 AM, Marcus Eriksson = <krummas@gmail.com> wrote:

 

On Wed, Sep 25, 2013 at 1:48 AM, Christopher Wirt = <chris.wirt@struq.com> = wrote:

Hi,

 <= /o:p>

Just had a = go at upgrading a node to the latest stable c* 2 release and think I ran = into some issues with manifest migration.

 <= /o:p>

On initial = start up I hit this error as it starts to load the first of my CF. =

 <= /o:p>

INFO [main] = 2013-09-24 22:56:01,018 LegacyLeveledManifest.java (line 89) Migrating = manifest for struqrealtime/impressionstorev2

INFO [main] = 2013-09-24 22:56:01,019 LegacyLeveledManifest.java (line 119) = Snapshotting struqrealtime, impressionstorev2 to = pre-sstablemetamigration

ERROR = [main] 2013-09-24 22:56:01,030 CassandraDaemon.java (line 459) Exception = encountered during startup

FSWriteError= in = /disk1/cassandra/data/struqrealtime/impressionstorev2/snapshots/pre-sstab= lemetamigration/impressionstorev2.json

  =       at = org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:83)<= o:p>

  =       at = org.apache.cassandra.db.compaction.LegacyLeveledManifest.snapshotWithoutC= FS(LegacyLeveledManifest.java:138)

  =       at = org.apache.cassandra.db.compaction.LegacyLeveledManifest.migrateManifests= (LegacyLeveledManifest.java:91)

  =       at = org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:2= 46)

  =       at = org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.jav= a:442)

  =       at = org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:48= 5)

Caused by: = java.nio.file.NoSuchFileException: = /disk1/cassandra/data/struqrealtime/impressionstorev2/snapshots/pre-sstab= lemetamigration/impressionstorev2.json -> = /disk1/cassandra/data/struqrealtime/impressionstorev2/impressionstorev2.j= son

  =       at = sun.nio.fs.UnixException.translateToIOException(UnixException.java:86)

  =       at = sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:102)

  =       at = sun.nio.fs.UnixFileSystemProvider.createLink(UnixFileSystemProvider.java:= 474)

  =       at = java.nio.file.Files.createLink(Files.java:1037)

  =       at = org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:79)<= o:p>

  =       ... 5 more

 <= /o:p>

I had = already successful run a test migration on our dev server. Only real = difference I can see if the number of data directories defined and the = amount of data being held.

 <= /o:p>

I’ve = run upgradesstables under 1.2.10. I have always been using vnodes and = CQL3. I recently moved to using LZ4 instead of Snappy..

 <= /o:p>

I tried to = startup again and it gave me a slightly different error

 <= /o:p>

INFO [main] = 2013-09-24 22:58:28,218 LegacyLeveledManifest.java (line 89) Migrating = manifest for struqrealtime/impressionstorev2

INFO [main] = 2013-09-24 22:58:28,218 LegacyLeveledManifest.java (line 119) = Snapshotting struqrealtime, impressionstorev2 to = pre-sstablemetamigration

ERROR = [main] 2013-09-24 22:58:28,222 CassandraDaemon.java (line 459) Exception = encountered during startup

java.lang.Ru= ntimeException: Tried to create duplicate hard link to = /disk3/cassandra/data/struqrealtime/impressionstorev2/snapshots/pre-sstab= lemetamigration/struqrealtime-impressionstorev2-ic-1030-TOC.txt

  =       at = org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:71)<= o:p>

  =       at = org.apache.cassandra.db.compaction.LegacyLeveledManifest.snapshotWithoutC= FS(LegacyLeveledManifest.java:129)

  =       at = org.apache.cassandra.db.compaction.LegacyLeveledManifest.migrateManifests= (LegacyLeveledManifest.java:91)

  =       at = org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:2= 46)

  =       at = org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.jav= a:442)

  =       at = org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:48= 5)

 <= /o:p>

Will have a = go recreating this tomorrow.

 <= /o:p>

Any insight = or guesses at what the issue might be are always = welcome.

 <= /o:p>

Thanks,=

Chris

 

 

------=_NextPart_000_0097_01CEB9E6.B2C40310--