Date: Sun, 21 Jun 2015 23:25:48 -0400
Subject: Create a smaller cluster based on snapshots
From: John Wong
To: user@cassandra.apache.org

Hi.

Suppose I have a 6-node cluster running and I want to build a 3-node cluster based on it. What is the recommended way to build such a cluster quickly? Each node holds about 120 GB and we have RF=3. We are on Cassandra 1.2.19 and we are not using vnodes.

My initial research shows it can be done either with sstableloader or by restoring from snapshots and fixing the token ranges.

In the case of sstableloader, since it streams data and we would be restoring from a live server, this seems likely to be a slow process if we throttle the traffic. Even if I take this route, do I just pick any 3 of the 6 nodes, in any order?

In the case of restoring from snapshots, I have previously restored a 6-node replica just by copying the snapshot files (along with the schema files) and running nodetool refresh, which should complete in a few hours. But now, with a smaller replica, do I again just pick snapshots from any 3 nodes? And what do I need to fix in the token ranges, and why (from what I have read)?

Any feedback is appreciated.

Thanks.

John
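
For context on the token question above, here is a minimal sketch of the arithmetic usually meant by assigning tokens on a non-vnode ring: evenly spaced initial_token values for the 3 target nodes. The partitioner is not stated in this message, so both Murmur3Partitioner and RandomPartitioner are shown as assumptions. balanced_tokens is a hypothetical helper name, not a Cassandra tool.

```python
# Sketch only: evenly spaced tokens for a non-vnode ring.
# Assumption: the cluster uses either Murmur3Partitioner (tokens in
# [-2**63, 2**63 - 1]) or RandomPartitioner (tokens in [0, 2**127 - 1]).


def balanced_tokens(node_count, partitioner="murmur3"):
    """Return one evenly spaced token per node (hypothetical helper)."""
    if node_count <= 0:
        raise ValueError("node_count must be positive")
    if partitioner == "murmur3":
        ring_size, offset = 2 ** 64, -(2 ** 63)
    elif partitioner == "random":
        ring_size, offset = 2 ** 127, 0
    else:
        raise ValueError("unknown partitioner: %s" % partitioner)
    # Integer division keeps every token an exact integer.
    return [offset + (i * ring_size) // node_count for i in range(node_count)]


if __name__ == "__main__":
    for name in ("murmur3", "random"):
        print(name)
        for token in balanced_tokens(3, name):
            print("  initial_token: %d" % token)
```

Each printed value would go into initial_token in cassandra.yaml on one of the 3 new nodes. Whether snapshot files copied from any 3 of the 6 source nodes cover these ranges is a separate question that this sketch does not answer.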