Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DE6B817712 for ; Thu, 7 May 2015 19:07:29 +0000 (UTC) Received: (qmail 85769 invoked by uid 500); 7 May 2015 19:07:27 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 85729 invoked by uid 500); 7 May 2015 19:07:27 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 85718 invoked by uid 99); 7 May 2015 19:07:27 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 May 2015 19:07:27 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id C5D77C2A48 for ; Thu, 7 May 2015 19:07:26 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.002 X-Spam-Level: X-Spam-Status: No, score=-0.002 tagged_above=-999 required=6.31 tests=[RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id Gvtd5saCFrBj for ; Thu, 7 May 2015 19:07:26 +0000 (UTC) Received: from mail-pa0-f44.google.com (mail-pa0-f44.google.com [209.85.220.44]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 30E83249E9 for ; Thu, 7 May 2015 19:07:25 +0000 (UTC) Received: by pacyx8 with SMTP id yx8so47829925pac.1 for ; Thu, 07 May 2015 12:07:23 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:content-type:content-transfer-encoding :subject:message-id:date:to:mime-version; bh=CFPspzsmEF1R5iKAdDGcnvYcHJN7/8cGch1VhcT73mg=; b=K+pjTeX0VkojTOgsv0aiEc/hWr0ovTb5iAXkoileya4WUCkFp08l0unwyRuIou5PWm qk6QLcSu/KsRQ8KGOrTycs8sdGRn5dqIBNTW8ZONbCOGocHMRMkAq4CYcUlT3b85zyA/ 9cuxIWji36Q1EWpwIRico8iIZMMuH2zNzdzcFogOEQd9iMz0aC+GTxvy4rzx596WPzwI ySavp7LI5BvhtFIMo4mEKp+mEtQBNdg6tgWrg8GqjGui3oO4vNFzvhanOvy/f7vE1uG2 bi4oswgHx0P1A1OE/vKmCqNNv5dBRK/e0pnRMc0be4QgVQMPcZN8EYAo2o1/xl/Zq2v6 I9BA== X-Gm-Message-State: ALoCoQn8IIbZTjGA0qYHblZGLOw2JyOyLWH8aGppW7yQJqjBehET5AtOyP1IM2yzYohRaDnDPQIt X-Received: by 10.68.226.37 with SMTP id rp5mr130824pbc.21.1431025643592; Thu, 07 May 2015 12:07:23 -0700 (PDT) Received: from [192.168.32.55] (c-76-102-54-167.hsd1.ca.comcast.net. [76.102.54.167]) by mx.google.com with ESMTPSA id e5sm2888567pdc.94.2015.05.07.12.07.22 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Thu, 07 May 2015 12:07:22 -0700 (PDT) From: Jeff Ferland Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Subject: Offline Compaction and Token Splitting Message-Id: <34A710B2-F95D-42E9-87EB-C504DC737E40@tubularlabs.com> Date: Thu, 7 May 2015 12:07:21 -0700 To: user@cassandra.apache.org Mime-Version: 1.0 (Mac OS X Mail 8.2 \(2098\)) X-Mailer: Apple Mail (2.2098) I have an ideal for backups in my mind with Cassandra to dump each = columnfamily to a directory and use an offline process to compact them = all into one sstable (or max sstable size set). I have an ideal for = restoration which involves a streaming read an sstable set and output = based on whether the data fits within a token range. The result of this = is that I can store a single copy of data that is effectively already = repaired and can read from the specific range that covers a node that I = wish to restore. My first look at this was somewhat frustrated by = sstable code in the current versions have a strong reliance on the = system keyspace. Does anybody have any thoughts in regards to other things that might = exist and fulfill this (particularly offline collective compaction), = have a desire for such tools, or have any useful information for me = before I attempt to build such beasts? -Jeff=