Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 37411 invoked from network); 3 Sep 2010 01:26:16 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 3 Sep 2010 01:26:16 -0000 Received: (qmail 45822 invoked by uid 500); 3 Sep 2010 01:26:14 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 45698 invoked by uid 500); 3 Sep 2010 01:26:14 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 45690 invoked by uid 99); 3 Sep 2010 01:26:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Sep 2010 01:26:14 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: unknown amxip4:204.200.197.195ip4:174.37.77.13ip4:174.37.77.14ip4:174.37.77.15ip4:67.228.191.123ip4:67.228.190.12ip4:174.36.43.26ip4:67.228.190.15ip4:204.200.197.196~all (athena.apache.org: encountered unrecognized mechanism during SPF processing of domain of cassandra@softwareprojects.com) Received: from [204.200.197.196] (HELO mx1.softwareprojects.com) (204.200.197.196) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 03 Sep 2010 01:26:06 +0000 Received: from [192.168.0.132] (bzq-84-109-23-242.red.bezeqint.net [84.109.23.242]) (authenticated bits=0) by mx1.softwareprojects.com (8.13.6.20060614/8.13.6) with ESMTP id o831PgHJ083989 for ; Fri, 3 Sep 2010 01:25:44 GMT Message-ID: <4C804E8E.1020105@softwareprojects.com> Date: Thu, 02 Sep 2010 21:25:34 -0400 From: Mike Peters User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.2.8) Gecko/20100802 Thunderbird/3.1.2 MIME-Version: 1.0 To: user@cassandra.apache.org Subject: 4k keyspaces... Maybe we're doing it wrong? Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Hi, We're in the process of migrating 4,000 MySQL client databases to Cassandra. All database schemas are identical. With MySQL, we used to provision a separate 'database' per each client, to make it easier to shard and move things around. Does it make sense to migrate the 4,000 MySQL databases to 4,000 keyspaces in Cassandra? Or should we stick with a single keyspace? My concerns are - #1. Will every single node end up with 4k folders under /cassandra/data/? #2. Performance: Will Cassandra work better with a single keyspace + lots of keys, or thousands of keyspaces? - Granted it's 'cleaner' to have a separate keyspace per each client, but maybe that's not the best approach with Cassandra. Thoughts?