From: shiv shivaji
Date: Fri, 5 Mar 2010 11:42:02 -0800 (PST)
To: cassandra-user@incubator.apache.org
Subject: Dynamically Switching from Ordered Partitioner to Random?

I started with the ordered partitioner because I was hoping to make use of the map-reduce functionality. However, my data likely ended up lumped onto two key machines, with most of it on one (as seen in another thread; machine failures were also partly to blame for the uneven distribution). One solution I am trying is to load balance. Is there anything else I can try, in particular a way to convert the partitioner to random on a live system?

I know this sounds like an odd request, but I am curious about my options. I did see a post mentioning that one can compute the MD5 hash of each key, insert using that hash, and keep a mapping table from key to MD5 hash. Unfortunately, the data is already loaded using the ordered partitioner, and I was wondering if there is a way to switch to random now.
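
For reference, the hashed-key idea from that post could be sketched roughly as below. This is only a minimal illustration, not a Cassandra API: plain Python dicts stand in for the data and mapping tables, and the names `md5_key` and `HashedKeyStore` are hypothetical.

```python
import hashlib


def md5_key(key: str) -> str:
    """Return the hex MD5 digest of a row key (32 hex characters)."""
    return hashlib.md5(key.encode("utf-8")).hexdigest()


class HashedKeyStore:
    """Toy store: rows are written under MD5-hashed keys, and a separate
    mapping table remembers which hash belongs to which original key."""

    def __init__(self):
        self.rows = {}     # hashed key -> row data (stand-in for the data table)
        self.key_map = {}  # original key -> hashed key (stand-in for the mapping table)

    def insert(self, key: str, value):
        hashed = md5_key(key)
        self.rows[hashed] = value
        self.key_map[key] = hashed
        return hashed

    def get(self, key: str):
        # Look up the hash through the mapping table, then fetch the row.
        return self.rows[self.key_map[key]]

    def keys_in_order(self):
        # The original keys remain available in sorted order via the mapping table.
        return sorted(self.key_map)


store = HashedKeyStore()
store.insert("user:1", {"name": "a"})
store.insert("user:2", {"name": "b"})
print(store.get("user:2"))
print(store.keys_in_order())
```

Since MD5 output is close to uniform, rows written under the hashed keys should spread more evenly across an ordered key space. This only applies to newly inserted rows; it does nothing for data already loaded under the original keys.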

Shiv