Return-Path: Delivered-To: apmail-incubator-cassandra-user-archive@minotaur.apache.org Received: (qmail 7708 invoked from network); 2 Feb 2010 01:33:27 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 2 Feb 2010 01:33:27 -0000 Received: (qmail 19209 invoked by uid 500); 2 Feb 2010 01:33:26 -0000 Delivered-To: apmail-incubator-cassandra-user-archive@incubator.apache.org Received: (qmail 19194 invoked by uid 500); 2 Feb 2010 01:33:26 -0000 Mailing-List: contact cassandra-user-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: cassandra-user@incubator.apache.org Delivered-To: mailing list cassandra-user@incubator.apache.org Received: (qmail 19185 invoked by uid 99); 2 Feb 2010 01:33:26 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Feb 2010 01:33:26 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of cagatay.kavukcuoglu@gmail.com designates 209.85.219.212 as permitted sender) Received: from [209.85.219.212] (HELO mail-ew0-f212.google.com) (209.85.219.212) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 02 Feb 2010 01:33:18 +0000 Received: by ewy4 with SMTP id 4so107380ewy.27 for ; Mon, 01 Feb 2010 17:32:58 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:reply-to:in-reply-to :references:from:date:message-id:subject:to:content-type :content-transfer-encoding; bh=es7kh1xW887r4+ewF8YguqK7ezQbiuuZTfNuHb9l0Pk=; b=SwQSzrJjgPmtnOSTj+lNWRmmrUgLjeHyc4ZLxUsr0ZbZ82FcQCsfFdnVtgUXFnUb9x sRXJF6KPIb4TQgnVC7LLDfldZ3NcY7vV0Xclut8DMp+o90ptPUoWHXsJmGNXJzo4LGNT W/c186Wltqumf2XwJUFIffx1c+WG6Z7iehRpI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:reply-to:in-reply-to:references:from:date:message-id :subject:to:content-type:content-transfer-encoding; b=iXyuy36kxzElOAh6JQxsrU5zXHC9ZdXJEXodkVfRcuoXKxrLRgLKNm8IRrs76YgLdm K30QIrdN39kuMdXrz0j9pnuwuqMrEh4xA/Ct1mlAvZxh1FL/4PGiGQZirGhyj7k91+e6 YlI6xuUCqvH2dT2AC7zjrRxahUf1DlEbsY2kQ= MIME-Version: 1.0 Received: by 10.216.89.129 with SMTP id c1mr3665682wef.35.1265074378227; Mon, 01 Feb 2010 17:32:58 -0800 (PST) Reply-To: cagatay@kavukcuoglu.org In-Reply-To: References: <35b8f8001002011445s3eea029j72cf7953761e1946@mail.gmail.com> From: Cagatay Kavukcuoglu Date: Mon, 1 Feb 2010 20:32:38 -0500 Message-ID: <35b8f8001002011732s5303275cga913eae501f30810@mail.gmail.com> Subject: Re: Recommended way to do parallel reads of a large column slice? To: cassandra-user@incubator.apache.org Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org A large column slice in my case is tens of thousands of columns, each a few K's in size and independent in processing from others. My plan was to read slices of a few hundred to a thousand columns and process them in a pipeline for reduced overall latency. Regardless of my specific case, though, I thought one of the best ways to get good performance scaling in Cassandra was to distribute reads and writes to multiple nodes. Are there situations where that's not a good idea? CK. On Mon, Feb 1, 2010 at 6:00 PM, Jonathan Ellis wrote: > No. =C2=A0Why do you want to do multiple parallel reads instead of one > sequential read? > > On Mon, Feb 1, 2010 at 4:45 PM, Cagatay Kavukcuoglu > wrote: >> Hi, >> >> What's the recommended way to do parallel reads of a large slice of >> columns when one doesn't know enough about the column names to divide >> them for parallel reading in a meaningful way? SliceRange allows >> setting the start and finish column names, but you wouldn't be able to >> set the start field of the next read until the previous read >> completed. An offset field for the SliceRange would have worked, but I >> don't see it. Is there a way to divide the big read query into >> multiple *parallel* small read queries without requiring advance >> knowledge of the column names? >> >> CK. >> >