From: Atul Gosain
To: user@cassandra.apache.org
Date: Sun, 27 Jun 2010 19:23:49 +0530
Subject: Re: Map Reduce support

Thanks for the information, Drew and Jonathan.
Is there any difference in performance when using Pig compared to running MapReduce directly on the data store?
I will experiment with both of them in some time, though.

On Fri, Jun 25, 2010 at 5:46 PM, Drew Dahlke wrote:
> The cassandra column family input format will go over an entire
> column family, sending a slice of a row into a mapper at a time. From
> there, there's a lot you can do. As far as how you aggregate data
> together, I'd suggest experimenting with the latest version of Pig,
> which thankfully supports the new input format. It gives you a
> SQL-esque syntax for manipulating the data and is probably the easiest
> way to experiment.
>
> On Thu, Jun 24, 2010 at 11:01 AM, Atul Gosain wrote:
> > Hi
> > What kind of Map Reduce support is provided for Cassandra?
> > Can I get some columns from different rows and then aggregate them
> > together? It's basically aggregation of statistics for various devices
> > connected to a network manager. Is it the right kind of use case to be
> > supported by MR?
> > Thanks
> > Atul
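The aggregation asked about in this thread (pick some columns from many rows, then combine them) can be sketched in plain Python. This is only an illustration of the map/reduce pattern under stated assumptions: it does not use the Hadoop, Cassandra, or Pig APIs, and the device names, column names, and function names (`map_row`, `reduce_values`, `run_job`) are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample data: row key = device id, columns = statistic name -> value.
rows = {
    "device-1": {"bytes_in": 120, "bytes_out": 80, "errors": 1},
    "device-2": {"bytes_in": 300, "bytes_out": 150, "errors": 0},
    "device-3": {"bytes_in": 50, "bytes_out": 20, "errors": 4},
}


def map_row(row_key, columns, wanted):
    """Map step: emit (column name, value) for each wanted column in one row slice."""
    for name in wanted:
        if name in columns:
            yield name, columns[name]


def reduce_values(name, values):
    """Reduce step: aggregate every value emitted under one column name."""
    values = list(values)
    return {"count": len(values), "sum": sum(values), "max": max(values)}


def run_job(rows, wanted):
    """Run the map step over every row, group by key, then reduce each group."""
    grouped = defaultdict(list)
    for row_key, columns in rows.items():
        for name, value in map_row(row_key, columns, wanted):
            grouped[name].append(value)
    return {name: reduce_values(name, vals) for name, vals in grouped.items()}


totals = run_job(rows, ["bytes_in", "errors"])
print(totals)
```

In a real job the input format would feed row slices to the mappers and the framework would do the grouping; the sketch only shows the shape of the per-row and per-key logic.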