Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F04C4D84A for ; Tue, 9 Oct 2012 01:36:15 +0000 (UTC) Received: (qmail 41224 invoked by uid 500); 9 Oct 2012 01:36:13 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 41205 invoked by uid 500); 9 Oct 2012 01:36:13 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 41197 invoked by uid 99); 9 Oct 2012 01:36:13 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 09 Oct 2012 01:36:13 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of mishra.vivs@gmail.com designates 209.85.220.44 as permitted sender) Received: from [209.85.220.44] (HELO mail-pa0-f44.google.com) (209.85.220.44) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 09 Oct 2012 01:36:07 +0000 Received: by mail-pa0-f44.google.com with SMTP id fb11so4572406pad.31 for ; Mon, 08 Oct 2012 18:35:45 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=yC9574bZ8KI/xDxeflxsFD9C1EOUojpE2DSHRxujoYw=; b=O0V/8x2lx9wIYzZIEC50yBC2aIeEcGRBJ7VD/ox8LIEOoea5UFC1+xgI5Ie3jOscsp yrMRVZVZx4G6fORBUaF5B9PWiKYYvYSXoGTvGTnA8KE3/TM5ZUBw9Akymg0Zrh5dFrK8 N6glF/n2tZveMCVSjJTe87K+wSRa1Y52N3EUFxpBL1dTFvd5qaKBd0kBtLM4mljcWTD0 N+ziOCyqy/O/zbhC3GblZiJrxIgWyoKaKh/T0VnaINhPQ0Ap2sqTqu0uVuM43nEnCV5R gzStKi2jeJ5mN+yAtXqPsDARSgWDo4Sp5gfJN7qjQN8N31n1/Ux4Kn7PtfvEgSomMoZI mFqQ== MIME-Version: 1.0 Received: by 10.68.225.199 with SMTP id rm7mr59058825pbc.150.1349746545499; Mon, 08 Oct 2012 18:35:45 -0700 (PDT) Received: by 10.66.10.71 with HTTP; Mon, 8 Oct 2012 18:35:45 -0700 (PDT) In-Reply-To: <4C5F6A03-6FFE-48EE-BC9C-DC322859580C@thelastpickle.com> References: <4C5F6A03-6FFE-48EE-BC9C-DC322859580C@thelastpickle.com> Date: Tue, 9 Oct 2012 07:05:45 +0530 Message-ID: Subject: Re: Query over secondary indexes From: Vivek Mishra To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=047d7b10cc6b619ec804cb965b5f --047d7b10cc6b619ec804cb965b5f Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable It was on 1 node and there is no error in server logs. -Vivek On Tue, Oct 9, 2012 at 1:21 AM, aaron morton wrote= : > get User where user_name =3D 'Vivek', it is taking ages to retrieve that >> data. Is there anything i am doing wrong? >> > How long is ages and how many nodes do you have? > Are there any errors in server logs ? > > When you do a get by secondary index at a CL higher than ONE ever RFth > node is involved. > > Cheers > > > ----------------- > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 5/10/2012, at 10:20 PM, Vivek Mishra wrote: > > Thanks Rishabh. But i want to search over duplicate columns only. > > -Vivek > > On Fri, Oct 5, 2012 at 2:45 PM, Rishabh Agrawal < > rishabh.agrawal@impetus.co.in> wrote: > >> Try making *user_name* a primary key in combination with some other >> unique column and see if results are improving. >> >> -Rishabh >> >> *From:* Vivek Mishra [mailto:mishra.vivs@gmail.com] >> *Sent:* Friday, October 05, 2012 2:35 PM >> *To:* user@cassandra.apache.org >> *Subject:* Query over secondary indexes >> >> >> I have a column family "User" which is having a indexed column >> "user_name". My schema is having around 0.1 million records only and >> user_name is duplicated across all rows. >> >> Now when i am trying to retrieve it as: >> >> get User where user_name =3D 'Vivek', it is taking ages to retrieve that >> data. Is there anything i am doing wrong? >> >> Also, i tried get_indexed_slices via Thrift API by setting >> IndexClause.setCount(1), still no luck, it got hang and not even return= ing >> a single result. I believe 0.1 million is not a huge amount of data. >> >> >> Cassandra version : 1.1.2 >> >> Any idea? >> >> >> -Vivek >> >> ------------------------------ >> >> Impetus Ranked in the Top 50 India=92s Best Companies to Work For 2012. >> >> Impetus webcast =91Designing a Test Automation Framework for Multi-vendo= r >> Interoperable Systems=92 available at http://lf1.me/0E/. >> >> >> NOTE: This message may contain information that is confidential, >> proprietary, privileged or otherwise protected by law. The message is >> intended solely for the named addressee. If received in error, please >> destroy and notify the sender. Any use of this email is prohibited when >> received in error. Impetus does not represent, warrant and/or guarantee, >> that the integrity of this communication has been maintained nor that th= e >> communication is free of errors, virus, interception or interference. >> > > > --047d7b10cc6b619ec804cb965b5f Content-Type: text/html; charset=windows-1252 Content-Transfer-Encoding: quoted-printable It was on 1 node and there is no error in server logs.

-= Vivek

On Tue, Oct 9, 2012 at 1:21 AM, aar= on morton <aaron@thelastpickle.com> wrote:

get User where user_name =3D 'Vivek', it is taking ages to retr= ieve that data. Is there anything i am doing wrong?

How long is ages and how many nodes do = you have?
Are there any errors in server logs ?

When you do a get by secondary index at a CL higher than ONE ever R= Fth node is involved.=A0

Cheers


<= div style=3D"word-wrap:break-word">
-----------------
Aaron Morton
Freelance Deve= loper
@aaronmorton

On 5/10/2012, at 10:20 PM, Vivek Mishra <mishra.vivs@gmail.com> wro= te:

Thanks Rishabh. But i want to search= over duplicate columns only.

-Vivek

On Fri, Oct 5, 2012 at 2:45 PM= , Rishabh Agrawal <rishabh.agrawal@impetus.co.in> wrote:

Try making user_name a primary key in combination with some other unique column= and see if results are improving.

-Rishabh

From: Vivek Mishra= [mailto:mishra.= vivs@gmail.com]
Sent: Friday, October 05, 2012 2:35 PM
To: u= ser@cassandra.apache.org
Subject: Query over secondary indexes

=A0

I have a column family "User" whic= h is having a indexed column "user_name". My schema is having aro= und 0.1 million records only and user_name is duplicated=A0 across all rows= .

Now when i am trying to retrieve it as:

get User where user_name =3D 'Vivek', it is taking ages to retrieve= that data. Is there anything i am doing wrong?

Also, i tried get_indexed_slices via Thrift API by setting=A0 IndexClause.s= etCount(1), still=A0 no luck, it got hang and not even returning a single r= esult. I believe 0.1 million is not a huge amount of data.


Cassandra version : 1.1.2

Any idea?


-Vivek




Impetus Ranked in the Top 50 India=92s Best Companies to Work For 2012.
Impetus webcast =91Designing a Test Automation Framework for Multi-vendor I= nteroperable Systems=92 available at http://lf1.me/0E/.


NOTE: This message may contain information that is confidential, proprietar= y, privileged or otherwise protected by law. The message is intended solely= for the named addressee. If received in error, please destroy and notify t= he sender. Any use of this email is prohibited when received in error. Impetus does not represent, warrant = and/or guarantee, that the integrity of this communication has been maintai= ned nor that the communication is free of errors, virus, interception or in= terference.



--047d7b10cc6b619ec804cb965b5f--