From: "Arthur Zubarev"
To: user@cassandra.apache.org
Subject: Re: Unable to count records of a column family with 210 columns x 500K rows
Date: Tue, 11 Jun 2013 10:33:01 -0400

I sent this email a little early; the error I get is:
 
Request did not complete within rpc_timeout.
 
If I merely repeat the same query I get:
 
cqlsh:my_dw> select count(*) from MyCF limit 70000;
TSocket read 0 bytes
cqlsh:my_dw> select count(*) from MyCF limit 70000;
Traceback (most recent call last):
  File "/usr/bin/cqlsh", line 1001, in perform_statement_untraced
    self.cursor.execute(statement, decoder=decoder)
  File "/usr/share/cassandra/lib/cql-internal-only-1.4.0.zip/cql-1.4.0/cql/cursor.py", line 80, in execute
    response = self.get_response(prepared_q, cl)
  File "/usr/share/cassandra/lib/cql-internal-only-1.4.0.zip/cql-1.4.0/cql/thrifteries.py", line 77, in get_response
    return self.handle_cql_execution_errors(doquery, compressed_q, compress, cl)
  File "/usr/share/cassandra/lib/cql-internal-only-1.4.0.zip/cql-1.4.0/cql/thrifteries.py", line 96, in handle_cql_execution_errors
    return executor(*args, **kwargs)
  File "/usr/share/cassandra/lib/cql-internal-only-1.4.0.zip/cql-1.4.0/cql/cassandra/Cassandra.py", line 1782, in execute_cql3_query
    self.send_execute_cql3_query(query, compression, consistency)
  File "/usr/share/cassandra/lib/cql-internal-only-1.4.0.zip/cql-1.4.0/cql/cassandra/Cassandra.py", line 1793, in send_execute_cql3_query
    self._oprot.trans.flush()
  File "/usr/share/cassandra/lib/thrift-python-internal-only-0.7.0.zip/thrift/transport/TTransport.py", line 293, in flush
    self.__trans.write(buf)
  File "/usr/share/cassandra/lib/thrift-python-internal-only-0.7.0.zip/thrift/transport/TSocket.py", line 117, in write
    plus = self.handle.send(buff)
error: [Errno 32] Broken pipe
 
I then lose C* and need to restart its service to reconnect.
 
Does that mean I have an underpowered machine?
 
From: Arthur Zubarev
Sent: Tuesday, June 11, 2013 10:02 AM
To: user@cassandra.apache.org
Subject: Unable to count records of a column family with 210 columns x 500K rows
 
Hello,
 
I am unable to count records using cqlsh (e.g. select count(*) from MyCF limit 50000;).
I have a column family with 210 columns x 500K rows. The row length is 40K chars.
The same issue occurs with any other large CF.
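
For a sense of scale, here is a rough back-of-the-envelope estimate of how much data such a count might have to touch. It is only a sketch under assumptions not stated above: that "40K chars" means 40,000 characters, that each character takes about 1 byte, and that the count has to read each row in full. The function name is made up for illustration.

```python
# Rough estimate of the data volume behind a count over wide rows.
# Assumptions (not from the post): 40K = 40,000 chars, ~1 byte per char,
# and every counted row is read in full.

ROWS = 500_000          # "500K rows"
CHARS_PER_ROW = 40_000  # "row length is 40K chars"


def estimated_scan_bytes(rows: int, chars_per_row: int, bytes_per_char: int = 1) -> int:
    """Return the approximate number of bytes read if every row is scanned."""
    return rows * chars_per_row * bytes_per_char


full = estimated_scan_bytes(ROWS, CHARS_PER_ROW)
limited = estimated_scan_bytes(70_000, CHARS_PER_ROW)  # the "limit 70000" query

print(f"full table:  ~{full / 1e9:.1f} GB")
print(f"limit 70000: ~{limited / 1e9:.1f} GB")
```

Under those assumptions, even the limited count would cover a few gigabytes of row data within a single request, which may be relevant to the rpc_timeout error.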
 
 