Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5D61E1036E for ; Thu, 18 Jul 2013 21:04:09 +0000 (UTC) Received: (qmail 73335 invoked by uid 500); 18 Jul 2013 21:04:07 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 73279 invoked by uid 500); 18 Jul 2013 21:04:06 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 73271 invoked by uid 500); 18 Jul 2013 21:04:06 -0000 Delivered-To: apmail-incubator-cassandra-user@incubator.apache.org Received: (qmail 73268 invoked by uid 99); 18 Jul 2013 21:04:06 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 18 Jul 2013 21:04:06 +0000 X-ASF-Spam-Status: No, hits=1.3 required=5.0 tests=URI_HEX X-Spam-Check-By: apache.org Received-SPF: error (nike.apache.org: local policy) Received: from [216.139.236.26] (HELO sam.nabble.com) (216.139.236.26) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 18 Jul 2013 21:04:01 +0000 Received: from jim.nabble.com ([192.168.236.80]) by sam.nabble.com with esmtp (Exim 4.72) (envelope-from ) id 1UzvM7-0000lK-CW for cassandra-user@incubator.apache.org; Thu, 18 Jul 2013 14:03:19 -0700 Date: Thu, 18 Jul 2013 14:03:19 -0700 (PDT) From: hajjat To: cassandra-user@incubator.apache.org Message-ID: <1374181399357-7589141.post@n2.nabble.com> Subject: Recommended data size for Reads/Writes in Cassandra MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Hi, Is there a recommended data size for Reads/Writes in Cassandra? I tried inserting 10 MB objects and the latency I got was pretty high. Also, I was never able to insert larger objects (say 50 MB) since Cassandra kept crashing when I tried that. Here is my experiment setup: I used two Large VMs in EC2 within the same data-center. Inserts have ALL consistency (strong consistency). The latencies were as follows: Data size: 10 MB 1 MB 100 Bytes Latency: 250ms 50ms 8ms I've also done the same for two Large VMs across two data-centers. The latencies were around: Data size: 10 MB 1 MB 100 Bytes Latency: 1200ms 800ms 80ms 1) Ain't the 10 MB latency extremely high? 2) Is there a recommended data size to use with Cassandra (e.g., a few bytes up to 1 MB)? 3) Also, I tried inserting 50 MB data but Cassandra kept crashing. Does anybody know why? I thought the max data size should be up to 2 GB? Thanks, Mohammad PS. Here is my python code I use to insert into Cassandra. I put my stopwatch timers around the insert statement: fh = open(TEST_FILE,'r') data = str(fh.read()) POOL = ConnectionPool(keyspace, server_list=['localhost:9160'], timeout=None) USER = ColumnFamily(POOL, 'User') USER.insert('Ali', {'data': data},write_consistency_level=pycassa.cassandra.ttypes.ConsistencyLevel.ALL) -- View this message in context: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Recommended-data-size-for-Reads-Writes-in-Cassandra-tp7589141.html Sent from the cassandra-user@incubator.apache.org mailing list archive at Nabble.com.