Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 69D899B42 for ; Tue, 26 Feb 2013 15:32:48 +0000 (UTC) Received: (qmail 49399 invoked by uid 500); 26 Feb 2013 15:32:45 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 49381 invoked by uid 500); 26 Feb 2013 15:32:45 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 49372 invoked by uid 99); 26 Feb 2013 15:32:45 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Feb 2013 15:32:45 +0000 X-ASF-Spam-Status: No, hits=-2.3 required=5.0 tests=RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [192.174.58.134] (HELO XEDGEA.nrel.gov) (192.174.58.134) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 26 Feb 2013 15:32:39 +0000 Received: from XHUBA.nrel.gov (10.20.4.58) by XEDGEA.nrel.gov (192.174.58.134) with Microsoft SMTP Server (TLS) id 8.3.245.1; Tue, 26 Feb 2013 08:32:07 -0700 Received: from MAILBOX2.nrel.gov ([fe80::19a0:6c19:6421:12f]) by XHUBA.nrel.gov ([::1]) with mapi; Tue, 26 Feb 2013 08:32:17 -0700 From: "Hiller, Dean" To: "user@cassandra.apache.org" Date: Tue, 26 Feb 2013 08:32:17 -0700 Subject: Re: Read Perf Thread-Topic: Read Perf Thread-Index: Ac4UNnUiu5l1Xs5HQuCmA79UaJum7g== Message-ID: In-Reply-To: <57C7C3CBDCB04F45A57AEC4CB21C0CCD1DB48425@mbx024-e1-nj-6.exch024.domain.local> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: user-agent: Microsoft-MacOutlook/14.2.5.121010 acceptlanguage: en-US Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Virus-Checked: Checked by ClamAV on apache.org In that case, make sure you don't plan on going into the millions or test the limit as I pretty sure it can't go above 10 million. (from previous posts on this list). Dean On 2/26/13 8:23 AM, "Kanwar Sangha" wrote: >Thanks. For our case, the no of rows will more or less be the same. The >only thing which changes is the columns and they keep getting added. > >-----Original Message----- >From: Hiller, Dean [mailto:Dean.Hiller@nrel.gov] >Sent: 26 February 2013 09:21 >To: user@cassandra.apache.org >Subject: Re: Read Perf > >To find stuff on disk, there is a bloomfilter for each file in memory. >On the docs, 1 billion rows has 2Gig of RAM, so it really will have a >huge dependency on your number of rows. As you get more rows, you may >need to modify the bloomfilter false positive to use less RAM but that >means slower reads. Ie. As you add more rows, you will have slower reads >on a single machine. > >We hit the RAM limit on one machine with 1 billion rows so we are in the >process of tweaking the ratio of 0.000744(the default) to 0.1 to give us >more time to solve. Since we see no I/o load on our machines(or rather >extremely little), we plan on moving to leveled compaction where 0.1 is >the default in new releases and size tiered new default I think is 0.01. > >Ie. If you store more data per row, this is not an issue as much but >still something to consider. (Also, rows have a limit I think as well on >data size but not sure what that is. I know the column limit on a row is >in the millions, somewhere lower than 10 million). > >Later, >Dean > >From: Kanwar Sangha > >Reply-To: "user@cassandra.apache.org" >> >Date: Monday, February 25, 2013 8:31 PM >To: "user@cassandra.apache.org" >> >Subject: Read Perf > >Hi - I am doing a performance run using modified YCSB client and was able >to populate 8TB on a node and then ran some read workloads. I am seeing >an average TPS of 930 ops/sec for random reads. There is no key cache/row >cache. Question - > >Will the read TPS degrade if the data size increases to say 20 TB , 50 >TB, 100 TB ? If I understand correctly, the read should remain constant >irrespective of the data size since we eventually have sorted SStables >and binary search would be done on the index filter to find the row ? > > >Thanks, >Kanwar