Date: Tue, 16 Jun 2015 15:08:45 +0530
Subject: Tuning Cassandra 2.1 for High Writes and Immediate Reads
From: Pracheer Agarwal
To: user@cassandra.apache.org

Hi,

We are evaluating Cassandra 2.1 for our new production system. The requirements are:

1. 15K writes/sec, each a 5 KB blob in a single column of a column family.
2. Each write is followed immediately by reads from multiple consumer threads; a read must return the entire row, not only the most recently updated column.
3. Around 1B unique keys.
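
For reference, the write requirement works out to roughly the following. This is only a back-of-envelope sketch: it uses the replication factor (3) and node count (10) given further down in this mail, assumes 1 KB = 1000 bytes and an even spread of writes across nodes, and ignores commit log and compaction overhead. The function name `write_load` is purely illustrative.

```python
# Back-of-envelope write load from the figures quoted in this mail.
WRITES_PER_SEC = 15_000
BLOB_BYTES = 5 * 1000        # assumption: 1 KB = 1000 bytes
REPLICATION_FACTOR = 3
NODES = 10


def write_load(writes_per_sec, blob_bytes, rf, nodes):
    """Return (client MB/s, cluster-wide replica MB/s, per-node replica MB/s)."""
    client_mb = writes_per_sec * blob_bytes / 1e6   # payload sent by clients
    cluster_mb = client_mb * rf                     # every write lands on rf replicas
    per_node_mb = cluster_mb / nodes                # assumes an even token distribution
    return client_mb, cluster_mb, per_node_mb


client_mb, cluster_mb, per_node_mb = write_load(
    WRITES_PER_SEC, BLOB_BYTES, REPLICATION_FACTOR, NODES
)
print(f"client payload:    {client_mb:.1f} MB/s")
print(f"replicated writes: {cluster_mb:.1f} MB/s cluster-wide")
print(f"per node:          {per_node_mb:.1f} MB/s")
```

These numbers cover payload bytes only; the actual disk traffic per node would be higher once the commit log and compaction are included.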

So I am assuming that reads can be served from the memtable (if the data has not been flushed yet) and, for flushed data, from SSTables with the help of the key cache. (The row cache is disabled.)

How can we optimize for higher read throughput at the cost of writes?

Machine configuration (10-node cluster):
  • 24 cores per machine
  • 64 GB RAM
  • 2 TB × 5 HDD per machine
  • 10G NIC

So far, we have done the following to optimize:
1. Allocated a 4 GB key cache.
2. Designed the partition key and clustering key so that every new event creates a new cell; we never update a record.
3. No lightweight transactions.
4. Replication factor 3.
5. Write quorum 2, read quorum 2.
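
To spell out why we chose these consistency settings for the immediate-read requirement: a read is guaranteed to touch at least one replica that acknowledged the latest write when the write and read replica counts together exceed the replication factor. The sketch below just checks that arithmetic; `quorum` and `read_sees_latest_write` are illustrative helper names, not part of any Cassandra API.

```python
def quorum(replication_factor):
    """Number of replicas in a quorum: floor(RF / 2) + 1."""
    return replication_factor // 2 + 1


def read_sees_latest_write(replication_factor, write_replicas, read_replicas):
    """True when every read replica set must overlap every write replica set."""
    return write_replicas + read_replicas > replication_factor


RF = 3
w = quorum(RF)  # replicas that must acknowledge a write
r = quorum(RF)  # replicas that must answer a read

print(f"RF={RF}: write quorum={w}, read quorum={r}")
print("read overlaps latest write:", read_sees_latest_write(RF, w, r))

# For contrast (hypothetical setting): one replica each for writes and
# reads does not guarantee overlap at RF=3.
print("W=1, R=1 overlaps:", read_sees_latest_write(RF, 1, 1))
```

With RF 3, quorum writes plus quorum reads satisfy the overlap condition, at the price of waiting for two replicas on every operation.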

How can we further optimize for the write/read patterns explained above?

Thanks,
Pracheer