Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 556FC2009E8 for ; Mon, 30 May 2016 16:55:50 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 53FE4160A19; Mon, 30 May 2016 14:55:50 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 75596160969 for ; Mon, 30 May 2016 16:55:49 +0200 (CEST) Received: (qmail 15510 invoked by uid 500); 30 May 2016 14:55:48 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 15500 invoked by uid 99); 30 May 2016 14:55:48 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 30 May 2016 14:55:48 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 94F771A4845 for ; Mon, 30 May 2016 14:55:47 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.198 X-Spam-Level: * X-Spam-Status: No, score=1.198 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=2, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id ezO7AV15nXzc for ; Mon, 30 May 2016 14:55:46 +0000 (UTC) Received: from mail-oi0-f44.google.com (mail-oi0-f44.google.com [209.85.218.44]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 61F725F24C for ; Mon, 30 May 2016 14:55:46 +0000 (UTC) Received: by mail-oi0-f44.google.com with SMTP id j1so271793290oih.3 for ; Mon, 30 May 2016 07:55:46 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:references:in-reply-to:from:date:message-id:subject:to; bh=MiY0bMkXI/Kbp1Q7BTzRFm7P9SW9FrFir8AMA6r/a2k=; b=h22L8oyaKPSHwPj0kD1P77ak6SmztQfa1435V1pWHyUaSdvu1ORMowMS5xMT0wwup2 6derFjrfCC+94CfnplVNu4rV6mDpj7YgSjPWoFda/FyVA0coNDOtiLLy4m1UpaHfgyZb jdq1HPlhb/HNHLN6XMvZ2aMpLwMZp3+BUL8zPictp5qGMVeVyfVTz+4cT9iCO/jMHDBX L9JESmYZliN77+fhBo+ysNg6GXePKb3CGY24ltLzCHoOTMjWms6ZRXe9Y7NbzjlLYx5j 3BgNlshAczvEKd37XUVoJ7Mi7sV+XXMlN3lXasJSS5fGNLy0ksivIASEdTqeB75xXXu2 +1xw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to; bh=MiY0bMkXI/Kbp1Q7BTzRFm7P9SW9FrFir8AMA6r/a2k=; b=UR4aRk0JRBY+pt6L1o+2wDkCWgRRDZh1QeZGMZzg4GNV6xjmT5ObiM8j+MuMKIyXQs 5sSy7coHQztuk2K/F3D642sZTW44EWWFPlFg4lt0YhVAd7APsftQq7MoCjCpvwtEHebQ UFKnOEwT5ousKyd68bVYZOB5wo9eZYPbZmIrE81SxX3tjfF2WTBBw2h89Z05Y67HaQ+b VjXwL21s5AtqdbOV0q0R4AZjcM/VFkhBmKeCBvLKt9S3XpJ7Iolzkd1GiSASHdaewT0j CEl0VKfqb3y2IfX1/iGzW9+HY3F/HF4gyy7ViavUJSzCjvJsAe6P/2o3eNTrKVPLQd5D p6Ng== X-Gm-Message-State: ALyK8tJVzYy04rn0020j+wqPuQ3U7f3U2Evnl4O3fVs0OZM7Z8RL2PChcM7mFYJjtj6JWwItxInGJYd60XFGOg== X-Received: by 10.202.220.87 with SMTP id t84mr15969002oig.57.1464620145625; Mon, 30 May 2016 07:55:45 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Eric Stevens Date: Mon, 30 May 2016 14:55:36 +0000 Message-ID: Subject: Re: Per node limit for Disk Space To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=001a113d30b282d07005341073f5 archived-at: Mon, 30 May 2016 14:55:50 -0000 --001a113d30b282d07005341073f5 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Those are rough guidelines, actual effective node size is going to depend on your read/write workload and the compaction strategy you choose. The biggest reason data density per node usually needs to be limited is due to data grooming overhead introduced by compaction. Data at rest essentially becomes I/O debt. If you're using Leveled compaction, the interest rate on that debt is higher. If you're writing aggressively you'll find that you run out of I/O capacity for smaller data at rest. If you use compaction strategies that allow for data to eventually stop compacting (Date Tiered, Time Windowed), you may be able to have higher data density per node assuming that some of your data is going into the no-longer-compacting stages. Beyond that it'll be hard to say what the right size for you is. Target the recommended numbers and if you find that you're not running out of I/O as you approach them you can probably go bigger. Just remember to leave ~50% disk capacity free to leave room for compaction to happen. On Fri, May 27, 2016 at 1:52 PM Anshu Vajpayee wrote: > Hi All, > I have question regarding max disk space limit on a node. > > As per Data stax, We can have 1TB max disk space for rotational disks and > up to 5 TB for SSDs on a node. > > Could you please suggest per your experience what would be limit for spac= e > on a single node with out causing so much stress on a node? > > > > > > *=E2=80=8BThanks,=E2=80=8B* > > --001a113d30b282d07005341073f5 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Those are rough guidelines, actual effective node size is = going to depend on your read/write workload and the compaction strategy you= choose.=C2=A0 The biggest reason data density per node usually needs to be= limited is due to data grooming overhead introduced by compaction.=C2=A0 D= ata at rest essentially becomes I/O debt.=C2=A0 If you're using Leveled= compaction, the interest rate on that debt is higher. =C2=A0

If you're writing aggressively you'll find that you run out o= f I/O capacity for smaller data at rest.=C2=A0 If you use compaction strate= gies that allow for data to eventually stop compacting (Date Tiered, Time W= indowed), you may be able to have higher data density per node assuming tha= t some of your data is going into the no-longer-compacting stages.

Beyond that it'll be hard to say what the right size f= or you is.=C2=A0 Target the recommended numbers and if you find that you= 9;re not running out of I/O as you approach them you can probably go bigger= .=C2=A0 Just remember to leave ~50% disk capacity free to leave room for co= mpaction to happen.

On Fri, May 27, 2016 at 1:52 PM Anshu Vajpayee <anshu.vajpayee@gmail.com> wrote:
Hi All,
I have question regarding = max disk space limit =C2=A0on a node.

As per Data stax, We can have 1T= B max disk space for rotational disks and up to 5 TB for SSDs on a node.
Could you please suggest per your experience what would be limit for spa= ce on a single node with out causing so much stress on a =C2=A0node?=C2=A0<= /div>
=



=
=E2= =80=8BThanks,=E2=80=8B


<= /div>
--001a113d30b282d07005341073f5--