From: Nick Bailey <nick@riptano.com>
Date: Thu, 9 Dec 2010 18:15:37 -0600
Subject: Re: Cassandra and disk space
To: user@cassandra.apache.org

Additionally, cleanup will fail to run when the disk is more than 50% full.
Another reason to stay below 50%.
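A minimal pre-flight sketch of that rule, in Python (the data directory
path and the 50% cutoff are assumptions to adjust for your own install;
cleanup is assumed to need free space comparable to the data it rewrites):

    import shutil
    import subprocess

    DATA_DIR = "/var/lib/cassandra/data"  # assumed path

    def cleanup_if_safe(threshold=0.50):
        # Refuse to start cleanup on a disk that is already more than
        # half full, per the advice above.
        usage = shutil.disk_usage(DATA_DIR)
        used_fraction = (usage.total - usage.free) / usage.total
        if used_fraction > threshold:
            raise RuntimeError("disk is %.0f%% full; no headroom for cleanup"
                               % (used_fraction * 100))
        # nodetool cleanup rewrites SSTables, dropping ranges the node
        # no longer owns.
        subprocess.run(["nodetool", "cleanup"], check=True)

    cleanup_if_safe()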

On Thu, Dec 9, 2010 at 6:03 PM, Tyler Hobbs <tyler@riptano.com> wrote:
Yes, that's correct, but I wouldn't push it too far. You'll become much more sensitive to disk usage changes; in particular, rebalancing your cluster will be particularly difficult, and repair will also become dangerous. Disk performance also tends to drop when a disk nears capacity.

There's no recommended maximum size -- it all depends on your access rates. Anywhere from 10 GB to 1 TB is typical.

- Tyler


On Thu, Dec 9, 2010 at 5:52 PM, Rustam Aliyev <rustam@code.az> wrote:

That depends on your scenario. In the worst case of one big CF, there's not much that can be easily done for the disk usage of compaction and cleanup (which is essentially compaction).
If, instead, you have several column families and no single CF makes up the majority of your data, you can push your disk usage a bit higher.
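A rough sketch of that arithmetic, assuming a major compaction rewrites
one column family at a time, so the worst-case temporary space is the
largest CF rather than the sum of all of them:

    def compaction_headroom_gb(cf_sizes_gb):
        # Only the CF currently being compacted has to be rewritten at once.
        return max(cf_sizes_gb)

    print(compaction_headroom_gb([500]))      # one 500 GB CF -> 500 GB free
    print(compaction_headroom_gb([50] * 10))  # ten 50 GB CFs ->  50 GB free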


Is there any formula to calculate this? Let's say I have 500 GB in a single CF. So I need at least 500 GB of free space for compaction. If I partition this CF and split it into 10 proportional CFs of 50 GB each, does that mean I will need only 50 GB of free space?

Also, is there a recommended maximum data size per node?

Thanks.


A fundamental idea behind Cassandra's architecture is that disk space is cheap (which, indeed, it is). If you are particularly sensitive to this, Cassandra might not be the best solution to your problem. Also keep in mind that Cassandra performs well with average disks, so you don't need to spend a lot there. Additionally, most people find that the replication protects their data enough to allow them to use RAID 0 instead of 1, 10, 5, or 6.

- Tyler

On Thu, Dec 9, 2010 at 12:20 PM, Rustam Aliyev <rustam@code.az> wrote:
Are there any plans to improve this in the future?

For big data clusters this could be very expensive. Based on your comment, I will need 200 TB of storage for 100 TB of data to keep Cassandra running.

--
Rustam.
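The sizing arithmetic behind that estimate, as a sketch (the 50%
utilization target comes from this thread, and the 100 TB is taken as
already-replicated, on-disk data):

    def raw_storage_needed_tb(on_disk_data_tb, target_utilization=0.50):
        # Keep every node at or below the target so compaction, cleanup,
        # and repair have room to temporarily double its usage.
        return on_disk_data_tb / target_utilization

    print(raw_storage_needed_tb(100))  # 200.0 -> 200 TB raw for 100 TB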

On 09/12/2010 17:56, Tyler Hobbs wrote:
If you are on 0.6, repair is particularly dangerous with respect to disk space usage. If your replica is sufficiently out of sync, you can triple your disk usage pretty easily. This has been improved in 0.7, so repairs should use about half as much disk space, on average.

In general, yes, keep your nodes under 50% disk usage at all times. Any of compaction, cleanup, snapshotting, repair, or bootstrapping (the latter two are improved in 0.7) can double your disk usage temporarily.

You should plan to add more disk space or add nodes when you get close to this limit. Once you go over 50%, it's more difficult to add nodes, at least in 0.6.

- Tyler
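A sketch of that planning check, assuming per-node usage numbers already
come from your monitoring (hostnames and sizes are hypothetical):

    def nodes_at_risk(node_usage_gb):
        # Flag any node whose live data, doubled by one of the operations
        # above, would exceed its disk capacity.
        return [host for host, (used, capacity) in node_usage_gb.items()
                if used * 2 > capacity]

    nodes = {"cass1": (120, 500), "cass2": (300, 500)}
    print(nodes_at_risk(nodes))  # ['cass2']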

On Thu, Dec 9, 2010 at 11:19 AM, Mark <static.void.dev@gmail.com> wrote:
I recently ran into a problem during a repair operation where my nodes completely ran out of space and my whole cluster was... well, clusterfucked.

I want to make sure I know how to prevent this problem in the future.

Should I make sure that every node is under 50% of its disk space at all times? Are there any normal day-to-day operations I should be aware of that would cause any one node to double in size? If one or more nodes surpass the 50% mark, what should I plan to do?

Thanks for any advice



