Subject: Re: Bottleneck for small inserts?
To: user@cassandra.apache.org
From: Cogumelos Maravilha
Date: Tue, 23 May 2017 09:23:36 +0100

Hi,

Change to durable_writes = false
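
durable_writes is a per-keyspace setting, so in cqlsh it would be something like the following (keyspace1 is just the default cassandra-stress keyspace and is an assumption here; substitute your own):

    ALTER KEYSPACE keyspace1 WITH durable_writes = false;

That skips the commit log for writes to that keyspace, so it trades durability for write throughput, but it should show whether the commit log is the limiting factor.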

And please post the results.

Thanks.

On 05/22/2017 10:08 PM, Jonathan Haddad wrote:

How many CPUs are you using for interrupts? http://www.alexonlinux.com/smp-affinity-and-proper-interrupt-handling-in-linux

Have you tried making a flame graph to see where Cassandra is spending its time? http://www.brendangregg.com/blog/2014-06-12/java-flame-graphs.html
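
Roughly, per that post (this sketch assumes Linux perf plus Brendan Gregg's FlameGraph scripts and perf-map-agent for JVM symbol maps; the JVM also needs -XX:+PreserveFramePointer on Java 8u60+):

    # sample on-CPU stacks on all CPUs for 30 seconds
    sudo perf record -F 99 -a -g -- sleep 30
    # generate /tmp/perf-<pid>.map for the Cassandra JVM first (e.g. with
    # perf-map-agent's create-java-perf-map.sh), then fold and render the stacks:
    sudo perf script | ./stackcollapse-perf.pl | ./flamegraph.pl > cassandra-flame.svg

stackcollapse-perf.pl and flamegraph.pl are from https://github.com/brendangregg/FlameGraph.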

Are you tracking GC pauses?
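
If not, enabling GC logging is a cheap way to check. These are standard HotSpot flags (where they go, cassandra-env.sh vs jvm.options, depends on how your 3.0.11 install is packaged, so treat the path as an example):

    -Xloggc:/var/log/cassandra/gc.log
    -XX:+PrintGCDetails
    -XX:+PrintGCDateStamps
    -XX:+PrintGCApplicationStoppedTime

nodetool gcstats also gives a quick cumulative summary per node.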

Jon

On Mon, May 22, 2017 at 2:03 PM Eric Pederson <ericacm@gmail.com> wrote:
Hi all:

I'm new to Cassandra and I'm doing some performance testing. One of the things that I'm testing is ingestion throughput. My server setup is:
  • 3 node cluster
  • SSD data (both commit log and sstables are on the same disk)
  • 64 GB RAM per server
  • 48 cores per server
  • Cassandra 3.0.11
  • 48 GB heap using G1GC
  • 1 Gbps NICs
Since I'm using SSD I've tried tuning the following (one at a time), but none seemed to make much difference (see the cassandra.yaml sketch after this list):
  • concurrent_writes=384
  • memtable_flush_writers=8
  • concurrent_compactors=8
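For reference, these are the cassandra.yaml lines I changed (the values shown are the non-default ones I tried, changed one at a time as above):

    # cassandra.yaml
    concurrent_writes: 384
    memtable_flush_writers: 8
    concurrent_compactors: 8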
I am currently doing ingestion tests using cassandra-stress, sending data from 3 clients on the same subnet. The tests use CL=ONE and RF=2.

Using cassandra-stress (3.10) I am able to saturate the disk using a large enough column size and the standard five-column cassandra-stress schema. For example, -col size=fixed(400) will saturate the disk and compactions will start falling behind.
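
For concreteness, the invocation looks roughly like this (the node addresses, thread count, and exact population range are placeholders, not my real values):

    cassandra-stress write n=10000000 cl=ONE \
        -col size=FIXED(400) \
        -schema "replication(factor=2)" \
        -pop seq=1..10000000 \
        -node 10.0.0.1,10.0.0.2,10.0.0.3 \
        -rate threads=200

Each client gets its own -pop seq range so the partition IDs don't overlap.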

One of our main tables has a row size of approximately 200 bytes, across 64 columns. When ingesting this table I don't see any resource saturation. Disk utilization is around 10-15% per iostat. Incoming network traffic on the servers is around 100-300 Mbps. CPU utilization is around 20-70%. nodetool tpstats shows mostly zeros with occasional spikes around 500 in MutationStage.

The stress run does 10,000,000 inserts per client, each with a separate range of partition IDs. The run with 200-byte rows takes about 4 minutes, with a mean latency of 4.5 ms, total GC time of 21 sec, and average GC time of 173 ms.
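
(Sanity-checking that: assuming the three clients run concurrently, 3 x 10,000,000 rows / ~240 s is roughly 125,000 rows/s, which lines up with the ~120k rows/sec figure below.)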

The overall performance is good - around 120k rows/sec ingested.  But I'm curious to know where the bottleneck is.  There's no resource saturation and nodetool tpstats shows only occasional brief queueing.  Is the rest just expected latency inside of Cassandra?

Thanks,

-- Eric
