Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: local policy)
From: aaron morton <aaron@thelastpickle.com>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_C74F1ED7-A8A7-45BF-918A-795CF34F5E0A"
Message-Id: <EE9251A5-62F0-41DC-A305-E94EE09E7B84@thelastpickle.com>
Mime-Version: 1.0 (Mac OS X Mail 6.2 \(1499\))
Subject: Re: Read operations resulting in a write?
Date: Mon, 17 Dec 2012 14:41:13 +1300
References: <1355529494.21459.YahooMailNeo@web164602.mail.gq1.yahoo.com>
To: Cassandra User <user@cassandra.apache.org>
In-Reply-To: <1355529494.21459.YahooMailNeo@web164602.mail.gq1.yahoo.com>


--Apple-Mail=_C74F1ED7-A8A7-45BF-918A-795CF34F5E0A
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=iso-8859-1


> 1) Am I reading things correctly?
Yes.
If you do a read/slice by name and more than the min compaction threshold
of SSTables were read, the data is re-written so that the next read uses
fewer SSTables.

> 2) What is really happening here?  Essentially minor compactions can
> occur between 4 and 32 memtable flushes.  Looking through the code, this
> seems to only affect a couple types of select statements (when selecting
> a specific column on a specific key being one of them). During the time
> between these two values, every "select" statement will perform a write.
Yup, only for reading a row where the column names are specified.
Remember that minor compaction, when using SizeTiered compaction (the
default), works on buckets of similarly sized SSTables.

Imagine a row that has been around for a while and has fragments in more
than Min Compaction Threshold SSTables. Say it is in 3 SSTables in the 2nd
tier and 2 SSTables in the 1st, so it takes (potentially) 5 SSTable
reads. If this row is read, it will get hoisted back up.

But if the row is in only 1 SSTable in the 2nd tier and 2 in the 1st
tier, it will not be hoisted.
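
As a rough illustration of the two cases above, here is a minimal Java
sketch of the hoisting check. It mirrors the condition in the
CollationController snippet quoted further down, but the class and method
names are made up for this example, and the threshold of 4 is assumed to
be the default minimum compaction threshold.

```java
// Hypothetical stand-in for the hoisting check; not Cassandra's actual API.
class HoistDecision {
    /**
     * A read is re-written ("hoisted") only when it touched MORE SSTables
     * than the minimum compaction threshold, compaction is enabled, and
     * the column family uses size-tiered compaction.
     */
    static boolean shouldHoist(int sstablesIterated,
                               int minCompactionThreshold,
                               boolean compactionDisabled,
                               boolean sizeTiered) {
        return sstablesIterated > minCompactionThreshold
                && !compactionDisabled
                && sizeTiered;
    }

    public static void main(String[] args) {
        int minThreshold = 4; // assumption: default minimum compaction threshold

        // 3 SSTables in the 2nd tier + 2 in the 1st = 5 reads: hoisted.
        System.out.println(shouldHoist(5, minThreshold, false, true));

        // 1 SSTable in the 2nd tier + 2 in the 1st = 3 reads: not hoisted.
        System.out.println(shouldHoist(3, minThreshold, false, true));
    }
}
```

Note the strict "greater than": a read that touches exactly the threshold
number of SSTables would not be hoisted under this check.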

There are a few short circuits in the SliceByName read path. One of them
is to end the search when we know that no other SSTables contain columns
that should be considered. So if the 4 columns you read frequently are
hoisted into the 1st bucket, your reads will get handled by that one
bucket.
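
To make that short circuit concrete, here is a simplified Java sketch. It
assumes SSTables are visited newest-first by their maximum timestamp, and
that a column already found with a timestamp newer than anything in the
remaining SSTables needs no further searching. The SSTable class and the
sstablesRead method are hypothetical and only model the idea; they are not
Cassandra's real classes.

```java
import java.util.*;

// Simplified model of a by-name read; all names here are hypothetical.
class TimeOrderedRead {
    /** Toy SSTable for one row: column name -> write timestamp. */
    static final class SSTable {
        final Map<String, Long> columns;
        final long maxTimestamp;

        SSTable(Map<String, Long> columns) {
            this.columns = columns;
            this.maxTimestamp = columns.isEmpty()
                    ? Long.MIN_VALUE
                    : Collections.max(columns.values());
        }
    }

    /** Returns how many SSTables were read to resolve the requested columns. */
    static int sstablesRead(List<SSTable> sstables, Set<String> requested) {
        List<SSTable> newestFirst = new ArrayList<>(sstables);
        newestFirst.sort((a, b) -> Long.compare(b.maxTimestamp, a.maxTimestamp));

        Map<String, Long> found = new HashMap<>();
        Set<String> remaining = new HashSet<>(requested);
        int iterated = 0;
        for (SSTable sstable : newestFirst) {
            // Drop columns whose found version is newer than anything this
            // SSTable (or any older one) could hold.
            remaining.removeIf(name -> found.containsKey(name)
                    && found.get(name) > sstable.maxTimestamp);
            if (remaining.isEmpty()) {
                break; // short circuit: no other SSTable can matter
            }
            iterated++;
            for (String name : remaining) {
                Long ts = sstable.columns.get(name);
                if (ts != null) {
                    found.merge(name, ts, Math::max);
                }
            }
        }
        return iterated;
    }

    public static void main(String[] args) {
        Set<String> wanted = Set.of("about", "changed", "data");

        // All wanted columns sit together in the newest SSTable.
        List<SSTable> compact = List.of(
                new SSTable(Map.of("about", 1L, "changed", 2L)),
                new SSTable(Map.of("data", 3L)),
                new SSTable(Map.of("about", 10L, "changed", 10L, "data", 10L)));
        System.out.println(sstablesRead(compact, wanted)); // prints 1

        // The same columns are spread across three SSTables.
        List<SSTable> fragmented = List.of(
                new SSTable(Map.of("about", 1L)),
                new SSTable(Map.of("changed", 2L)),
                new SSTable(Map.of("data", 3L)));
        System.out.println(sstablesRead(fragmented, wanted)); // prints 3
    }
}
```

In this model the read stops after one SSTable only when every requested
column there is newer than the maximum timestamp of each older SSTable.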

It's not every select, just those that touched more than the min
compaction threshold of SSTables.


> 3) Is this desired behavior?  Is there something else I should be
> looking at that could be causing this behavior?
Yes.
https://issues.apache.org/jira/browse/CASSANDRA-2503

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 15/12/2012, at 12:58 PM, Michael Theroux <mtheroux2@yahoo.com> wrote:

> Hello,
>
> We have an unusual situation that I believe I've reproduced, at least
> temporarily, in a test environment.  I also think I see where this issue
> is occurring in the code.
>
> We have a specific column family that is under heavy read and write
> load on a nightly basis.  For the purposes of this description, I'll
> refer to this column family as "Bob".  During this nightly processing,
> sometimes Bob is under very heavy write load, other times it is under
> very heavy read load.
>
> The application is such that when something is written to Bob, a write
> is made to one of two other tables.  We've witnessed a situation where
> the write count on Bob far outstrips the write count on either of the
> other tables, by a factor of 3 to 10.  This is based on the WriteCount
> available on the column family JMX MBean.  We have not been able to find
> where in our code this is happening, and we have gone as far as tracing
> our CQL calls to determine that the relationship between Bob and the
> other tables is what we expect.
>
> I brought up a test node to experiment, and see a situation where,
> when a "select" statement is executed, a write will occur.
>
> In my test, I perform the following (switching between nodetool and
> cqlsh):
>
> update bob set 'about'='coworker' where key='<hex key>';
> nodetool flush
> update bob set 'about'='coworker' where key='<hex key>';
> nodetool flush
> update bob set 'about'='coworker' where key='<hex key>';
> nodetool flush
> update bob set 'about'='coworker' where key='<hex key>';
> nodetool flush
> update bob set 'about'='coworker' where key='<hex key>';
> nodetool flush
>
> Then, for a period of time (before a minor compaction occurs), a
> select statement that selects specific columns will cause the write
> count of the column family to increase:
>
> select about,changed,data from bob where key='<hex key>';
>
> This situation will continue until a minor compaction is completed.
>
> I went into the code and added some traces to
> CollationController.java:
>
>   private ColumnFamily collectTimeOrderedData()
>     {
>         logger.debug("collectTimeOrderedData");
>
>       ... <snip> ...
>
> ---> HERE   logger.debug( "tables iterated: " + sstablesIterated +  " Min compact: " + cfs.getMinimumCompactionThreshold() );
>             // "hoist up" the requested data into a more recent sstable
>             if (sstablesIterated > cfs.getMinimumCompactionThreshold()
>                 && !cfs.isCompactionDisabled()
>                 && cfs.getCompactionStrategy() instanceof SizeTieredCompactionStrategy)
>             {
>                 RowMutation rm = new RowMutation(cfs.table.name, new Row(filter.key, returnCF.cloneMe()));
>                 try
>                 {
> ---> HERE               logger.debug( "Apply hoisted up row mutation" );
>                     // skipping commitlog and index updates is fine since we're just de-fragmenting existing data
>                     Table.open(rm.getTable()).apply(rm, false, false);
>                 }
>                 catch (IOException e)
>                 {
>                     // log and allow the result to be returned
>                     logger.error("Error re-writing read results", e);
>                 }
>             }
> ... <snip> ...
>
> Performing the steps above, I see the following traces (in the test
> environment I decreased the minimum compaction threshold to make this
> easier to reproduce).  After I do a couple of update/flush cycles, I see
> this in the log:
>
> DEBUG [FlushWriter:7] 2012-12-14 22:54:40,106 CompactionManager.java (line 117) Scheduling a background task check for bob with SizeTieredCompactionStrategy
>
> Then, until compaction occurs, I see (when performing a select):
>
> DEBUG [ScheduledTasks:1] 2012-12-14 22:55:15,998 LoadBroadcaster.java (line 86) Disseminating load info ...
> DEBUG [Thrift:12] 2012-12-14 22:55:16,990 CassandraServer.java (line 1227) execute_cql_query
> DEBUG [Thrift:12] 2012-12-14 22:55:16,991 QueryProcessor.java (line 445) CQL statement type: SELECT
> DEBUG [Thrift:12] 2012-12-14 22:55:16,991 StorageProxy.java (line 653) Command/ConsistencyLevel is SliceByNamesReadCommand(table='open', key=804229d1933669d0a25d2a38c8b26ded10069573003e6dbb1ce21b5f402a5342, columnParent='QueryPath(columnFamilyName='bob', superColumnName='null', columnName='null')', columns=[about,changed,data,])/ONE
> DEBUG [Thrift:12] 2012-12-14 22:55:16,992 ReadCallback.java (line 79) Blockfor is 1; setting up requests to /10.0.4.20
> DEBUG [Thrift:12] 2012-12-14 22:55:16,992 StorageProxy.java (line 669) reading data locally
> DEBUG [ReadStage:61] 2012-12-14 22:55:16,992 StorageProxy.java (line 813) LocalReadRunnable reading SliceByNamesReadCommand(table='open', key=804229d1933669d0a25d2a38c8b26ded10069573003e6dbb1ce21b5f402a5342, columnParent='QueryPath(columnFamilyName='bob', superColumnName='null', columnName='null')', columns=[about,changed,data,])
> DEBUG [ReadStage:61] 2012-12-14 22:55:16,992 CollationController.java (line 68) In get top level columns: class org.apache.cassandra.db.filter.NamesQueryFilter type: Standard valid: class org.apache.cassandra.db.marshal.BytesType
> DEBUG [ReadStage:61] 2012-12-14 22:55:16,992 CollationController.java (line 84) collectTimeOrderedData
> ---> DEBUG [ReadStage:61] 2012-12-14 22:55:17,192 CollationController.java (line 188) tables iterated: 4 Min compact: 2
> ----> DEBUG [ReadStage:61] 2012-12-14 22:55:17,192 CollationController.java (line 198) Apply hoisted up row mutation
> DEBUG [ReadStage:61] 2012-12-14 22:55:17,193 Table.java (line 395) applying mutation of row 804229d1933669d0a25d2a38c8b26ded10069573003e6dbb1ce21b5f402a5342
>
> The above traces will occur every time I repeat the above select
> statement.
>
> Minor compaction doesn't start until a few minutes after the request
> was submitted above (note, this is an unloaded test node):
>
> DEBUG [CompactionExecutor:11] 2012-12-14 22:57:03,278 IntervalNode.java (line 45) Creating IntervalNode from [Interval(DecoratedKey(Token(bytes[804229d1933669d0a25d2a38c8b26ded10069573003e6dbb1ce...
>
> Once minor compaction occurs, the write count stops being incremented
> by reads, until more than the minimum compaction threshold of memtables
> are flushed to disk again.
>
> So, my questions are:
>
> 1) Am I reading things correctly?
>
> 2) What is really happening here?  Essentially minor compactions can
> occur between 4 and 32 memtable flushes.  Looking through the code, this
> seems to only affect a couple types of select statements (when selecting
> a specific column on a specific key being one of them). During the time
> between these two values, every "select" statement will perform a write.
>
> 3) Is this desired behavior?  Is there something else I should be
> looking at that could be causing this behavior?
>
> We are running Cassandra 1.1.2, with SizeTieredCompactionStrategy.
> Any help is appreciated,
> Thanks,
> -Mike


--Apple-Mail=_C74F1ED7-A8A7-45BF-918A-795CF34F5E0A--