Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
MIME-Version: 1.0
In-Reply-To: <CAGptfvY1Vvg=s-y0jAHWMFGRybqFpa=9fr_9RyQVw+Q3_rO_GA@mail.gmail.com>
References: <CANKcoOyC8Rn+A5H9yebq3x7xBd8T+ibaLanZU5BdX+ax9Y0siQ@mail.gmail.com>
 <CAG_0Gqub60oUFNSC+g6iohosaGtqaU_5q_kXSG+7mNPQssww4g@mail.gmail.com>
 <CANKcoOzo43_MsHOc6um1s72vftEKmgpDy2ShVTYer8At6OSERA@mail.gmail.com> <CAGptfvY1Vvg=s-y0jAHWMFGRybqFpa=9fr_9RyQVw+Q3_rO_GA@mail.gmail.com>
From: srinivasarao daruna <sree.srinu38@gmail.com>
Date: Thu, 16 Mar 2017 13:19:15 -0400
Message-ID: <CANKcoOzj08i2Kx82HAOQiuWtw4yC5U+732i-9HgdLYy8uf7F=A@mail.gmail.com>
Subject: Re: Issue with Cassandra consistency in results
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=001a114e45c0b8b639054adc4217
archived-at: Thu, 16 Mar 2017 17:19:23 -0000

--001a114e45c0b8b639054adc4217
Content-Type: text/plain; charset=UTF-8

Would switching to select partition_key instead of select count(*) help me
any way ?

I know that, Logically they both are same.. but just asking  out of
desperation. Is it worth a shot?


On Mar 16, 2017 1:09 PM, "Ryan Svihla" <rs@foundev.pro> wrote:

        Replication factor is 3, and write consistency is ONE and read
consistency is QUORUM.

That combination is not gonna work well:

*Write succeeds to NODE A but fails on node B,C*

*Read goes to NODE B, C*

If you can tolerate some temporary inaccuracy you can use QUORUM but may
still have the situation where

Write succeeds on node A a timestamp 1, B succeeds at timestamp 2
Read succeeds on node B and C at timestamp 1

If you need fully race condition free counts I'm afraid you need to use
SERIAL or LOCAL_SERIAL (for in DC only accuracy)

On Thu, Mar 16, 2017 at 1:04 PM, srinivasarao daruna <sree.srinu38@gmail.com
> wrote:

> Replication strategy is SimpleReplicationStrategy.
>
> Smith is : EC2 snitch. As we deployed cluster on EC2 instances.
>
> I was worried that CL=ALL have more read latency and read failures. But
> won't rule out trying it.
>
> Should I switch select count (*) to select partition_key column? Would
> that be of any help.?
>
>
> Thank you
> Regards
> Srini
>
> On Mar 16, 2017 12:46 PM, "Arvydas Jonusonis" <arvydas.jonusonis@gmail.com>
> wrote:
>
> What are your replication strategy and snitch settings?
>
> Have you tried doing a read at CL=ALL? If it's an actual inconsistency
> issue (missing data), this should cause the correct results to be returned.
> You'll need to run a repair to fix the inconsistencies.
>
> If all the data is actually there, you might have one or several nodes
> that aren't identifying the correct replicas.
>
> Arvydas
>
>
>
> On Thu, Mar 16, 2017 at 5:31 PM, srinivasarao daruna <
> sree.srinu38@gmail.com> wrote:
>
>> Hi Team,
>>
>> We are struggling with a problem related to cassandra counts, after
>> backup and restore of the cluster. Aaron Morton has suggested to send this
>> to user list, so some one of the list will be able to help me.
>>
>> We are have a rest api to talk to cassandra and one of our query which
>> fetches count is creating problems for us.
>>
>> We have done backup and restore and copied all the data to new cluster.
>> We have done nodetool refresh on the tables, and did the nodetool repair as
>> well.
>>
>> However, one of our key API call is returning inconsistent results. The
>> result count is 0 in the first call and giving the actual values for later
>> calls. The query frequency is bit high and failure rate has also raised
>> considerably.
>>
>> 1) The count query has partition keys in it. Didnt see any read timeout
>> or any errors from api logs.
>>
>> 2) This is how our code of creating session looks.
>>
>> val poolingOptions = new PoolingOptions
>>     poolingOptions
>>       .setCoreConnectionsPerHost(HostDistance.LOCAL, 4)
>>       .setMaxConnectionsPerHost(HostDistance.LOCAL, 10)
>>       .setCoreConnectionsPerHost(HostDistance.REMOTE, 4)
>>       .setMaxConnectionsPerHost( HostDistance.REMOTE, 10)
>>
>> val builtCluster = clusterBuilder.withCredentials(username, password)
>>       .withPoolingOptions(poolingOptions)
>>       .build()
>> val cassandraSession = builtCluster.get.connect()
>>
>> val preparedStatement = cassandraSession.prepare(state
>> ment).setConsistencyLevel(ConsistencyLevel.QUORUM)
>> cassandraSession.execute(preparedStatement.bind(args :_*))
>>
>> Query: SELECT count(*) FROM table_name WHERE parition_column=? AND
>> text_column_of_clustering_key=? AND date_column_of_clustering_key<=? AND
>> date_column_of_clustering_key>=?
>>
>> 3) Cluster configuration:
>>
>> 6 Machines: 3 seeds, we are using apache cassandra 3.9 version. Each
>> machine is equipped with 16 Cores and 64 GB Ram.
>>
>>         Replication factor is 3, and write consistency is ONE and read
>> consistency is QUORUM.
>>
>> 4) cassandra is never down on any machine
>>
>> 5) Using cassandra-driver-core artifact with 3.1.1 version in the api.
>>
>> 6) nodetool tpstats shows no read failures, and no other failures.
>>
>> 7) Do not see any other issues from system.log of cassandra. We just see
>> few warnings as below.
>>
>> Maximum memory usage reached (512.000MiB), cannot allocate chunk of
>> 1.000MiB
>> WARN  [ScheduledTasks:1] 2017-03-14 14:58:37,141 QueryProcessor.java:103
>> - 88 prepared statements discarded in the last minute because cache limit
>> reached (32 MB)
>> The first api call returns 0 and the api calls later gives right values.
>>
>> Please let me know, if any other details needed.
>> Could you please have a look at this issue once and kindly give me your
>> inputs? This issue literally broke the confidence on Cassandra from our
>> business team.
>>
>> Your inputs will be really helpful.
>>
>> Thank You,
>> Regards,
>> Srini
>>
>
>
>


-- 

Thanks,
Ryan Svihla

--001a114e45c0b8b639054adc4217
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"auto"><div>Would switching to select partition_key instead of s=
elect count(*) help me any way ?<div dir=3D"auto"><br></div><div dir=3D"aut=
o">I know that, Logically they both are same.. but just asking =C2=A0out of=
 desperation. Is it worth a shot?</div><br><div class=3D"gmail_extra"><br><=
div class=3D"gmail_quote">On Mar 16, 2017 1:09 PM, &quot;Ryan Svihla&quot; =
&lt;<a href=3D"mailto:rs@foundev.pro">rs@foundev.pro</a>&gt; wrote:<br type=
=3D"attribution"><blockquote class=3D"quote" style=3D"margin:0 0 0 .8ex;bor=
der-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D"qu=
oted-text"><div style=3D"font-size:12.8px">=C2=A0 =C2=A0 =C2=A0 =C2=A0 Repl=
ication factor is 3, and write consistency is ONE and read consistency is Q=
UORUM.</div><div><br></div></div><div>That combination is not gonna work we=
ll:</div><div><br></div><div><i>Write succeeds to NODE A but fails on node =
B,C</i></div><div><i><br></i></div><div><i>Read goes to NODE B, C</i></div>=
<div><br></div><div>If you can tolerate some temporary inaccuracy you can u=
se QUORUM but may still have the situation where</div><div><br></div><div>W=
rite succeeds on node A a timestamp 1, B succeeds at timestamp 2</div><div>=
Read succeeds on node B and C at timestamp 1=C2=A0</div><div><br></div><div=
>If you need fully race condition free counts I&#39;m afraid you need to us=
e SERIAL or LOCAL_SERIAL (for in DC only accuracy)</div></div><div class=3D=
"gmail_extra"><div class=3D"elided-text"><br><div class=3D"gmail_quote">On =
Thu, Mar 16, 2017 at 1:04 PM, srinivasarao daruna <span dir=3D"ltr">&lt;<a =
href=3D"mailto:sree.srinu38@gmail.com" target=3D"_blank">sree.srinu38@gmail=
.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"ma=
rgin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"au=
to"><div>Replication strategy is SimpleReplicationStrategy.</div><div dir=
=3D"auto"><br></div><div dir=3D"auto">Smith is : EC2 snitch. As we deployed=
 cluster on EC2 instances.</div><div dir=3D"auto"><br></div><div dir=3D"aut=
o">I was worried that CL=3DALL have more read latency and read failures. Bu=
t won&#39;t rule out trying it.</div><div dir=3D"auto"><br></div><div dir=
=3D"auto">Should I switch select count (*) to select partition_key column? =
Would that be of any help.?</div><div dir=3D"auto"><br></div><div dir=3D"au=
to"><br></div><div dir=3D"auto">Thank you=C2=A0</div><div dir=3D"auto">Rega=
rds</div><div dir=3D"auto">Srini</div><div><div class=3D"m_7404529461033236=
165h5"><div dir=3D"auto"><div class=3D"gmail_extra" dir=3D"auto"><br><div c=
lass=3D"gmail_quote">On Mar 16, 2017 12:46 PM, &quot;Arvydas Jonusonis&quot=
; &lt;<a href=3D"mailto:arvydas.jonusonis@gmail.com" target=3D"_blank">arvy=
das.jonusonis@gmail.com</a>&gt; wrote:<br type=3D"attribution"><blockquote =
class=3D"m_7404529461033236165m_758019750998102207quote" style=3D"margin:0 =
0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr">What=
 are your replication strategy and snitch settings?<div><br></div><div>Have=
 you tried doing a read at CL=3DALL? If it&#39;s an actual inconsistency is=
sue (missing data), this should cause the correct results to be returned. Y=
ou&#39;ll need to run a repair to fix the inconsistencies.</div><div><br></=
div><div>If all the data is actually there, you might have one or several n=
odes that aren&#39;t identifying the correct replicas.</div><font color=3D"=
#888888"><div><br></div><div>Arvydas</div></font><div class=3D"m_7404529461=
033236165m_758019750998102207elided-text"><div><br></div><div><br><div clas=
s=3D"gmail_extra"><br><div class=3D"gmail_quote">On Thu, Mar 16, 2017 at 5:=
31 PM, srinivasarao daruna <span dir=3D"ltr">&lt;<a href=3D"mailto:sree.sri=
nu38@gmail.com" target=3D"_blank">sree.srinu38@gmail.com</a>&gt;</span> wro=
te:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-=
left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr">Hi Team,=C2=A0<br><d=
iv><div class=3D"m_7404529461033236165m_758019750998102207m_134891216027752=
1144m_-4200632199582104151gmail_signature"><div dir=3D"ltr"><div dir=3D"ltr=
"><div dir=3D"ltr"><div dir=3D"ltr"><div><div><br></div><div>We are struggl=
ing with a problem related to cassandra counts, after backup and restore of=
 the cluster. Aaron Morton has suggested to send this to user list, so some=
 one of the list will be able to help me.=C2=A0</div><div><br></div><div>We=
 are have a rest api to talk to cassandra and one of our query which fetche=
s count is creating problems for us.</div><div><br></div><div>We have done =
backup and restore and copied all the data to new cluster. We have done nod=
etool refresh on the tables, and did the nodetool repair as well.</div><div=
><br></div><div>However, one of our key API call is returning inconsistent =
results. The result count is 0 in the first call and giving the actual valu=
es for later calls. The query frequency is bit high and failure rate has al=
so raised considerably.</div><div><br></div><div>1) The count query has par=
tition keys in it. Didnt see any read timeout or any errors from api logs.<=
/div><div><br></div><div>2) This is how our code of creating session looks.=
</div><div><br></div><div>val poolingOptions =3D new PoolingOptions</div><d=
iv>=C2=A0 =C2=A0 poolingOptions</div><div>=C2=A0 =C2=A0 =C2=A0 .setCoreConn=
ectionsPerHost(Hos<wbr>tDistance.LOCAL, 4)</div><div>=C2=A0 =C2=A0 =C2=A0 .=
setMaxConnectionsPerHost(Host<wbr>Distance.LOCAL, 10)</div><div>=C2=A0 =C2=
=A0 =C2=A0 .setCoreConnectionsPerHost(Hos<wbr>tDistance.REMOTE, 4)</div><di=
v>=C2=A0 =C2=A0 =C2=A0 .setMaxConnectionsPerHost( HostDistance.REMOTE, 10)<=
/div><div><br></div><div>val builtCluster =3D clusterBuilder.withCredential=
s<wbr>(username, password)</div><div>=C2=A0 =C2=A0 =C2=A0 .withPoolingOptio=
ns(poolingOpt<wbr>ions)</div><div>=C2=A0 =C2=A0 =C2=A0 .build()</div><div>v=
al cassandraSession =3D builtCluster.get.connect()</div><div><br></div><div=
>val preparedStatement =3D cassandraSession.prepare(state<wbr>ment).setCons=
istencyLevel(Cons<wbr>istencyLevel.QUORUM)</div><div>cassandraSession.execu=
te(prepa<wbr>redStatement.bind(args :_*))</div><div><br></div><div>Query: S=
ELECT count(*) FROM table_name WHERE parition_column=3D? AND text_column_of=
_clustering_key=3D<wbr>? AND date_column_of_clustering_key&lt;<wbr>=3D? AND=
 date_column_of_clustering_key&gt;<wbr>=3D?</div><div><br></div><div>3) Clu=
ster configuration:</div><div><br></div><div><span class=3D"m_7404529461033=
236165m_758019750998102207m_1348912160277521144m_-4200632199582104151gmail-=
Apple-tab-span" style=3D"white-space:pre-wrap">	</span>6 Machines: 3 seeds,=
 we are using apache cassandra 3.9 version. Each machine is equipped with 1=
6 Cores and 64 GB Ram.</div><div><br></div><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0=
 Replication factor is 3, and write consistency is ONE and read consistency=
 is QUORUM.</div><div><br></div><div>4) cassandra is never down on any mach=
ine</div><div><br></div><div>5) Using cassandra-driver-core artifact with 3=
.1.1 version in the api.</div><div><br></div><div>6) nodetool tpstats shows=
 no read failures, and no other failures.</div><div><br></div><div>7) Do no=
t see any other issues from system.log of cassandra. We just see few warnin=
gs as below.</div><div><br></div><div>Maximum memory usage reached (512.000=
MiB), cannot allocate chunk of 1.000MiB</div><div>WARN =C2=A0[ScheduledTask=
s:1] 2017-03-14 14:58:37,141 QueryProcessor.java:103 - 88 prepared statemen=
ts discarded in the last minute because cache limit reached (32 MB)</div><d=
iv>The first api call returns 0 and the api calls later gives right values.=
</div><div><br></div><div>Please let me know, if any other details needed.<=
/div><div>Could you please have a look at this issue once and kindly give m=
e your inputs? This issue literally broke the confidence on Cassandra from =
our business team.</div><div><br></div><div>Your inputs will be really help=
ful.</div><div><br></div>Thank You,<br></div><div>Regards,=C2=A0</div><div>=
Srini</div></div></div></div></div></div></div>
</div>
</blockquote></div><br></div></div></div></div>
</blockquote></div><br></div></div></div></div></div>
</blockquote></div><br><br clear=3D"all"><div><br></div></div><font color=
=3D"#888888">-- <br><div class=3D"m_7404529461033236165gmail_signature" dat=
a-smartmail=3D"gmail_signature"><div dir=3D"ltr"><div><p></p><p>Thanks,</p>=
<div>Ryan Svihla</div><p></p><p></p></div></div></div>
</font></div>
</blockquote></div><br></div></div></div>

--001a114e45c0b8b639054adc4217--