Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (athena.apache.org: domain of tivv00@gmail.com designates
 209.85.214.44 as permitted sender)
Message-ID: <4FDA0A93.50502@gmail.com>
Date: Thu, 14 Jun 2012 19:00:19 +0300
From: Vitalii Tymchyshyn <tivv00@gmail.com>
User-Agent: Mozilla/5.0 (X11; Linux x86_64;
 rv:12.0) Gecko/20120430 Thunderbird/12.0.1
MIME-Version: 1.0
To: crypto five <cryptofive@gmail.com>
CC: user@cassandra.apache.org
Subject: Re: Failing operations & repair
References: 
 <CABWW-d38GiMCjg98UrHkN1L9BT-ETjYPLi6KM0FtJXHU7j5Fpw@mail.gmail.com>
 <2B388F32-0289-42EE-AAA7-29264A973A1F@thelastpickle.com>
 <CABWW-d3vQ8oD3e0gRTaeE_kwFz_RjW2ONFGeC4rj2rQiPZgn4w@mail.gmail.com>
 <CAEAJz-rb5+10F8vMKf+-7+Awo5+JEMmba=1WgBphmH_ujDoa5w@mail.gmail.com>
In-Reply-To: 
 <CAEAJz-rb5+10F8vMKf+-7+Awo5+JEMmba=1WgBphmH_ujDoa5w@mail.gmail.com>
Content-Type: multipart/alternative;
 boundary="------------000200090907050509020400"

This is a multi-part message in MIME format.
--------------000200090907050509020400
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit

Hello.

For sure. Here they are: 
http://www.slideshare.net/vittim1/practical-cassandra
Slides are in english.
I've presented this presentation some time ago at JEEConf and once more 
yesterday in local developers club.
There should be video recording (russian) available somewhen, but it's 
not here yet.

Best regards, Vitalii Tymchyshyn

13.06.12 02:27, crypto five ???????(??):
> It would be really great to look at your slides. Do you have any plans 
> to share your presentation?
>
> On Sat, Jun 9, 2012 at 1:14 AM, ??????? ???????? <tivv00@gmail.com 
> <mailto:tivv00@gmail.com>> wrote:
>
>     Thanks a lot. I was not sure if coordinator somehow tries to
>     "roll-back" transactions that failed to reach it's consistency level.
>     (Yet I could not imagine a method to do this, without 2-phase
>     commit :) )
>
>
>     2012/6/8 aaron morton <aaron@thelastpickle.com
>     <mailto:aaron@thelastpickle.com>>
>
>>         I am making some cassandra presentations in Kyiv and would
>>         like to check that I am telling people truth :)
>         Thanks for spreading the word :)
>
>>         1) Failed (from client-side view) operation may still be
>>         applied to cluster
>         Yes.
>         If you fail with UnavailableException it's because from the
>         coordinators view of the cluster there is less than CL nodes
>         available. So retry. Somewhat similar story with
>         TimedOutException.
>
>>         2) Coordinator does not try anything to "roll-back" operation
>>         that failed because it was processed by less then consitency
>>         level number of nodes.
>         Correct.
>
>>         3) Hinted handoff works only for successfull operations.
>         HH will be stored if the coordinator proceeds with the request.
>         In 1.X HH is stored on the coordinator if a replica is down
>         when the request starts and if the node does not reply in
>         rpc_timeout.
>
>>         4) Counters are not reliable because of (1)
>         If you get a TimedOutException when writing a counter you
>         should not re-send the request.
>
>>         5) Read-repair may help to propagate operation that was
>>         failed it's consistency level, but was persisted to some nodes.
>         Yes. It works in the background, by default is only enabled on
>         10% of requests.
>         Note that RR is not the same as the Consistent Level for read.
>         If you work as a CL > ONE the results from CL nodes are always
>         compared and differences resolved. RR is concerned with the
>         replicas not involved in the CL read.
>
>>         6) Manual repair is still needed because of (2) and (3)
>         Manual repair is *the* was to achieve consistency of data on
>         disk. HH and RR are optimisations designed to reduce the
>         chance of a Digest Mismatch during a read with CL > ONE.
>         It is also essential for distributing Tombstones before they
>         are purged by compaction.
>>         P.S. If some points apply only to some cassandra versions, I
>>         will be happy to know this too.
>         Assume everyone for version 1.X
>
>         Thanks
>
>         -----------------
>         Aaron Morton
>         Freelance Developer
>         @aaronmorton
>         http://www.thelastpickle.com
>
>         On 8/06/2012, at 1:20 AM, ??????? ???????? wrote:
>
>>         Hello.
>>
>>         I am making some cassandra presentations in Kyiv and would
>>         like to check that I am telling people truth :)
>>         Could community tell me if next points are true:
>>         1) Failed (from client-side view) operation may still be
>>         applied to cluster
>>         2) Coordinator does not try anything to "roll-back" operation
>>         that failed because it was processed by less then consitency
>>         level number of nodes.
>>         3) Hinted handoff works only for successfull operations.
>>         4) Counters are not reliable because of (1)
>>         5) Read-repair may help to propagate operation that was
>>         failed it's consistency level, but was persisted to some nodes.
>>         6) Manual repair is still needed because of (2) and (3)
>>
>>         P.S. If some points apply only to some cassandra versions, I
>>         will be happy to know this too.
>>         -- 
>>         Best regards,
>>          Vitalii Tymchyshyn
>
>
>
>
>     -- 
>     Best regards,
>      Vitalii Tymchyshyn
>
>


--------------000200090907050509020400
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit

<html>
  <head>
    <meta content="text/html; charset=ISO-8859-1"
      http-equiv="Content-Type">
  </head>
  <body bgcolor="#FFFFFF" text="#000000">
    Hello. <br>
    <br>
    For sure. Here they are:
    <meta http-equiv="content-type" content="text/html;
      charset=ISO-8859-1">
    <a href="http://www.slideshare.net/vittim1/practical-cassandra">http://www.slideshare.net/vittim1/practical-cassandra</a><br>
    Slides are in english.<br>
    I've presented this presentation some time ago at JEEConf and once
    more yesterday in local developers club.<br>
    There should be video recording (russian) available somewhen, but
    it's not here yet.<br>
    <br>
    Best regards, Vitalii Tymchyshyn<br>
    <br>
    13.06.12 02:27, crypto five &#1085;&#1072;&#1087;&#1080;&#1089;&#1072;&#1074;(&#1083;&#1072;):
    <blockquote
cite="mid:CAEAJz-rb5+10F8vMKf+-7+Awo5+JEMmba=1WgBphmH_ujDoa5w@mail.gmail.com"
      type="cite">It would be really great to look at your slides. Do
      you have any plans to share your presentation?<br>
      <br>
      <div class="gmail_quote">On Sat, Jun 9, 2012 at 1:14 AM, &#1042;&#1110;&#1090;&#1072;&#1083;&#1110;&#1081;
        &#1058;&#1080;&#1084;&#1095;&#1080;&#1096;&#1080;&#1085; <span dir="ltr">&lt;<a moz-do-not-send="true"
            href="mailto:tivv00@gmail.com" target="_blank">tivv00@gmail.com</a>&gt;</span>
        wrote:<br>
        <blockquote class="gmail_quote" style="margin:0 0 0
          .8ex;border-left:1px #ccc solid;padding-left:1ex">Thanks a
          lot. I was not sure if coordinator somehow tries to
          "roll-back" transactions that failed to reach it's consistency
          level.
          <div>
            (Yet I could not imagine a method to do this, without
            2-phase commit :) )
            <div>
              <div class="h5"><br>
                <br>
                <div class="gmail_quote">2012/6/8 aaron morton <span
                    dir="ltr">&lt;<a moz-do-not-send="true"
                      href="mailto:aaron@thelastpickle.com"
                      target="_blank">aaron@thelastpickle.com</a>&gt;</span><br>
                  <blockquote class="gmail_quote" style="margin:0 0 0
                    .8ex;border-left:1px #ccc solid;padding-left:1ex">
                    <div style="word-wrap:break-word">
                      <div>
                        <blockquote type="cite">
                          <div>I am making some cassandra presentations
                            in Kyiv and would like to check that I am
                            telling people truth :)</div>
                        </blockquote>
                      </div>
                      <div>
                        <div>Thanks for spreading the word :)</div>
                      </div>
                      <div>
                        <div><br>
                        </div>
                        <div>
                          <blockquote type="cite">
                            <div>1) Failed&nbsp;(from client-side
                              view)&nbsp;operation may still be applied to
                              cluster</div>
                          </blockquote>
                        </div>
                      </div>
                      <div>
                        <div>Yes.&nbsp;</div>
                      </div>
                      <div>If you fail with UnavailableException it's
                        because from the coordinators view of the
                        cluster there is less than CL nodes available.
                        So retry. Somewhat similar story with
                        TimedOutException.&nbsp;</div>
                      <div>
                        <div><br>
                        </div>
                        <div>
                          <blockquote type="cite">
                            <div>2) Coordinator does not try anything to
                              "roll-back" operation that failed because
                              it was processed by less then consitency
                              level number of nodes.</div>
                          </blockquote>
                        </div>
                      </div>
                      <div>
                        <div>Correct.</div>
                      </div>
                      <div>
                        <div><br>
                        </div>
                        <div>
                          <blockquote type="cite">
                            <div>3) Hinted handoff works only for
                              successfull operations.</div>
                          </blockquote>
                        </div>
                      </div>
                      <div>
                        <div>HH will be stored if the coordinator
                          proceeds with the request.</div>
                        <div>In 1.X HH is stored on the coordinator if a
                          replica is down when the request starts and if
                          the node does not reply in rpc_timeout.&nbsp;</div>
                      </div>
                      <div>
                        <div><br>
                        </div>
                        <div>
                          <blockquote type="cite">
                            <div>4) Counters are not reliable because of
                              (1)</div>
                          </blockquote>
                        </div>
                      </div>
                      <div>
                        <div>If you get a TimedOutException when writing
                          a counter you should not re-send the request.&nbsp;</div>
                      </div>
                      <div>
                        <div><br>
                        </div>
                        <div>
                          <blockquote type="cite">
                            <div>5) Read-repair may help to propagate
                              operation that was failed it's consistency
                              level, but was persisted to some nodes.</div>
                          </blockquote>
                        </div>
                      </div>
                      <div>
                        <div>Yes. It works in the background, by default
                          is only enabled on 10% of requests.&nbsp;</div>
                      </div>
                      <div>Note that RR is not the same as the
                        Consistent Level for read. If you work as a CL
                        &gt; ONE the results from CL nodes are always
                        compared and differences resolved. RR is
                        concerned with the replicas not involved in the
                        CL read.&nbsp;</div>
                      <div>
                        <div><br>
                        </div>
                        <div>
                          <blockquote type="cite">
                            <div>6) Manual repair is still needed
                              because of (2) and (3)<br clear="all">
                            </div>
                          </blockquote>
                        </div>
                      </div>
                      <div>
                        <div>Manual repair is *the* was to achieve
                          consistency of data on disk. HH and RR are
                          optimisations designed to reduce the chance of
                          a Digest Mismatch during a read with CL &gt;
                          ONE.&nbsp;</div>
                      </div>
                      <div>It is also essential for distributing
                        Tombstones before they are purged by compaction.</div>
                      <div>
                        <div>
                          <blockquote type="cite">
                            <div>
                              <div>P.S. If some points apply only to
                                some cassandra versions, I will be happy
                                to know this too.</div>
                            </div>
                          </blockquote>
                        </div>
                      </div>
                      <div>
                        <div>
                          <div>Assume everyone for version 1.X</div>
                        </div>
                      </div>
                      <div><br>
                      </div>
                      <div>Thanks</div>
                      <div><br>
                      </div>
                      <div>
                        <span
style="text-indent:0px;letter-spacing:normal;font-variant:normal;text-align:-webkit-auto;font-style:normal;font-weight:normal;line-height:normal;border-collapse:separate;text-transform:none;font-size:medium;white-space:normal;font-family:Helvetica;word-spacing:0px"><span
style="text-indent:0px;letter-spacing:normal;font-variant:normal;font-style:normal;font-weight:normal;line-height:normal;border-collapse:separate;text-transform:none;font-size:medium;white-space:normal;font-family:Helvetica;word-spacing:0px">
                            <div style="word-wrap:break-word">
                              <span
style="text-indent:0px;letter-spacing:normal;font-variant:normal;font-style:normal;font-weight:normal;line-height:normal;border-collapse:separate;text-transform:none;font-size:medium;white-space:normal;font-family:Helvetica;word-spacing:0px">
                                <div style="word-wrap:break-word">
                                  <span
style="text-indent:0px;letter-spacing:normal;font-variant:normal;font-style:normal;font-weight:normal;line-height:normal;border-collapse:separate;text-transform:none;font-size:medium;white-space:normal;font-family:Helvetica;word-spacing:0px">
                                    <div style="word-wrap:break-word">
                                      <div>
                                        <div>-----------------</div>
                                        <div>Aaron Morton</div>
                                        <div>Freelance Developer</div>
                                        <div>@aaronmorton</div>
                                        <div><a moz-do-not-send="true"
                                            href="http://www.thelastpickle.com"
                                            target="_blank">http://www.thelastpickle.com</a></div>
                                      </div>
                                    </div>
                                  </span></div>
                              </span></div>
                          </span></span>
                      </div>
                      <div>
                        <div>
                          <br>
                          <div>
                            <div>On 8/06/2012, at 1:20 AM, &#1042;&#1110;&#1090;&#1072;&#1083;&#1110;&#1081;
                              &#1058;&#1080;&#1084;&#1095;&#1080;&#1096;&#1080;&#1085; wrote:</div>
                            <br>
                            <blockquote type="cite">Hello.
                              <div><br>
                              </div>
                              <div>I am making some cassandra
                                presentations in Kyiv and would like to
                                check that I am telling people truth :)</div>
                              <div>Could community tell me if next
                                points are true:</div>
                              <div>1) Failed&nbsp;(from client-side
                                view)&nbsp;operation may still be applied to
                                cluster</div>
                              <div>2) Coordinator does not try anything
                                to "roll-back" operation that failed
                                because it was processed by less then
                                consitency level number of nodes.</div>
                              <div>3) Hinted handoff works only for
                                successfull operations.</div>
                              <div>4) Counters are not reliable because
                                of (1)</div>
                              <div>5) Read-repair may help to propagate
                                operation that was failed it's
                                consistency level, but was persisted to
                                some nodes.</div>
                              <div>6) Manual repair is still needed
                                because of (2) and (3)<br clear="all">
                                <div><br>
                                </div>
                                <div>P.S. If some points apply only to
                                  some cassandra versions, I will be
                                  happy to know this too.</div>
                                -- <br>
                                Best regards,<br>
                                &nbsp;Vitalii Tymchyshyn<br>
                              </div>
                            </blockquote>
                          </div>
                          <br>
                        </div>
                      </div>
                    </div>
                  </blockquote>
                </div>
                <br>
                <br clear="all">
                <div><br>
                </div>
                -- <br>
                Best regards,<br>
                &nbsp;Vitalii Tymchyshyn<br>
              </div>
            </div>
          </div>
        </blockquote>
      </div>
      <br>
    </blockquote>
    <br>
  </body>
</html>

--------------000200090907050509020400--