cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Cassandra Wiki] Update of "VirtualNodes/Balance" by JonathanEllis
Date Thu, 02 Aug 2012 18:29:28 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Cassandra Wiki" for change notification.

The "VirtualNodes/Balance" page has been changed by JonathanEllis:

   * Shuffling will entail moving a ''lot'' of data around the cluster and so has the potential
to consume a lot of disk and network I/O, and to take a considerable amount of time.  For
this to be an online operation, the shuffle will need to operate on a lower priority basis
to other streaming operations, and should be expected to take days or weeks to complete.
+  * Corollary: shuffling should tell the operator what vnodes it plans to move where, and
report progress whenever one completes successfully.  This will allow recovering from an interrupted
shuffle, if necessary.
+  * Shuffling can be sped up by parallelizing such that each node has one vnode moving to
or from it at a time.  With appropriate stream throttling this should be better than just
one vnode at a time cluster-wide.
  === Nodes / Cluster ===
  The most straightforward method of effecting ownership is a token move (i.e. relocating
a range from one node to another).  Exposing this with JMX would allow implementing all of
the required operations client-side.

View raw message