cassandra-user mailing list archives

From xavier manach <...@tekio.org>
Subject Granularity SSTables.
Date Thu, 10 Jun 2010 10:54:45 GMT
Hi.

  I am trying to understand the tricks I can use with SSTables for
faster manipulation of data in clusters.

I learned how to copy a keyspace from the data directories to a new node and
change the replication factor (thanks, Jonathan).

If I understood correctly, each SSTable has 3 files:
  ColumnFamily-ID-Data.db
  ColumnFamily-ID-Index.db
  ColumnFamily-ID-Filter.db
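
To make the naming convention concrete, here is a minimal Python sketch that groups the file names in a data directory by SSTable and reports any SSTable whose three-file set is incomplete. The pattern ColumnFamily-ID-Component.db with components Data, Index and Filter is my assumption about the layout; the function names are hypothetical and not part of Cassandra.

```python
import re
from collections import defaultdict

# Assumed naming pattern: <ColumnFamily>-<ID>-<Component>.db
SSTABLE_RE = re.compile(
    r"^(?P<cf>.+)-(?P<id>\d+)-(?P<component>Data|Index|Filter)\.db$"
)

EXPECTED_COMPONENTS = {"Data", "Index", "Filter"}


def group_sstables(filenames):
    """Map (column family, id) to the set of components found for it."""
    groups = defaultdict(set)
    for name in filenames:
        match = SSTABLE_RE.match(name)
        if match:  # names that do not fit the pattern are ignored
            key = (match.group("cf"), int(match.group("id")))
            groups[key].add(match.group("component"))
    return dict(groups)


def incomplete_sstables(groups):
    """Return the sorted keys that do not have all three component files."""
    return sorted(
        key for key, components in groups.items()
        if components != EXPECTED_COMPONENTS
    )
```

Running incomplete_sstables(group_sstables(os.listdir(some_dir))) before and after a copy would be one way to check that no SSTable was copied only partially.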

  Suppose I want to merge data from 2 clusters that have different keys
(each key exists in only one cluster) but the same ColumnFamily.
Can I copy all the SSTable files using the same method?
> 1. nodetool drain & stop original node
> 2. copy everything  ***SSTable files***  in data/ directories (but not system keyspace!)
> to new node
> 3. restart and autobootstrap=false [the default]
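
As a rough illustration of step 2, the copy could be sketched in Python as below. This is only a sketch under assumptions: I assume one subdirectory per keyspace under data/, with the .db files directly inside, and the function name and parameters are hypothetical. Because the file names contain only the ColumnFamily and an ID, files coming from two clusters could share a name, so the sketch never overwrites an existing file and reports it as a conflict instead. It should only be run while the node is stopped.

```python
import shutil
from pathlib import Path


def copy_sstables(src_data_dir, dst_data_dir, skip_keyspaces=("system",)):
    """Copy *.db files keyspace by keyspace, skipping the system keyspace.

    Returns (copied, conflicts). A file whose name already exists at the
    destination is never overwritten; it is listed in conflicts instead.
    """
    src, dst = Path(src_data_dir), Path(dst_data_dir)
    copied, conflicts = [], []
    # Assumed layout: <data dir>/<keyspace>/<ColumnFamily>-<ID>-<Component>.db
    for keyspace_dir in sorted(p for p in src.iterdir() if p.is_dir()):
        if keyspace_dir.name in skip_keyspaces:
            continue
        target_dir = dst / keyspace_dir.name
        target_dir.mkdir(parents=True, exist_ok=True)
        for source_file in sorted(keyspace_dir.glob("*.db")):
            target = target_dir / source_file.name
            if target.exists():
                conflicts.append(target)
            else:
                shutil.copy2(source_file, target)  # keeps timestamps
                copied.append(target)
    return copied, conflicts
```

Whether files that end up in the conflicts list can simply be given a new ID is exactly the kind of thing I am not sure about.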

Thx.



On Tue, Jun 8, 2010 at 7:12 AM, xavier manach <xav@tekio.org> wrote:
> Hi.
>
>   I have a cluster with only 1 node and a lot of data (500 GB).
>   I want to add a new node with the same data (with a ReplicationFactor
> of 2).
>
> The normal method is:
> stop node.
> add a node.
> change replication factor to 2.
> start nodes
> use nodetool repair
>
>   But I don't know whether this other method is valid, and whether it
> can be faster:
> stop nodes.
> copy all SSTables
> change replication factor.
> start nodes
> and
> use nodetool repair
>
>   Do you have an idea which valid method is fastest?
>
> Thx.
>
