Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: neutral (nike.apache.org: local policy)
MIME-Version: 1.0
In-Reply-To: <1407238578145-7596119.post@n2.nabble.com>
References: 
 <CAAZU44nPw1H5MiSRJj=6sp_nnoJejmdT6oEri7KUaDNe5FYF=w@mail.gmail.com>
	<CAEDUwd02sXC4-Sj+Bx-MBpFSbMLgj4PeKgTBT+cA2wctaWmhKQ@mail.gmail.com>
	<CAAZU44=Ee1Nx+tFxYjRM18-mBaii09OY+iRWwnfgZ1Qi2ttWdQ@mail.gmail.com>
	<CAEDUwd3bJA20uVNcVyEJJPM90cFZB5uHTGCdjhm=1fYCcrh9kQ@mail.gmail.com>
	<CD66E7784F3542F6ACB79F060207258D@JackKrupansky14>
	<1407226178633-7596106.post@n2.nabble.com>
	<CAA-p0Hru+jzY8XAhZiAXs-W+_JRrys34i=_GXxWTFFVCVPOnOQ@mail.gmail.com>
	<1407238578145-7596119.post@n2.nabble.com>
Date: Tue, 5 Aug 2014 13:43:18 +0100
Message-ID: 
 <CAFNWgMj6TSSzE4XNmCb_HYHE5YrqJG+cbPpto_5e0Azy_AVGqg@mail.gmail.com>
Subject: Re: Reasonable range for the max number of tables?
From: Michal Michalski <michal.michalski@boxever.com>
To: user@cassandra.apache.org
Content-Type: multipart/alternative; boundary=047d7bdc174233c03d04ffe13398

--047d7bdc174233c03d04ffe13398
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

>> - Use a keyspace per customer
> These effectively amount to the same thing, and they both fall foul of the
> limit on the number of column families, so they do not scale.

But with a keyspace per customer you can still scale easily by moving some
of the customers to a new cluster. If you keep everything in a single
keyspace or, worse, do your multitenancy by prefixing row keys with some
kind of customer ID, it won't be that easy, as you wrote later in your
e-mail.
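
A rough sketch of the difference, in Python and purely for illustration. All
names here (keyspace_for, prefixed_row_key, the "customer_" prefix and the ":"
separator) are hypothetical, and no real Cassandra API is used; the point is
only that one approach relocates whole keyspaces while the other has to
filter every row key.

```python
# Illustrative sketch only: every name and naming convention below is a
# hypothetical assumption, not part of Cassandra or any driver.

def keyspace_for(customer_id: str) -> str:
    """Keyspace per customer: each tenant owns a separately named keyspace."""
    return f"customer_{customer_id}"


def prefixed_row_key(customer_id: str, row_key: str) -> str:
    """Shared column families: the tenant is encoded as a row-key prefix."""
    return f"{customer_id}:{row_key}"


def keyspaces_to_move(customer_ids):
    """With a keyspace per customer, the unit to relocate is a whole keyspace."""
    return [keyspace_for(c) for c in customer_ids]


def rows_to_move(all_row_keys, customer_ids):
    """With prefixed keys, every row key must be inspected and filtered."""
    prefixes = tuple(f"{c}:" for c in customer_ids)
    return [k for k in all_row_keys if k.startswith(prefixes)]
```

In the first case the list of things to move grows with the number of
customers being moved; in the second, finding them means walking all the data.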

M.


Kind regards,
Michał Michalski,
michal.michalski@boxever.com


On 5 August 2014 12:36, Phil Luckhurst <phil.luckhurst@powerassure.com>
wrote:

> Hi Mark,
>
> Mark Reddy wrote
> > To segregate customer data, you could:
> > - Use customer specific column families under a single keyspace
> > - Use a keyspace per customer
>
> These effectively amount to the same thing, and they both fall foul of the
> limit on the number of column families, so they do not scale.
>
>
> Mark Reddy wrote
> > - Use the same column families and have a column that identifies the
> > customer. On the application layer ensure that there are sufficient
> > checks so one customer can't read another customer's data
>
> And while this gets around the column family limit, it does not allow the
> same level of data segregation. For example, with a separate keyspace or
> column families it is trivial to remove a single customer's data or move
> that data to another system. With one set of column families for all
> customers, these types of actions become much more difficult, as any
> change impacts all customers. But perhaps that's the price we have to pay
> to scale.
>
> And I still think this needs to be made more prominent in the
> documentation.
>
> Thanks
> Phil

--047d7bdc174233c03d04ffe13398--