Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of colpclark@gmail.com designates
 209.85.216.180 as permitted sender)
Subject: Re: Possible to Add multiple columns in one query ?
References: <00d601cf781e$4e958ed0$ebc0ac70$@petrolink.com>
 <2DA633FC78F54A9B969EE87EDB593C14@JackKrupansky14>
From: Colin <colpclark@gmail.com>
Content-Type: multipart/alternative;
	boundary=Apple-Mail-8F2945C0-E3E5-434F-A923-6F0D7C2AEA14
In-Reply-To: <2DA633FC78F54A9B969EE87EDB593C14@JackKrupansky14>
Message-Id: <A177DB39-8FA1-492E-A7CB-556864B19B46@gmail.com>
Date: Sun, 25 May 2014 14:01:43 -0500
To: "user@cassandra.apache.org" <user@cassandra.apache.org>
Content-Transfer-Encoding: 7bit
Mime-Version: 1.0 (1.0)


--Apple-Mail-8F2945C0-E3E5-434F-A923-6F0D7C2AEA14
Content-Type: text/plain;
	charset=utf-8
Content-Transfer-Encoding: quoted-printable

Try asynch updates, and collect the futures at 1,000 and play around from th=
ere. =20

Also, in the real world, you'd want to use load balancing and token aware po=
licies when connecting to the cluster.  This will actually bypass the coordi=
nator and write directly to the correct nodes.

I will post a link to my github with an example when I get off the road

--
Colin
320-221-9531


> On May 25, 2014, at 1:56 PM, "Jack Krupansky" <jack@basetechnology.com> wr=
ote:
>=20
> Typo: I presume =E2=80=9Cchannelid=E2=80=9D should be =E2=80=9Ctagid=E2=80=
=9D for the partition key for your table.
> =20
> Yes, BATCH statements are the way to go, but be careful not to make your b=
atches too large, otherwise you could lose performance when Cassandra is rel=
atively idle while the batch is slowly streaming in to the coordinator node o=
ver the network. Better to break up a large batch into multiple moderate siz=
e batches (exact size and number will vary and need testing to deduce) that w=
ill transmit quicker and can be executed in parallel.
> =20
> I=E2=80=99m not sure Cassandra on a laptop would be the best measure of pe=
rformance for a real cluster, especially compared to a server with more CPU c=
ores than your laptop.
> =20
> And for a real cluster, rows with different partition keys can be sent to a=
 coordinator node that owns that partition key, which could be multiple node=
s for RF>1.
> =20
> -- Jack Krupansky
> =20
> From: Mark Farnan
> Sent: Sunday, May 25, 2014 9:36 AM
> To: user@cassandra.apache.org
> Subject: Possible to Add multiple columns in one query ?
> =20
> I=E2=80=99m sure this is a  CQL 101 question, but. =20
> =20
> Is it possible to add MULTIPLE   Rows/Columns  to a single Partition in a s=
ingle CQL 3  Query / Call.=20
> =20
> Need:
> I=E2=80=99m trying to find the most efficient way to add multiple time ser=
ies events to a table in a single call.
> Whilst most time series data comes in sequentially, we have a case where i=
t is often loaded in bulk,  say sent  100,000 points for 50  channels/tags  a=
t one go.  (sometimes more), and this needs to be loaded as quickly and effi=
ciently as possible.
> =20
> Fairly standard Time-Series schema (this is for testing purposes only at t=
his point, and doesn=E2=80=99t represent final schemas)
> =20
> CREATE TABLE tag (
>   tagid int,
>   idx timestamp,
>   value double,
>   PRIMARY KEY (channelid, idx)
> ) WITH CLUSTERING ORDER BY (idx DESC);
> =20
> =20
> Currently I=E2=80=99m using Batch statements, but even that is not fast en=
ough.
> =20
> Note: At this point I=E2=80=99m testing on a single node cluster on laptop=
, to compare different versions.
> =20
> We are using DataStax C# 2.0 (beta) client. And Cassandra 2.0.7
> =20
> Regards
> Mark.

--Apple-Mail-8F2945C0-E3E5-434F-A923-6F0D7C2AEA14
Content-Type: text/html;
	charset=utf-8
Content-Transfer-Encoding: quoted-printable

<html><head><meta http-equiv=3D"content-type" content=3D"text/html; charset=3D=
utf-8"></head><body dir=3D"auto"><div>Try asynch updates, and collect the fu=
tures at 1,000 and play around from there. &nbsp;</div><div><br></div><div>A=
lso, in the real world, you'd want to use load balancing and token aware pol=
icies when connecting to the cluster. &nbsp;This will actually bypass the co=
ordinator and write directly to the correct nodes.</div><div><br></div><div>=
I will post a link to my github with an example when I get off the road</div=
><div><br>--<div>Colin</div><div>320-221-9531</div><div><br></div></div><div=
><br>On May 25, 2014, at 1:56 PM, "Jack Krupansky" &lt;<a href=3D"mailto:jac=
k@basetechnology.com">jack@basetechnology.com</a>&gt; wrote:<br><br></div><b=
lockquote type=3D"cite"><div>
<meta content=3D"text/html; charset=3Dus-ascii" http-equiv=3D"Content-Type">=

<meta name=3D"Generator" content=3D"Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
	{font-family:"Cambria Math";
	panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
	{font-family:Calibri;
	panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
	{margin:0in;
	margin-bottom:.0001pt;
	font-size:11.0pt;
	font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
	{mso-style-priority:99;
	color:#0563C1;
	text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
	{mso-style-priority:99;
	color:#954F72;
	text-decoration:underline;}
span.EmailStyle17
	{mso-style-type:personal-compose;
	font-family:"Calibri","sans-serif";
	color:windowtext;}
.MsoChpDefault
	{mso-style-type:export-only;
	font-family:"Calibri","sans-serif";}
@page WordSection1
	{size:8.5in 11.0in;
	margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
	{page:WordSection1;}
--></style>


<div dir=3D"ltr">
<div style=3D"FONT-SIZE: 12pt; FONT-FAMILY: 'Calibri'; COLOR: #000000">
<div>Typo: I presume =E2=80=9Cchannelid=E2=80=9D should be =E2=80=9Ctagid=E2=
=80=9D for the partition key for=20
your table.</div>
<div>&nbsp;</div>
<div>Yes, BATCH statements are the way to go, but be careful not to make you=
r=20
batches too large, otherwise you could lose performance when Cassandra is=20=

relatively idle while the batch is slowly streaming in to the coordinator no=
de=20
over the network. Better to break up a large batch into multiple moderate si=
ze=20
batches (exact size and number will vary and need testing to deduce) that wi=
ll=20
transmit quicker and can be executed in parallel.</div>
<div>&nbsp;</div>
<div>I=E2=80=99m not sure Cassandra on a laptop would be the best measure of=
 performance=20
for a real cluster, especially compared to a server with more CPU cores than=
=20
your laptop.</div>
<div>&nbsp;</div>
<div>And for a real cluster, rows with different partition keys can be sent t=
o a=20
coordinator node that owns that partition key, which could be multiple nodes=
 for=20
RF&gt;1.</div>
<div>&nbsp;</div>
<div style=3D"FONT-SIZE: 12pt; FONT-FAMILY: 'Calibri'; COLOR: #000000">-- Ja=
ck=20
Krupansky</div>
<div style=3D"FONT-SIZE: small; TEXT-DECORATION: none; FONT-FAMILY: &quot;Ca=
libri&quot;; FONT-WEIGHT: normal; COLOR: #000000; FONT-STYLE: normal; DISPLA=
Y: inline">
<div style=3D"FONT: 10pt tahoma">
<div>&nbsp;</div>
<div style=3D"BACKGROUND: #f5f5f5">
<div style=3D"font-color: black"><b>From:</b> <a title=3D"devmail@petrolink.=
com" href=3D"mailto:devmail@petrolink.com">Mark Farnan</a> </div>
<div><b>Sent:</b> Sunday, May 25, 2014 9:36 AM</div>
<div><b>To:</b> <a title=3D"user@cassandra.apache.org" href=3D"mailto:user@c=
assandra.apache.org">user@cassandra.apache.org</a> </div>
<div><b>Subject:</b> Possible to Add multiple columns in one query=20
?</div></div></div>
<div>&nbsp;</div></div>
<div style=3D"FONT-SIZE: small; TEXT-DECORATION: none; FONT-FAMILY: &quot;Ca=
libri&quot;; FONT-WEIGHT: normal; COLOR: #000000; FONT-STYLE: normal; DISPLA=
Y: inline">
<div class=3D"WordSection1">
<p class=3D"MsoNormal">I=E2=80=99m sure this is a&nbsp; CQL 101 question, bu=
t.=20
<o:p></o:p></p>
<p class=3D"MsoNormal"><o:p></o:p>&nbsp;</p>
<p class=3D"MsoNormal">Is it possible to add MULTIPLE&nbsp;&nbsp; Rows/Colum=
ns&nbsp;=20
to a single Partition in a single CQL 3&nbsp; Query / Call.&nbsp;=20
<o:p></o:p></p>
<p class=3D"MsoNormal"><o:p></o:p>&nbsp;</p>
<p class=3D"MsoNormal">Need: <o:p></o:p></p>
<p class=3D"MsoNormal" style=3D"TEXT-INDENT: 0.5in">I=E2=80=99m trying to fi=
nd the most=20
efficient way to add multiple time series events to a table in a single call=
.=20
<o:p></o:p></p>
<p class=3D"MsoNormal" style=3D"TEXT-INDENT: 0.5in">Whilst most time series d=
ata comes=20
in sequentially, we have a case where it is often loaded in bulk,&nbsp; say=20=

sent&nbsp; 100,000 points for 50&nbsp; channels/tags&nbsp; at one go.&nbsp;=20=

(sometimes more), and this needs to be loaded as quickly and efficiently as=20=

possible. <o:p></o:p></p>
<p class=3D"MsoNormal"><o:p></o:p>&nbsp;</p>
<p class=3D"MsoNormal">Fairly standard Time-Series schema (this is for testi=
ng=20
purposes only at this point, and doesn=E2=80=99t represent final schemas)=20=

<o:p></o:p></p>
<p class=3D"MsoNormal"><o:p></o:p>&nbsp;</p>
<p class=3D"MsoNormal" style=3D"MARGIN-LEFT: 0.5in; TEXT-AUTOSPACE: "><span s=
tyle=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #a5790=
0">CREATE</span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier N=
ew&quot;; COLOR: black"> </span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY:=
 &quot;Courier New&quot;; COLOR: #a57900">TABLE</span><b><span style=3D"FONT=
-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #268bd2">=20
<u>tag</u></span></b><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Cour=
ier New&quot;; COLOR: black"> (</span><span style=3D"FONT-SIZE: 10pt; FONT-FA=
MILY: &quot;Courier New&quot;"><o:p></o:p></span></p>
<p class=3D"MsoNormal" style=3D"MARGIN-LEFT: 0.5in; TEXT-AUTOSPACE: "><b><sp=
an style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #6=
c71c4">&nbsp;=20
tagid</span></b><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier N=
ew&quot;; COLOR: black"> </span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY:=
 &quot;Courier New&quot;; COLOR: #a57900">int</span><span style=3D"FONT-SIZE=
: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: black">,</span><span st=
yle=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;"><o:p></o:p></s=
pan></p>
<p class=3D"MsoNormal" style=3D"MARGIN-LEFT: 0.5in; TEXT-AUTOSPACE: "><b><sp=
an style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #6=
c71c4">&nbsp;=20
idx</span></b><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New=
&quot;; COLOR: black"> </span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &=
quot;Courier New&quot;; COLOR: #a57900">timestamp</span><span style=3D"FONT-=
SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: black">,</span><spa=
n style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;"><o:p></o:p=
></span></p>
<p class=3D"MsoNormal" style=3D"MARGIN-LEFT: 0.5in; TEXT-AUTOSPACE: "><b><sp=
an style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #6=
c71c4">&nbsp;=20
value</span></b><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier N=
ew&quot;; COLOR: black"> </span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY:=
 &quot;Courier New&quot;; COLOR: #a57900">double</span><span style=3D"FONT-S=
IZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: black">,</span><span=
 style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;"><o:p></o:p>=
</span></p>
<p class=3D"MsoNormal" style=3D"MARGIN-LEFT: 0.5in; TEXT-AUTOSPACE: "><span s=
tyle=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: black"=
>&nbsp;=20
</span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;;=
 COLOR: #a57900">PRIMARY</span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &=
quot;Courier New&quot;; COLOR: black"> </span><span style=3D"FONT-SIZE: 10pt=
; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #a57900">KEY</span><span styl=
e=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: black">=20=

(</span><b><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&qu=
ot;; COLOR: #6c71c4">channelid</span></b><span style=3D"FONT-SIZE: 10pt; FON=
T-FAMILY: &quot;Courier New&quot;; COLOR: black">,</span><b><span style=3D"FO=
NT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #6c71c4">=20
idx</span></b><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New=
&quot;; COLOR: black">)</span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &=
quot;Courier New&quot;"><o:p></o:p></span></p>
<p class=3D"MsoNormal" style=3D"MARGIN-LEFT: 0.5in"><span style=3D"FONT-SIZE=
: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: black">) </span><span s=
tyle=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #a5790=
0">WITH</span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New=
&quot;; COLOR: black"> </span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &=
quot;Courier New&quot;; COLOR: #a57900">CLUSTERING</span><span style=3D"FONT=
-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: black"> </span><sp=
an style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #a=
57900">ORDER</span><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courie=
r New&quot;; COLOR: black"> </span><span style=3D"FONT-SIZE: 10pt; FONT-FAMI=
LY: &quot;Courier New&quot;; COLOR: #a57900">BY</span><span style=3D"FONT-SI=
ZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: black">=20
(</span><b><span style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&qu=
ot;; COLOR: #6c71c4">idx</span></b><span style=3D"FONT-SIZE: 10pt; FONT-FAMI=
LY: &quot;Courier New&quot;; COLOR: black"> </span><span style=3D"FONT-SIZE:=
 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: #a57900">DESC</span><spa=
n style=3D"FONT-SIZE: 10pt; FONT-FAMILY: &quot;Courier New&quot;; COLOR: bla=
ck">);<o:p></o:p></span></p>
<p class=3D"MsoNormal"><o:p></o:p>&nbsp;</p>
<p class=3D"MsoNormal"><o:p></o:p>&nbsp;</p>
<p class=3D"MsoNormal">Currently I=E2=80=99m using Batch statements, but eve=
n that is not=20
fast enough. <o:p></o:p></p>
<p class=3D"MsoNormal"><o:p></o:p>&nbsp;</p>
<p class=3D"MsoNormal">Note: At this point I=E2=80=99m testing on a single n=
ode cluster on=20
laptop, to compare different versions.<o:p></o:p></p>
<p class=3D"MsoNormal"><o:p></o:p>&nbsp;</p>
<p class=3D"MsoNormal">We are using DataStax C# 2.0 (beta) client. And Cassa=
ndra=20
2.0.7<o:p></o:p></p>
<p class=3D"MsoNormal"><o:p></o:p>&nbsp;</p>
<p class=3D"MsoNormal">Regards<o:p></o:p></p>
<p class=3D"MsoNormal">Mark. <o:p></o:p></p></div></div></div></div>
</div></blockquote></body></html>=

--Apple-Mail-8F2945C0-E3E5-434F-A923-6F0D7C2AEA14--