Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
MIME-Version: 1.0
In-Reply-To: <CADQ6LYmR97EGadrBZrTMADWNzK4OTZH+bmsTFDZuVa8qc=9prw@mail.gmail.com>
References: <CAA0zji4K8=8iLHmTzpvwRy0CAZVuTNGZyxGNRy3sv4NJNk=TOw@mail.gmail.com>
 <CABNXB2Asv4aocR1=sszaCft+GXU9dfR6EZqvtc7aBJofLhxkNw@mail.gmail.com>
 <CAA0zji46tTme6ecdFr2iwcz-7OniLPXeC2q9rGcsdKOsR_vF0g@mail.gmail.com>
 <CACUnPaDKjVRdJTiavg4rXoB_-oLbGfvCkw88nvL-oMF_bWVhVA@mail.gmail.com>
 <CAPph6F+fs1Q+daSW-38ZUco1NGM+AcFtA3EVXj+W6kkh6g8-0g@mail.gmail.com>
 <CACUnPaCb7oQ6xwcjhH4AEbPN9YbbbeQrNU4McbqbG5if=+aMJg@mail.gmail.com>
 <CAA0zji6zVd5RFC76WUCkiKnWZ73iE2N827NmOGxSG+_8NF-Zyw@mail.gmail.com> <CADQ6LYmR97EGadrBZrTMADWNzK4OTZH+bmsTFDZuVa8qc=9prw@mail.gmail.com>
From: Edward Capriolo <edlinuxguru@gmail.com>
Date: Mon, 3 Oct 2016 16:38:24 -0400
Message-ID: <CAENxBwxhr5-Em7-mKAUrf74xq6yavJsqJM-CSJcJU59Cdzk9Yg@mail.gmail.com>
Subject: Re: An extremely fast cassandra table full scan utility
To: "user@cassandra.apache.org" <user@cassandra.apache.org>
Content-Type: multipart/alternative; boundary=001a11401bb8f874c2053dfbec68
archived-at: Mon, 03 Oct 2016 20:38:32 -0000

--001a11401bb8f874c2053dfbec68
Content-Type: text/plain; charset=UTF-8

I undertook a similar effort a while ago.

https://issues.apache.org/jira/browse/CASSANDRA-7014

Other than the fact that it was closed with no comments, I can tell you
that other efforts I had to embed things in Cassandra did not go
swimmingly. Although at the time ideas were rejected like groovy udfs

On Mon, Oct 3, 2016 at 4:22 PM, Bhuvan Rawal <bhu1rawal@gmail.com> wrote:

> Hi Jonathan,
>
> If full scan is a regular requirement then setting up a spark cluster in
> locality with Cassandra nodes makes perfect sense. But supposing that it is
> a one off requirement, say a weekly or a fortnightly task, a spark cluster
> could be an added overhead with additional capacity, resource planning as
> far as operations / maintenance is concerned.
>
> So this could be thought a simple substitute for a single threaded scan
> without additional efforts to setup and maintain another technology.
>
> Regards,
> Bhuvan
>
> On Tue, Oct 4, 2016 at 1:37 AM, siddharth verma <
> sidd.verma29.list@gmail.com> wrote:
>
>> Hi Jon,
>> It wan't allowed.
>> Moreover, if someone who isn't familiar with spark, and might be new to
>> map filter reduce etc. operations, could also use the utility for some
>> simple operations assuming a sequential scan of the cassandra table.
>>
>> Regards
>> Siddharth Verma
>>
>> On Tue, Oct 4, 2016 at 1:32 AM, Jonathan Haddad <jon@jonhaddad.com>
>> wrote:
>>
>>> Couldn't set up as couldn't get it working, or its not allowed?
>>>
>>> On Mon, Oct 3, 2016 at 3:23 PM Siddharth Verma <
>>> verma.siddharth@snapdeal.com> wrote:
>>>
>>>> Hi Jon,
>>>> We couldn't setup a spark cluster.
>>>>
>>>> For some use case, a spark cluster was required, but for some reason we
>>>> couldn't create spark cluster. Hence, one may use this utility to iterate
>>>> through the entire table at very high speed.
>>>>
>>>> Had to find a work around, that would be faster than paging on result
>>>> set.
>>>>
>>>> Regards
>>>>
>>>> Siddharth Verma
>>>> *Software Engineer I - CaMS*
>>>> *M*: +91 9013689856, *T*: 011 22791596 *EXT*: 14697
>>>> CA2125, 2nd Floor, ASF Centre-A, Jwala Mill Road,
>>>> Udyog Vihar Phase - IV, Gurgaon-122016, INDIA
>>>> Download Our App
>>>> [image: A]
>>>> <https://play.google.com/store/apps/details?id=com.snapdeal.main&utm_source=mobileAppLp&utm_campaign=android> [image:
>>>> A]
>>>> <https://itunes.apple.com/in/app/snapdeal-mobile-shopping/id721124909?ls=1&mt=8&utm_source=mobileAppLp&utm_campaign=ios> [image:
>>>> W]
>>>> <http://www.windowsphone.com/en-in/store/app/snapdeal/ee17fccf-40d0-4a59-80a3-04da47a5553f>
>>>>
>>>> On Tue, Oct 4, 2016 at 12:41 AM, Jonathan Haddad <jon@jonhaddad.com>
>>>> wrote:
>>>>
>>>> It almost sounds like you're duplicating all the work of both spark and
>>>> the connector. May I ask why you decided to not use the existing tools?
>>>>
>>>> On Mon, Oct 3, 2016 at 2:21 PM siddharth verma <
>>>> sidd.verma29.list@gmail.com> wrote:
>>>>
>>>> Hi DuyHai,
>>>> Thanks for your reply.
>>>> A few more features planned in the next one(if there is one) like,
>>>> custom policy keeping in mind the replication of token range on
>>>> specific nodes,
>>>> fine graining the token range(for more speedup),
>>>> and a few more.
>>>>
>>>> I think, as fine graining a token range,
>>>> If one token range is split further in say, 2-3 parts, divided among
>>>> threads, this would exploit the possible parallelism on a large scaled out
>>>> cluster.
>>>>
>>>> And, as you mentioned the JIRA, streaming of request, that would of
>>>> huge help with further splitting the range.
>>>>
>>>> Thanks once again for your valuable comments. :-)
>>>>
>>>> Regards,
>>>> Siddharth Verma
>>>>
>>>>
>>>>
>>
>

--001a11401bb8f874c2053dfbec68
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">I undertook a similar effort a while ago.=C2=A0<br><br><a =
href=3D"https://issues.apache.org/jira/browse/CASSANDRA-7014">https://issue=
s.apache.org/jira/browse/CASSANDRA-7014</a><br><br>Other than the fact that=
 it was closed with no comments, I can tell you that other efforts I had to=
 embed things in Cassandra did not go swimmingly. Although at the time idea=
s were rejected like groovy udfs=C2=A0</div><div class=3D"gmail_extra"><br>=
<div class=3D"gmail_quote">On Mon, Oct 3, 2016 at 4:22 PM, Bhuvan Rawal <sp=
an dir=3D"ltr">&lt;<a href=3D"mailto:bhu1rawal@gmail.com" target=3D"_blank"=
>bhu1rawal@gmail.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_qu=
ote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex=
"><div dir=3D"ltr"><div>Hi Jonathan,</div><div><br></div>If full scan is a =
regular requirement then setting up a spark cluster in locality with Cassan=
dra nodes makes perfect sense. But supposing that it is a one off requireme=
nt, say a weekly or a fortnightly task, a spark cluster could be an added o=
verhead with additional capacity, resource planning as far as operations / =
maintenance is concerned.=C2=A0<div><br></div><div>So this could be thought=
 a simple substitute for a single threaded scan without additional efforts =
to setup and maintain another technology.</div><div><br></div><div>Regards,=
</div><div>Bhuvan</div></div><div class=3D"gmail_extra"><br><div class=3D"g=
mail_quote">On Tue, Oct 4, 2016 at 1:37 AM, siddharth verma <span dir=3D"lt=
r">&lt;<a href=3D"mailto:sidd.verma29.list@gmail.com" target=3D"_blank">sid=
d.verma29.list@gmail.com</a>&gt;</span> wrote:<br><blockquote class=3D"gmai=
l_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left=
:1ex"><div dir=3D"ltr">Hi Jon,=C2=A0<div>It wan&#39;t allowed.<div>Moreover=
, if someone who isn&#39;t familiar with spark, and might be new to map fil=
ter reduce etc. operations, could also use the utility for some simple oper=
ations assuming a sequential scan of the cassandra table.</div></div><div><=
br></div><div>Regards</div><span class=3D"m_-1909826017677187474HOEnZb"><fo=
nt color=3D"#888888"><div>Siddharth Verma</div></font></span></div><div cla=
ss=3D"m_-1909826017677187474HOEnZb"><div class=3D"m_-1909826017677187474h5"=
><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">On Tue, Oct 4, 2=
016 at 1:32 AM, Jonathan Haddad <span dir=3D"ltr">&lt;<a href=3D"mailto:jon=
@jonhaddad.com" target=3D"_blank">jon@jonhaddad.com</a>&gt;</span> wrote:<b=
r><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:=
1px #ccc solid;padding-left:1ex">Couldn&#39;t set up as couldn&#39;t get it=
 working, or its not allowed?<div class=3D"m_-1909826017677187474m_28786372=
74358824050HOEnZb"><div class=3D"m_-1909826017677187474m_287863727435882405=
0h5"><br><div class=3D"gmail_quote"><div dir=3D"ltr">On Mon, Oct 3, 2016 at=
 3:23 PM Siddharth Verma &lt;<a href=3D"mailto:verma.siddharth@snapdeal.com=
" target=3D"_blank">verma.siddharth@snapdeal.com</a>&gt; wrote:<br></div><b=
lockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px =
#ccc solid;padding-left:1ex"><div dir=3D"ltr" class=3D"m_-19098260176771874=
74m_2878637274358824050m_6918240071323427633gmail_msg"><div class=3D"m_-190=
9826017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><div c=
lass=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gma=
il_msg">Hi Jon,<br class=3D"m_-1909826017677187474m_2878637274358824050m_69=
18240071323427633gmail_msg"></div>We couldn&#39;t setup a spark cluster.</d=
iv></div><div dir=3D"ltr" class=3D"m_-1909826017677187474m_2878637274358824=
050m_6918240071323427633gmail_msg"><div class=3D"m_-1909826017677187474m_28=
78637274358824050m_6918240071323427633gmail_msg"><br class=3D"m_-1909826017=
677187474m_2878637274358824050m_6918240071323427633gmail_msg"><div style=3D=
"margin-left:40px" class=3D"m_-1909826017677187474m_2878637274358824050m_69=
18240071323427633gmail_msg">For some use case, a spark cluster was required=
, but for some=20
reason we couldn&#39;t create spark cluster. Hence, one may use this utilit=
y
 to iterate through the entire table at very high speed.<br class=3D"m_-190=
9826017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><br cl=
ass=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmai=
l_msg"></div></div></div><div dir=3D"ltr" class=3D"m_-1909826017677187474m_=
2878637274358824050m_6918240071323427633gmail_msg"><div class=3D"m_-1909826=
017677187474m_2878637274358824050m_6918240071323427633gmail_msg">Had to fin=
d a work around, that would be faster than paging on result set.<br class=
=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_m=
sg"><br class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323=
427633gmail_msg"></div>Regards<br class=3D"m_-1909826017677187474m_28786372=
74358824050m_6918240071323427633gmail_msg"></div><div class=3D"gmail_extra =
m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_msg">=
<br clear=3D"all" class=3D"m_-1909826017677187474m_2878637274358824050m_691=
8240071323427633gmail_msg"><div class=3D"m_-1909826017677187474m_2878637274=
358824050m_6918240071323427633gmail_msg"><div class=3D"m_-19098260176771874=
74m_2878637274358824050m_6918240071323427633m_-2923170385263507973gmail_sig=
nature m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmai=
l_msg" data-smartmail=3D"gmail_signature"><div dir=3D"ltr" class=3D"m_-1909=
826017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><div cl=
ass=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmai=
l_msg"><div dir=3D"ltr" class=3D"m_-1909826017677187474m_287863727435882405=
0m_6918240071323427633gmail_msg"><div class=3D"m_-1909826017677187474m_2878=
637274358824050m_6918240071323427633gmail_msg"><div dir=3D"ltr" class=3D"m_=
-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><t=
able width=3D"602" border=3D"0" cellspacing=3D"0" cellpadding=3D"0" style=
=3D"font-family:&#39;Times New Roman&#39;" class=3D"m_-1909826017677187474m=
_2878637274358824050m_6918240071323427633gmail_msg"><tbody class=3D"m_-1909=
826017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><tr cla=
ss=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail=
_msg"><td colspan=3D"3" height=3D"1" class=3D"m_-1909826017677187474m_28786=
37274358824050m_6918240071323427633gmail_msg"><img src=3D"http://i.sdlcdn.c=
om/static/img/marketing-mailers/mailer/2014/signature_16may/images/img1.png=
" width=3D"602" height=3D"5" hspace=3D"0" vspace=3D"0" align=3D"right" clas=
s=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_=
msg"></td></tr><tr class=3D"m_-1909826017677187474m_2878637274358824050m_69=
18240071323427633gmail_msg"><td width=3D"5" height=3D"36" class=3D"m_-19098=
26017677187474m_2878637274358824050m_6918240071323427633gmail_msg"></td><td=
 width=3D"445" class=3D"m_-1909826017677187474m_2878637274358824050m_691824=
0071323427633gmail_msg"><div style=3D"font-weight:bold;font-stretch:normal;=
font-size:15px;line-height:16px;font-family:Arial,Helvetica,sans-serif" cla=
ss=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail=
_msg"><span style=3D"color:rgb(178,0,0)" class=3D"m_-1909826017677187474m_2=
878637274358824050m_6918240071323427633gmail_msg">Siddharth Verma</span><br=
 class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633g=
mail_msg"><span style=3D"font-stretch:normal;font-size:11px;color:rgb(0,0,0=
)" class=3D"m_-1909826017677187474m_2878637274358824050m_691824007132342763=
3gmail_msg"><strong class=3D"m_-1909826017677187474m_2878637274358824050m_6=
918240071323427633gmail_msg">Software Engineer I - CaMS</strong></span></di=
v></td><td width=3D"179" rowspan=3D"3" valign=3D"bottom" class=3D"m_-190982=
6017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><img src=
=3D"http://i.sdlcdn.com/img/marketing-mailers/mailer/2015/signature_24mar/i=
mages/dilkideal_logo.png" width=3D"179" height=3D"93" align=3D"right" class=
=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_m=
sg"></td></tr><tr class=3D"m_-1909826017677187474m_2878637274358824050m_691=
8240071323427633gmail_msg"><td height=3D"46" class=3D"m_-190982601767718747=
4m_2878637274358824050m_6918240071323427633gmail_msg"></td><td class=3D"m_-=
1909826017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><sp=
an style=3D"font-stretch:normal;font-size:11px;line-height:16px;font-family=
:Arial,Helvetica,sans-serif;color:rgb(139,139,139)" class=3D"m_-19098260176=
77187474m_2878637274358824050m_6918240071323427633gmail_msg"><strong style=
=3D"color:rgb(116,116,116)" class=3D"m_-1909826017677187474m_28786372743588=
24050m_6918240071323427633gmail_msg">M</strong>: <a href=3D"tel:%2B91%20901=
3689856" value=3D"+919013689856" target=3D"_blank">+91 9013689856</a>,=C2=
=A0<strong class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071=
323427633gmail_msg">T</strong>: 011 22791596=C2=A0<strong class=3D"m_-19098=
26017677187474m_2878637274358824050m_6918240071323427633gmail_msg">EXT</str=
ong>: 14697<br class=3D"m_-1909826017677187474m_2878637274358824050m_691824=
0071323427633gmail_msg">CA2125, 2nd Floor, ASF Centre-A, Jwala Mill Road,=
=C2=A0<br class=3D"m_-1909826017677187474m_2878637274358824050m_69182400713=
23427633gmail_msg">Udyog Vihar Phase - IV, Gurgaon-122016, INDIA<br class=
=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_m=
sg"></span></td></tr><tr class=3D"m_-1909826017677187474m_28786372743588240=
50m_6918240071323427633gmail_msg"><td class=3D"m_-1909826017677187474m_2878=
637274358824050m_6918240071323427633gmail_msg"></td><td valign=3D"bottom" h=
eight=3D"44" class=3D"m_-1909826017677187474m_2878637274358824050m_69182400=
71323427633gmail_msg"><table width=3D"100%" border=3D"0" cellspacing=3D"0" =
cellpadding=3D"0" class=3D"m_-1909826017677187474m_2878637274358824050m_691=
8240071323427633gmail_msg"><tbody class=3D"m_-1909826017677187474m_28786372=
74358824050m_6918240071323427633gmail_msg"><tr class=3D"m_-1909826017677187=
474m_2878637274358824050m_6918240071323427633gmail_msg"><td width=3D"300" c=
lass=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gma=
il_msg"><img src=3D"http://i3.sdlcdn.com/img/homepage/03/sirertPlaceWrk2.pn=
g" width=3D"288" height=3D"71" class=3D"m_-1909826017677187474m_28786372743=
58824050m_6918240071323427633gmail_msg"></td><td class=3D"m_-19098260176771=
87474m_2878637274358824050m_6918240071323427633gmail_msg"><table width=3D"1=
20" border=3D"0" cellspacing=3D"0" cellpadding=3D"0" class=3D"m_-1909826017=
677187474m_2878637274358824050m_6918240071323427633gmail_msg"><tbody class=
=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_m=
sg"><tr class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323=
427633gmail_msg"><td align=3D"left" class=3D"m_-1909826017677187474m_287863=
7274358824050m_6918240071323427633gmail_msg"><font color=3D"#8b8b8b" style=
=3D"font-stretch:normal;font-size:11px;line-height:16px;font-family:Arial,H=
elvetica" class=3D"m_-1909826017677187474m_2878637274358824050m_69182400713=
23427633gmail_msg">Download Our App</font></td></tr><tr class=3D"m_-1909826=
017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><td align=
=3D"left" class=3D"m_-1909826017677187474m_2878637274358824050m_69182400713=
23427633gmail_msg"><table width=3D"120" border=3D"0" cellspacing=3D"0" cell=
padding=3D"0" align=3D"left" class=3D"m_-1909826017677187474m_2878637274358=
824050m_6918240071323427633gmail_msg"><tbody class=3D"m_-190982601767718747=
4m_2878637274358824050m_6918240071323427633gmail_msg"><tr class=3D"m_-19098=
26017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><td clas=
s=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_=
msg"><a href=3D"https://play.google.com/store/apps/details?id=3Dcom.snapdea=
l.main&amp;utm_source=3DmobileAppLp&amp;utm_campaign=3Dandroid" class=3D"m_=
-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_msg" ta=
rget=3D"_blank"><img src=3D"http://i.sdlcdn.com/img/marketing-mailers/maile=
r/2015/signature_24mar/images/android.png" alt=3D" A" width=3D"26" height=
=3D"27" hspace=3D"0" vspace=3D"0" border=3D"0" align=3D"left" style=3D"colo=
r:rgb(255,255,255);background:rgb(80,113,182)" class=3D"m_-1909826017677187=
474m_2878637274358824050m_6918240071323427633gmail_msg"></a></td><td width=
=3D"8" class=3D"m_-1909826017677187474m_2878637274358824050m_69182400713234=
27633gmail_msg"></td><td class=3D"m_-1909826017677187474m_28786372743588240=
50m_6918240071323427633gmail_msg"><a href=3D"https://itunes.apple.com/in/ap=
p/snapdeal-mobile-shopping/id721124909?ls=3D1&amp;mt=3D8&amp;utm_source=3Dm=
obileAppLp&amp;utm_campaign=3Dios" class=3D"m_-1909826017677187474m_2878637=
274358824050m_6918240071323427633gmail_msg" target=3D"_blank"><img src=3D"h=
ttp://i.sdlcdn.com/img/marketing-mailers/mailer/2015/signature_24mar/images=
/apple.png" alt=3D" A" width=3D"26" height=3D"27" hspace=3D"0" vspace=3D"0"=
 border=3D"0" align=3D"left" style=3D"color:rgb(255,255,255);background:rgb=
(80,113,182)" class=3D"m_-1909826017677187474m_2878637274358824050m_6918240=
071323427633gmail_msg"></a></td><td width=3D"8" class=3D"m_-190982601767718=
7474m_2878637274358824050m_6918240071323427633gmail_msg"></td><td class=3D"=
m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_msg">=
<a href=3D"http://www.windowsphone.com/en-in/store/app/snapdeal/ee17fccf-40=
d0-4a59-80a3-04da47a5553f" class=3D"m_-1909826017677187474m_287863727435882=
4050m_6918240071323427633gmail_msg" target=3D"_blank"><img src=3D"http://i.=
sdlcdn.com/img/marketing-mailers/mailer/2015/signature_24mar/images/window.=
png" alt=3D" W" width=3D"26" height=3D"27" hspace=3D"0" vspace=3D"0" border=
=3D"0" align=3D"left" style=3D"color:rgb(255,255,255);background:rgb(80,113=
,182)" class=3D"m_-1909826017677187474m_2878637274358824050m_69182400713234=
27633gmail_msg"></a></td></tr></tbody></table></td></tr></tbody></table></t=
d></tr></tbody></table></td></tr><tr class=3D"m_-1909826017677187474m_28786=
37274358824050m_6918240071323427633gmail_msg"><td colspan=3D"3" height=3D"1=
" class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633=
gmail_msg"><img src=3D"http://i.sdlcdn.com/static/img/marketing-mailers/mai=
ler/2014/signature_16may/images/img1.png" width=3D"602" height=3D"5" hspace=
=3D"0" vspace=3D"0" align=3D"right" class=3D"m_-1909826017677187474m_287863=
7274358824050m_6918240071323427633gmail_msg"></td></tr></tbody></table></di=
v></div></div></div></div></div></div></div><div class=3D"gmail_extra m_-19=
09826017677187474m_2878637274358824050m_6918240071323427633gmail_msg">
<br class=3D"m_-1909826017677187474m_2878637274358824050m_69182400713234276=
33gmail_msg"><div class=3D"gmail_quote m_-1909826017677187474m_287863727435=
8824050m_6918240071323427633gmail_msg">On Tue, Oct 4, 2016 at 12:41 AM, Jon=
athan Haddad <span dir=3D"ltr" class=3D"m_-1909826017677187474m_28786372743=
58824050m_6918240071323427633gmail_msg">&lt;<a href=3D"mailto:jon@jonhaddad=
.com" class=3D"m_-1909826017677187474m_2878637274358824050m_691824007132342=
7633gmail_msg" target=3D"_blank">jon@jonhaddad.com</a>&gt;</span> wrote:<br=
 class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633g=
mail_msg"><blockquote class=3D"gmail_quote m_-1909826017677187474m_28786372=
74358824050m_6918240071323427633gmail_msg" style=3D"margin:0 0 0 .8ex;borde=
r-left:1px #ccc solid;padding-left:1ex"><div style=3D"white-space:pre-wrap"=
 class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633g=
mail_msg">It almost sounds like you&#39;re duplicating all the work of both=
 spark and the connector. May I ask why you decided to not use the existing=
 tools?</div><div class=3D"m_-1909826017677187474m_2878637274358824050m_691=
8240071323427633m_-2923170385263507973HOEnZb m_-1909826017677187474m_287863=
7274358824050m_6918240071323427633gmail_msg"><div class=3D"m_-1909826017677=
187474m_2878637274358824050m_6918240071323427633m_-2923170385263507973h5 m_=
-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><b=
r class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633=
gmail_msg"><div class=3D"gmail_quote m_-1909826017677187474m_28786372743588=
24050m_6918240071323427633gmail_msg"><div dir=3D"ltr" class=3D"m_-190982601=
7677187474m_2878637274358824050m_6918240071323427633gmail_msg">On Mon, Oct =
3, 2016 at 2:21 PM siddharth verma &lt;<a href=3D"mailto:sidd.verma29.list@=
gmail.com" class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071=
323427633gmail_msg" target=3D"_blank">sidd.verma29.list@gmail.com</a>&gt; w=
rote:<br class=3D"m_-1909826017677187474m_2878637274358824050m_691824007132=
3427633gmail_msg"></div><blockquote class=3D"gmail_quote m_-190982601767718=
7474m_2878637274358824050m_6918240071323427633gmail_msg" style=3D"margin:0 =
0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr" clas=
s=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_=
msg">Hi DuyHai,<div class=3D"m_-1909826017677187474m_2878637274358824050m_6=
918240071323427633gmail_msg">Thanks for your reply.</div><div class=3D"m_-1=
909826017677187474m_2878637274358824050m_6918240071323427633gmail_msg">A fe=
w more features planned in the next one(if there is one) like,</div><div cl=
ass=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmai=
l_msg">custom policy keeping in mind the replication of token range on spec=
ific nodes,</div><div class=3D"m_-1909826017677187474m_2878637274358824050m=
_6918240071323427633gmail_msg">fine graining the token range(for more speed=
up),=C2=A0</div><div class=3D"m_-1909826017677187474m_2878637274358824050m_=
6918240071323427633gmail_msg">and a few more.</div><div class=3D"m_-1909826=
017677187474m_2878637274358824050m_6918240071323427633gmail_msg"><br class=
=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_m=
sg"></div><div class=3D"m_-1909826017677187474m_2878637274358824050m_691824=
0071323427633gmail_msg">I think, as fine graining a token range,</div><div =
class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gm=
ail_msg">If one token range is split further in say, 2-3 parts, divided amo=
ng threads, this would exploit the possible parallelism on a large scaled o=
ut cluster.</div><div class=3D"m_-1909826017677187474m_2878637274358824050m=
_6918240071323427633gmail_msg"><br class=3D"m_-1909826017677187474m_2878637=
274358824050m_6918240071323427633gmail_msg"></div><div class=3D"m_-19098260=
17677187474m_2878637274358824050m_6918240071323427633gmail_msg">And, as you=
 mentioned the JIRA, streaming of request, that would of huge help with fur=
ther splitting the range.</div><div class=3D"m_-1909826017677187474m_287863=
7274358824050m_6918240071323427633gmail_msg"><br class=3D"m_-19098260176771=
87474m_2878637274358824050m_6918240071323427633gmail_msg"></div><div class=
=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_m=
sg">Thanks once again for your valuable comments. :-)=C2=A0</div><div class=
=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323427633gmail_m=
sg"><br class=3D"m_-1909826017677187474m_2878637274358824050m_6918240071323=
427633gmail_msg"></div><div class=3D"m_-1909826017677187474m_28786372743588=
24050m_6918240071323427633gmail_msg">Regards,</div><div class=3D"m_-1909826=
017677187474m_2878637274358824050m_6918240071323427633gmail_msg">Siddharth =
Verma</div></div>
</blockquote></div>
</div></div></blockquote></div><br class=3D"m_-1909826017677187474m_2878637=
274358824050m_6918240071323427633gmail_msg"></div></blockquote></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</blockquote></div><br></div>

--001a11401bb8f874c2053dfbec68--