Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm
Precedence: bulk
Reply-To: user@cassandra.apache.org
Received-SPF: pass (nike.apache.org: domain of boneill42@gmail.com designates
 209.85.216.44 as permitted sender)
User-Agent: Microsoft-MacOutlook/14.13.0.110805
Date: Wed, 19 Oct 2011 09:20:21 -0400
Subject: Re: Using elasticsearch on cassandra nodes
From: Brian O'Neill <boneill42@gmail.com>
To: <user@cassandra.apache.org>
Message-ID: <CAC444B1.11FD4%boneill42@gmail.com>
Thread-Topic: Using elasticsearch on cassandra nodes
In-Reply-To: 
 <CAFk=5qB7g0ov+QarGJbYW3BWEwcFLt235Aywv20c1Jp1LP07aw@mail.gmail.com>
Mime-version: 1.0
Content-type: multipart/alternative;
	boundary="B_3401860825_148008"

> This message is in MIME format. Since your mail reader does not understand
this format, some or all of this message may not be legible.

--B_3401860825_148008
Content-type: text/plain;
	charset="ISO-8859-1"
Content-transfer-encoding: quoted-printable

Anthony,

We're in exactly the same boat.  We are waiting on DataStax Enterprise to
see if it can ease the pain of SOLR schemas.

In the meantime, I just submitted a native REST layer for Cassandra.
https://issues.apache.org/jira/browse/CASSANDRA-3380
(Hopefully, it will get integrated soon. Vote it up ;)

With a  simple REST layer, I'm making the case that we can use Cassandra
just like CouchDB. (so we don't have to deploy both)
Extending that assertion, I think I could enhance the REST layer to provide
a stream of changes just like CouchDB does.  Elastic Search could tap into
that stream as a river.  Just like this=8A
http://www.elasticsearch.org/guide/reference/river/couchdb.html

That combination would be pretty powerful.  If we can't get that setup, we
may fallback to an AOPish strategy as well.

Definitely let me know where you end up.   I'll share our findings as well.

cheers,
-brian

----=20
Brian O'Neill
Lead Architect, Software Development
Health Market Science | 2700 Horizon Drive | King of Prussia, PA 19406
p: 215.588.6024
blog: http://weblogs.java.net/blog/boneill42/
blog: http://brianoneill.blogspot.com/


From:  Anthony Ikeda <anthony.ikeda.dev@gmail.com>
Reply-To:  <user@cassandra.apache.org>
Date:  Tue, 18 Oct 2011 14:18:17 -0700
To:  <user@cassandra.apache.org>
Subject:  Re: Using elasticsearch on cassandra nodes

At the moment we are only prototyping so we haven't bridged the two at all.
We had planned on creating a write-through operation that allowed us to
filter the calls (AOP perhaps?) to manage the indexing as we stored it in
Cassandra.

We are still trying to work out if we go the elastic search route or not as
DataStax will be releasing DataStax Enterprise 2.0 early next year with Sol=
r
built in and as you said the index schemas seem to be difficult to deal wit=
h
- I really don't want to have to configure Solr, the no schema approach
sounds much faster to get up and running.

Anthony


On Tue, Oct 18, 2011 at 6:14 AM, Brian O'Neill <bone@alumni.brown.edu>
wrote:
> Anthony,
>=20
> We've been looking at elastic search as well.  Presently we have SOLR in
> place, but it is cumbersome dealing with SOLR schemas when indexing
> information out of Cassandra (since you can't anticipate all the columns =
ahead
> of time). =20
>=20
> What are you using as your bridge between Cassandra and ES?  Are you
> developing a Cassandra river?
>=20
> -brian
>=20
>=20
>=20
>=20
> On Mon, Oct 17, 2011 at 5:29 PM, Anthony Ikeda <anthony.ikeda.dev@gmail.c=
om>
> wrote:
>> I've already posted to the elasticsearch groups and thought it prudent t=
o
>> also ask here.
>>=20
>> We are looking at using elastic search to index our data that we current=
ly
>> store to Cassandra. I was wondering if there are any concerns running el=
astic
>> search on the same nodes that we use for Cassandra? We have a ring of 6 =
nodes
>> (2 DCs each with 3 nodes) I was thinking of installing elastic search on=
 2
>> nodes in each datacentre - maybe all three. The only reason I'd use the =
same
>> infrastructure would be because we have the distributed visibility alrea=
dy in
>> place.
>>=20
>> Has anyone else taken this approach? Pros? Cons?
>>=20
>> Anthony
>>=20
>=20
>=20
>=20
> --=20
> Brian ONeill
> Lead Architect, Health Market Science (http://healthmarketscience.com)
> mobile:215.588.6024 <tel:215.588.6024>
> blog: http://weblogs.java.net/blog/boneill42/
> blog: http://brianoneill.blogspot.com/
>=20


--B_3401860825_148008
Content-type: text/html;
	charset="ISO-8859-1"
Content-transfer-encoding: quoted-printable

<html><head></head><body style=3D"word-wrap: break-word; -webkit-nbsp-mode: s=
pace; -webkit-line-break: after-white-space; color: rgb(0, 0, 0); font-size:=
 14px; font-family: Calibri, sans-serif; "><div><div><div>Anthony,</div><div=
><br></div><div>We're in exactly the same boat. &nbsp;We are waiting on Data=
Stax Enterprise to see if it can ease the pain of SOLR schemas.</div><div><b=
r></div><div>In the meantime, I just submitted a native REST layer for Cassa=
ndra.</div><div><a href=3D"https://issues.apache.org/jira/browse/CASSANDRA-338=
0">https://issues.apache.org/jira/browse/CASSANDRA-3380</a></div><div>(Hopef=
ully, it will get integrated soon. Vote it up ;)</div><div><br></div><div>Wi=
th a &nbsp;simple REST layer, I'm making the case that we can use Cassandra =
just like CouchDB. (so we don't have to deploy both)</div><div>Extending tha=
t assertion, I think I could enhance the REST layer to provide a stream of c=
hanges just like CouchDB does. &nbsp;Elastic Search could tap into that stre=
am as a river. &nbsp;Just like this&#8230;</div><div><a href=3D"http://www.ela=
sticsearch.org/guide/reference/river/couchdb.html">http://www.elasticsearch.=
org/guide/reference/river/couchdb.html</a></div><div><br></div><div>That com=
bination would be pretty powerful. &nbsp;If we can't get that setup, we may =
fallback to an AOPish strategy as well.</div><div><br></div><div>Definitely =
let me know where you end up. &nbsp; I'll share our findings as well.</div><=
div><br></div><div>cheers,</div><div>-brian</div><div><br></div><div><div><d=
iv style=3D"font-size: 14px; color: rgb(0, 0, 0); font-family: Calibri, sans-s=
erif; ">--<span class=3D"Apple-style-span" style=3D"font-size: 15px; font-family=
: Calibri, Verdana, Helvetica, Arial; ">--&nbsp;</span></div><font color=3D"#2=
62626" style=3D"font-size: 14px; color: rgb(0, 0, 0); font-family: Calibri, sa=
ns-serif; "><font size=3D"1"><font face=3D"Verdana Bold"><span style=3D"font-size:=
 9pt; ">Brian O'Neill<br></span></font><font face=3D"Verdana,Helvetica,Arial">=
<span style=3D"font-size: 8pt; ">Lead Architect, Software Development<br></spa=
n></font></font></font><font size=3D"2" style=3D"font-size: 14px; "><font face=3D"=
Calibri Bold" style=3D"color: rgb(0, 0, 0); font-family: Calibri, sans-serif; =
"><span style=3D"font-size: 10pt; ">Health Market&nbsp;<font color=3D"#D7172D">S=
cience</font><font color=3D"#808080">&nbsp;</font></span></font><span style=3D"f=
ont-size: 10pt; "><font><font face=3D"Calibri,Verdana,Helvetica,Arial"><font c=
lass=3D"Apple-style-span" face=3D"Calibri,sans-serif">| 2700 Horizon Drive | Kin=
g of Prussia, PA 19406</font><br><font class=3D"Apple-style-span" face=3D"Calibr=
i,sans-serif">p: 215.588.6024</font></font></font></span></font><div style=3D"=
font-size: 14px; color: rgb(0, 0, 0); font-family: Calibri, sans-serif; ">bl=
og: <a href=3D"http://weblogs.java.net/blog/boneill42">http://weblogs.java.net=
/blog/boneill42</a>/</div><div style=3D"font-size: 14px; color: rgb(0, 0, 0); =
font-family: Calibri, sans-serif; ">blog: <a href=3D"http://brianoneill.blogsp=
ot.com">http://brianoneill.blogspot.com</a>/</div><div style=3D"font-size: 14p=
x; color: rgb(0, 0, 0); font-family: Calibri, sans-serif; "><br></div><div s=
tyle=3D"font-size: 14px; color: rgb(0, 0, 0); font-family: Calibri, sans-serif=
; "><br></div></div></div></div></div><div><br></div><span id=3D"OLK_SRC_BODY_=
SECTION"><div style=3D"font-family:Calibri; font-size:11pt; text-align:left; c=
olor:black; BORDER-BOTTOM: medium none; BORDER-LEFT: medium none; PADDING-BO=
TTOM: 0in; PADDING-LEFT: 0in; PADDING-RIGHT: 0in; BORDER-TOP: #b5c4df 1pt so=
lid; BORDER-RIGHT: medium none; PADDING-TOP: 3pt"><span style=3D"font-weight:b=
old">From: </span> Anthony Ikeda &lt;<a href=3D"mailto:anthony.ikeda.dev@gmail=
.com">anthony.ikeda.dev@gmail.com</a>&gt;<br><span style=3D"font-weight:bold">=
Reply-To: </span> &lt;<a href=3D"mailto:user@cassandra.apache.org">user@cassan=
dra.apache.org</a>&gt;<br><span style=3D"font-weight:bold">Date: </span> Tue, =
18 Oct 2011 14:18:17 -0700<br><span style=3D"font-weight:bold">To: </span> &lt=
;<a href=3D"mailto:user@cassandra.apache.org">user@cassandra.apache.org</a>&gt=
;<br><span style=3D"font-weight:bold">Subject: </span> Re: Using elasticsearch=
 on cassandra nodes<br></div><div><br></div>At the moment we are only protot=
yping so we haven't bridged the two at all. We had planned on creating a wri=
te-through operation that allowed us to filter the calls (AOP perhaps?) to m=
anage the indexing as we stored it in Cassandra.<div><br></div><div>We are s=
till trying to work out if we go the elastic search route or not as DataStax=
 will be releasing DataStax Enterprise 2.0 early next year with Solr built i=
n and as you said the index schemas seem to be difficult to deal with - I re=
ally don't want to have to configure Solr, the no schema approach sounds muc=
h faster to get up and running.</div><div><br></div><div>Anthony</div><div><=
br><br><div class=3D"gmail_quote">On Tue, Oct 18, 2011 at 6:14 AM, Brian O'Nei=
ll <span dir=3D"ltr">&lt;<a href=3D"mailto:bone@alumni.brown.edu">bone@alumni.br=
own.edu</a>&gt;</span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"marg=
in:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">Anthony,<br><br>=
We've been looking at elastic search as well.&nbsp; Presently we have SOLR i=
n place, but it is cumbersome dealing with SOLR schemas when indexing inform=
ation out of Cassandra (since you can't anticipate all the columns ahead of =
time).&nbsp; <br><br>What are you using as your bridge between Cassandra and=
 ES?&nbsp; Are you developing a Cassandra river?<br><br>-brian<div><div></di=
v><div class=3D"h5"><br><br><br><br><div class=3D"gmail_quote">On Mon, Oct 17, 2=
011 at 5:29 PM, Anthony Ikeda <span dir=3D"ltr">&lt;<a href=3D"mailto:anthony.ik=
eda.dev@gmail.com" target=3D"_blank">anthony.ikeda.dev@gmail.com</a>&gt;</span=
> wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-=
left:1px #ccc solid;padding-left:1ex">I've already posted to the elasticsear=
ch groups and thought it prudent to also ask here.<div><br></div><div><span =
style=3D"font-size: 13px; background-color: rgb(255, 255, 255); font-family: a=
rial, sans-serif; ">We are looking at using elastic search to index our data=
 that we&nbsp;currently store to Cassandra. I was wondering if there are any=
&nbsp;concerns running elastic search on the same nodes that we use for&nbsp=
;Cassandra? We have a ring of 6 nodes (2 DCs each with 3 nodes) I was thinki=
ng of installing elastic search on 2 nodes in each datacentre - maybe all th=
ree. The only reason I'd use the same infrastructure would be because we hav=
e the distributed visibility already in place.</span></div><div><span style=3D=
"font-size: 13px; background-color: rgb(255, 255, 255); font-family: arial, =
sans-serif; "><br></span></div><div><font face=3D"arial,sans-serif">Has anyone=
 else taken this approach? Pros? Cons?</font></div><div><font face=3D"arial,sa=
ns-serif"><br></font></div><font color=3D"#888888"><div><font face=3D"arial,sans=
-serif">Anthony</font></div><div><font face=3D"arial,sans-serif"><br></font></=
div></font></blockquote></div><br><br clear=3D"all"><br></div></div><font colo=
r=3D"#888888">-- <br>Brian ONeill<br>Lead Architect, Health Market Science (<a=
 href=3D"http://healthmarketscience.com" target=3D"_blank">http://healthmarketsc=
ience.com</a>)<br>
mobile:<a href=3D"tel:215.588.6024" value=3D"+12155886024" target=3D"_blank">215.=
588.6024</a><br>
blog: <a href=3D"http://weblogs.java.net/blog/boneill42/" target=3D"_blank">htt=
p://weblogs.java.net/blog/boneill42/</a><br>blog: <a href=3D"http://brianoneil=
l.blogspot.com/" target=3D"_blank">http://brianoneill.blogspot.com/</a><br><br=
></font></blockquote></div><br></div></span></body></html>

--B_3401860825_148008--