cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Brian O'Neill <b...@alumni.brown.edu>
Subject Re: Adhoc querying in Cassandra?
Date Wed, 22 Apr 2015 11:55:47 GMT

+1, I think many organizations (including ours) pair Elastic Search with
Cassandra.
Use Cassandra as your system of record, then index the data with ES.

-brian

---
Brian O'Neill 
Chief Technology Officer
Health Market Science, a LexisNexis Company
215.588.6024 Mobile € @boneill42 <http://www.twitter.com/boneill42>


This information transmitted in this email message is for the intended
recipient only and may contain confidential and/or privileged material. If
you received this email in error and are not the intended recipient, or the
person responsible to deliver it to the intended recipient, please contact
the sender at the email above and delete this email and any attachments and
destroy any copies thereof. Any review, retransmission, dissemination,
copying or other use of, or taking any action in reliance upon, this
information by persons or entities other than the intended recipient is
strictly prohibited.
 


From:  Ali Akhtar <ali.rac200@gmail.com>
Reply-To:  <user@cassandra.apache.org>
Date:  Wednesday, April 22, 2015 at 7:52 AM
To:  <user@cassandra.apache.org>
Subject:  Re: Adhoc querying in Cassandra?


You might find it better to use elasticsearch for your aggregate queries and
analytics. Cassandra is more of just a data store.

On Apr 22, 2015 4:42 PM, "Matthew Johnson" <matt.johnson@algomi.com> wrote:
> Hi all,
>  
> Currently we are setting up a ³big² data cluster, but we are only going to
> have a couple of servers to start with but we need to be able to scale out
> quickly when usage ramps up. Previously we have used Hadoop/HBase for our big
> data cluster, but since we are starting this one on only two nodes I think
> Cassandra will be a much better fit, as Hadoop and HBase really need at least
> 3 to achieve any sort of resilience (zookeeper quorum etc).
>  
> My question is this:
>  
> I have used Apache Phoenix as a JDBC layer on top of HBase, which allows me to
> issue ad-hoc SQL-style queries. (eg count the number of times users have
> clicked on a certain button after clicking a different button in the last 3
> weeks etc). My understanding is that CQL does not support this style of adhoc
> aggregate querying out of the box. Is there a recommended way to do count,
> sum, average etc without writing client code (in my case Java) every time I
> want to run one? I have been looking at projects like Drill, Spark etc that
> could potentially sit on top of Cassandra but without actually setting
> everything up and testing them it is difficult to figure out what they would
> give us.
>  
> Does anyone else interactively issue adhoc aggregate queries to Cassandra, and
> if so, what stack do you use?
>  
> Thanks!
> Matt
>  



Mime
View raw message