cassandra-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ben Standefer (JIRA)" <j...@apache.org>
Subject [jira] Issue Comment Edited: (CASSANDRA-873) Create a Cassandra demo application
Date Wed, 24 Mar 2010 20:40:27 GMT

    [ https://issues.apache.org/jira/browse/CASSANDRA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12849382#action_12849382
] 

Ben Standefer edited comment on CASSANDRA-873 at 3/24/10 8:39 PM:
------------------------------------------------------------------

A good idea brought up by Edward Capriolo (not a student, so students feel free to run with
this one) is a Splunk knock-off.  Splunk is software that indexes logs (syslog, Apache logs,
app logs, whatever) in lots of different ways and makes your logs highly searchable and filter-able
via a front-end web interface.  http://www.splunk.com/product.

While the Splunk product is powerful and awesome, the licensing is not (they license by usage
instead of per-seat).

I think a Splunk knock-off would be a good demo app for people just getting into Cassandra
because parsing logs is an easy concept to understand and it could start off very simple.
 There is a lot of opportunity to utilize all features of the Cassandra API (range queries,
search indexes, property-specific indexes).  This could be made very high scalable by utilizing
Scribed ( http://github.com/facebook/scribe ), a scalable logging solution that many Cassandra
users are already using to store their logs.  It's like rsyslogd on crack.

Think Facebook's personalized search indexes for each of their 400M users, but applied to
log data and properties.

      was (Author: aguynamedben):
    A good idea brought up by Edward Capriolo (not a student, so students feel free to run
with this one) is a Splunk knock-off.  Splunk is software that indexes logs (syslog, Apache
logs, app logs, whatever) in lots of different ways and makes your logs highly searchable
and filter-able via a front-end web interface.  http://www.splunk.com/product.

While the Splunk product is powerful and awesome, the licensing is not (they license by usage
instead of per-seat).

I think a Splunk knock-off would be a good demo app for people just getting into Cassandra
because parsing logs is an easy concept to understand and it could start off very simple.
 There is a lot of opportunity to utilize all features of the Cassandra API (range queries,
search indexes, property-specific indexes).  This could be made very high scalable by utilizing
Scribed ( http://github.com/facebook/scribe ), a scalable logging solution that many Cassandra
users are already using to store their logs.  It's like rsyslogd on crack.

Think Facebook's customized "Inbox Search" index for each of their 400M users, but applied
to log data and properties.
  
> Create a Cassandra demo application
> -----------------------------------
>
>                 Key: CASSANDRA-873
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-873
>             Project: Cassandra
>          Issue Type: Task
>            Reporter: Jonathan Ellis
>            Priority: Minor
>
> http://twissandra.com/ is a demo Cassandra application built on django + pycassa.  It's
a great Cassandra showcase and very useful for people learning Cassandra.  We could use more
of those.
> Jake Luciani suggested one that presents full-text search of Wikipedia using Lucandra
(see http://blog.sematext.com/2010/02/09/lucandra-a-cassandra-based-lucene-backend/ and http://github.com/tjake/Lucandra).
 Feel free to propose other application ideas here.
> Rackspace is willing to provide a VM to deploy on for a live demo, but remember, to be
really useful this needs full DIY instructions, the final product is not the demo but the
code + instructions.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message