cassandra-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ben Vogan <...@shopkick.com>
Subject Replicating Cassandra data to HDFS
Date Tue, 09 Aug 2016 16:09:05 GMT
Hi all,

We are investigating using Cassandra in our data platform.  We would like
data to go into Cassandra first and to eventually be replicated into our
data lake in HDFS for long term cold storage.  Does anyone know of a good
way of doing this?  We would rather not have parallel writes to HDFS and
Cassandra because we were hoping that we could use Cassandra primary keys
to de-duplicate events.

Thanks,
-- 
<http://shopkick.com/>
*BENJAMIN VOGAN* | Data Platform Team Lead
shopkick <http://www.shopkick.com/>
<http://facebook.com/shopkick> <http://instagram.com/shopkick>
<http://pinterest.com/shopkick> <http://twitter.com/shopkick>
<https://www.linkedin.com/company/831240?trk=tyah&trkInfo=clickedVertical%3Acompany%2CentityType%3AentityHistoryName%2CclickedEntityId%3Acompany_831240%2Cidx%3A0>

The indispensable app that rewards you for shopping.

Mime
View raw message