incubator-cvs mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Incubator Wiki] Update of "SamzaProposal" by ChrisRiccomini
Date Tue, 23 Jul 2013 17:06:22 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Incubator Wiki" for change notification.

The "SamzaProposal" page has been changed by ChrisRiccomini:
https://wiki.apache.org/incubator/SamzaProposal?action=diff&rev1=3&rev2=4

  == Abstract ==
  
- Samza is a stream processing system for continuous computation of infinite streams of data.
+ Samza is a stream processing system for running continuous computation on infinite streams
of data.
  
  == Proposal ==
  
@@ -10, +10 @@

  
  == Background ==
  
- Samza was developed at LinkedIn to enable easier processing of streaming data on top of
Apache Kafka. Current use cases include aggregating operational log data, data ingestion into
distributed database infrastructure, and measuring user activity across different aggregation
types.
+ Samza was developed at LinkedIn to enable easier processing of streaming data on top of
Apache Kafka. Current use cases include content processing pipelines, aggregating operational
log data, data ingestion into distributed database infrastructure, and measuring user activity
across different aggregation types.
  
  Samza is focused on providing an easy to use framework to process streams. It uses Apache
YARN to provide a mechanism for deploying stream processing tasks in a distributed cluster.
Samza also takes advantage of YARN to make decisions about stream processor locality, co-partition
of streams, and provide security. Apache Kafka is also leveraged to provide a mechanism to
pass messages from one stream processor to the next. Apache Kafka is also used to help manage
a stream processor's state, so that it can be recovered in the event of a failure.
  
@@ -18, +18 @@

  
  == Rationale ==
  
- Many organizations can benefit from a reliable stream processing system such as Samza. While
our use case of processing events from a very large website like LinkedIn has driven the design
of Samza, its uses are varied and we expect many new use cases to emerge. Samza provides a
generic API to process messages from streaming infrastructure and will appeal to many users.
+ Many organizations can benefit from a reliable stream processing system such as Samza. While
our use case of processing events from a large website like LinkedIn has driven the design
of Samza, its uses are varied and we expect many new use cases to emerge. Samza provides a
generic API to process messages from streaming infrastructure and will appeal to many users.
  
  == Current Status ==
  

---------------------------------------------------------------------
To unsubscribe, e-mail: cvs-unsubscribe@incubator.apache.org
For additional commands, e-mail: cvs-help@incubator.apache.org


Mime
View raw message