manifoldcf-dev mailing list archives

From "Rafa Haro (JIRA)" <j...@apache.org>
Subject [jira] [Created] (CONNECTORS-1162) Apache Kafka Output Connector
Date Thu, 12 Feb 2015 14:42:11 GMT
Rafa Haro created CONNECTORS-1162:
-------------------------------------

             Summary: Apache Kafka Output Connector
                 Key: CONNECTORS-1162
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1162
             Project: ManifoldCF
          Issue Type: Wish
    Affects Versions: ManifoldCF 2.0.1, ManifoldCF 1.8.1
            Reporter: Rafa Haro
             Fix For: ManifoldCF 1.9, ManifoldCF 2.1


Kafka is a distributed, partitioned, replicated commit log service. It provides the functionality
of a messaging system, but with a unique design. A single Kafka broker can handle hundreds
of megabytes of reads and writes per second from thousands of clients.

Apache Kafka is used for a number of use cases. One of them is feeding streaming Big Data
processes, in both Apache Spark and Hadoop environments. A Kafka output connector could be used
to stream or dispatch crawled documents or metadata into a Big Data processing pipeline.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
