lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Solr Wiki] Update of "SolrEcosystem" by DavidSmiley
Date Mon, 22 Aug 2011 18:00:11 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "SolrEcosystem" page has been changed by DavidSmiley:

Add Talend Open Studio, and comment on ETLs.

   * [[|OpenPipe]] ([[|alt]])
[[|Solr Info]]
   * [[|OpenPipeline]]
   * ETL (Extract Transform Load) -- many are applicable; these are a couple notable ones:
+   * [[|Talend Open
Studio (TOS)]]
+   * [[|Kettle (Pentaho)]]
    * [[|CloverETL]]
-   * [[|Kettle (Pentaho)]]
+ A common problem amongst the ETLs is that each step in the pipeline accepts and emits records
in a fixed flat schema, they don't support dynamic name-value pairs. And these are not document
oriented; if you want to pass a DOM of some kind then you serialize it into a field. However,
the ETLs are all far more mature than nascent document or XML oriented pipelines.
   * ESBs (Enterprise Service Buses) -- not listed; various
   * One of the [[|XProc implementations]] (an XML pipeline
spec) such as [[|Calabash]]
@@ -45, +49 @@

   * [[|Cascading]] - [[|Solr
   * [[|Katta]] - [[KattaIntegration]]
  = Misc =
   * [[|Solandra]] - A tight integration of Solr and Cassandra.
The result is Solr with the awesome scalability properties of Cassandra.

View raw message