lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "SolrEcosystem" by DavidSmiley
Date Mon, 22 Aug 2011 18:00:11 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "SolrEcosystem" page has been changed by DavidSmiley:
http://wiki.apache.org/solr/SolrEcosystem?action=diff&rev1=7&rev2=8

Comment:
Add Talend Open Studio, and comment on ETLs.

   * [[https://github.com/kolstae/openpipe|OpenPipe]] ([[http://openpipe.berlios.de/|alt]])
[[http://openpipe.berlios.de/|Solr Info]]
   * [[http://www.openpipeline.org|OpenPipeline]]
   * ETL (Extract Transform Load) -- many are applicable; these are a couple notable ones:
+   * [[http://www.talend.com/products-data-integration/talend-open-studio.php|Talend Open
Studio (TOS)]]
+   * [[http://kettle.pentaho.com/|Kettle (Pentaho)]]
    * [[http://sourceforge.net/projects/cloveretl/|CloverETL]]
-   * [[http://kettle.pentaho.com/|Kettle (Pentaho)]]
+ 
+ A common problem amongst the ETLs is that each step in the pipeline accepts and emits records
in a fixed flat schema, they don't support dynamic name-value pairs. And these are not document
oriented; if you want to pass a DOM of some kind then you serialize it into a field. However,
the ETLs are all far more mature than nascent document or XML oriented pipelines.
+ 
   * ESBs (Enterprise Service Buses) -- not listed; various
   * One of the [[http://xproc.org/implementations/|XProc implementations]] (an XML pipeline
spec) such as [[http://xmlcalabash.com/|Calabash]]
  
@@ -45, +49 @@

  Hadoop:
   * [[http://www.cascading.org/|Cascading]] - [[https://github.com/bixolabs/cascading.solr|Solr
"Tap"]]
   * [[http://katta.sourceforge.net/|Katta]] - [[KattaIntegration]]
- 
  = Misc =
  
   * [[https://github.com/tjake/Solandra|Solandra]] - A tight integration of Solr and Cassandra.
The result is Solr with the awesome scalability properties of Cassandra.

Mime
View raw message