lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Update of "SolrEcosystem" by DavidSmiley
Date Mon, 11 Jul 2011 04:59:13 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "SolrEcosystem" page has been changed by DavidSmiley:
http://wiki.apache.org/solr/SolrEcosystem?action=diff&rev1=2&rev2=3

  
  There are numerous ways to bring data into Solr. Many people roll their own solution or
use the DIH. 
  
- == Crawlers ==
+ == Crawlers And Connectors ==
  
  Web, email, and file crawlers.
  
-  * [[http://lucene.apache.org/nutch/|Nutch]] (web, ...?) '''*S*'''
+  * [[http://lucene.apache.org/nutch/|Nutch]] (web) '''*S*'''
-  * [[http://en.wikipedia.org/wiki/Heritrix|Heritrix]] (web, ...?)
+  * [[http://en.wikipedia.org/wiki/Heritrix|Heritrix]] (web)
-  * [[http://incubator.apache.org/droids/|Droids]] ( ? )
-  * [[http://www.crawl-anywhere.com/|Crawl-Anywhere]] (web, ...?)
+  * [[http://www.crawl-anywhere.com/|Crawl-Anywhere]] (web)  '''*S*''' 
-  * [[DataImportHandler]] (email, files) '''*S*'''
+  * [[DataImportHandler]] (email, file) '''*S*''' 
-  * [[http://incubator.apache.org/connectors/|ManifoldCF]] (web, file) '''*S*'''
+  * [[http://incubator.apache.org/connectors/|ManifoldCF]] (web, file) '''*S*''' 
+  * [[http://aperture.sourceforge.net/|Aperture]] (web, email, file) 
+  * [[http://incubator.apache.org/droids/|Droids]] ( none ) '''*S*''' Presently, more of
a framework for a crawler.
  
  == Pipelines / Document Processing ==
   
- Frameworks for flexible document processing. See [[DocumentProcessing]] for more background
and criteria for a proposal.
+ Frameworks for flexible document processing. See [[DocumentProcessing]] for more background
and criteria for a proposal. Some crawlers/connectors have their own pipeline capability and
they are not repeated here.
  
-  * ETL (Extract Transform Load)
+  * [[http://www.openpipeline.org|OpenPipeline]] '''*S*'''
+  * [[https://github.com/kolstae/openpipe|OpenPipe]]
+  * ETL (Extract Transform Load) -- many are applicable; these are a couple notable ones:
-   * [[http://sourceforge.net/projects/cloveretl/|CloverETL]] LGPL
+   * [[http://sourceforge.net/projects/cloveretl/|CloverETL]]
    * [[http://kettle.pentaho.com/|Pentaho Kettle]]
-  * [[http://www.openpipeline.org|OpenPipeline]], Solr integration.
-  * [[https://github.com/kolstae/openpipe|OpenPipe]]
-  * [[DataImportHandler]] '''*S*'''
+  * ESBs (Enterprise Service Buses) -- not listed; various
+  * One of the [[http://xproc.org/implementations/|XProc implementations]] (an XML pipeline
spec) such as [[http://xmlcalabash.com/|Calabash]]
+ 
+ = Clients =
+ 
+ http://wiki.apache.org/solr/IntegratingSolr
+ 
+ = Integration with other Software =
+ 
+ http://wiki.apache.org/solr/IntegratingSolr
+ 
+ = Misc =
+ 
+  * [[https://github.com/tjake/Solandra|Solandra]] - A tight integration of Solr and Cassandra.
The result is Solr with the awesome scalability properties of Cassandra.
+ 
   
  

Mime
View raw message