lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Trivial Update of "ExtractingRequestHandler" by JanHoydahl
Date Sun, 29 Jul 2012 00:16:38 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "ExtractingRequestHandler" page has been changed by JanHoydahl:
http://wiki.apache.org/solr/ExtractingRequestHandler?action=diff&rev1=77&rev2=78

Comment:
Link to Tika1.2

  
  = Additional Resources =
   * [[http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Content-Extraction-Tika#example.source|Lucid
Imagination article]] 
-  * [[http://tika.apache.org/0.10/formats.html|Supported document formats via Tika (0.10)]]
+  * [[http://tika.apache.org/1.2/formats.html|Supported document formats via Tika (1.2)]]
  
  = What's in a Name =
  Grant was writing the javadocs for the code and needed an entry for the <title> tag
and wrote out "Solr Content Extraction Library", since the contrib directory is named "extraction".
 This then lead to an "acronym":  Solr CEL which then gets mashed to: Solr Cell.  Hence, the
project name is "Solr Cell".  It's also appropriate because a Solar Cell's job is to convert
the raw energy of the Sun to electricity, and this contrib's module is responsible for converting
the "raw" content of a document to something usable by Solr. http://en.wikipedia.org/wiki/Solar_cell

Mime
View raw message