lucene-solr-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <>
Subject [Solr Wiki] Update of "ExtractingRequestHandler" by JanHoydahl
Date Thu, 02 Aug 2012 22:17:09 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "ExtractingRequestHandler" page has been changed by JanHoydahl:

Paragraph on posting rich docs with post.jar

  <!> NOTE, this literally streams the file as the body of the POST, which does not,
then, provide info to Solr about the name of the file.
+ == SimplePostTool (post.jar) ==
+ The simple post tool post.jar which ships with Solr in the {{{example/exampledocs}}} folder
can post a file to ExtractingRequestHandler:
+ {{{
+ java -Durl=http://localhost:8983/solr/update/extract -Dtype=text/html
-jar post.jar tutorial.html
+ }}}
+ Since <!> [[Solr4.0]] post.jar also has an {{{auto}}} mode which guesses content-type
for you, and also sets a default ID and filename when sending to Solr. Also, a {{{recursive}}}
option lets you automatically post a whole directory tree:
+ {{{
+ java -Dauto -jar post.jar tutorial.html
+ java -Dauto -Drecursive -jar post.jar .
+ }}}
+ <!> NOTE: The post.jar utility is not meant for production use, but as a convenience
tool for experimenting with Solr. It is made as a single .java file (see [[|SVN]])
without dependencies, so it does on purpose not use SolrJ.
  == SolrJ ==
  Use the !ContentStreamUpdateRequest (see ContentStreamUpdateRequestExample for a full example):

View raw message