lucene-solr-commits mailing list archives

From Apache Wiki <wikidi...@apache.org>
Subject [Solr Wiki] Trivial Update of "ExtractingRequestHandler" by AndreHagenbruch
Date Mon, 06 Dec 2010 22:23:26 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change notification.

The "ExtractingRequestHandler" page has been changed by AndreHagenbruch.
The comment on this change is: Fixed a typo.
http://wiki.apache.org/solr/ExtractingRequestHandler?action=diff&rev1=65&rev2=66

--------------------------------------------------

  
  Now, you should be able to execute a query and find that document (open the following link
in your browser): http://localhost:8983/solr/select?q=tutorial
  
- You may notice that although you can search on any of the text in the sample document, you
may not be able to see that text when the document is retrieved.  This is simply because the
"content" field generated by Tika is mapped to the Solr field called "text", which is indexed
but not stored. This is done via the default map rule in the {{{/udate/extract}}} handler
in {{{solrconfig.xml}}} and can be easily changed or overridden. For example, to store and
see all metadata and content, execute the following:
+ You may notice that although you can search on any of the text in the sample document, you
may not be able to see that text when the document is retrieved.  This is simply because the
"content" field generated by Tika is mapped to the Solr field called "text", which is indexed
but not stored. This is done via the default map rule in the {{{/update/extract}}} handler
in {{{solrconfig.xml}}} and can be easily changed or overridden. For example, to store and
see all metadata and content, execute the following:
  
  {{{
  curl "http://localhost:8983/solr/update/extract?literal.id=doc1&uprefix=attr_&fmap.content=attr_content&commit=true" -F "myfile=@tutorial.html"
