lucene-solr-user mailing list archives

From Grant Ingersoll
Subject Re: Question regarding ExtractingRequestHandler
Date Wed, 08 Jul 2009 12:31:30 GMT
For metadata, you can pass the ext.metadata.prefix parameter on the
request and then define a dynamic field in the schema that matches
that prefix, such as:


  <dynamicField name="metadata_*"  type="string"    indexed="true"   
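On the request side, a rough sketch of how the parameters might be put together is below. The parameter names ext.metadata.prefix and ext.extract.only are the ones discussed in this thread; the host, port, and handler path (/solr/update/extract) and the helper name build_extract_url are assumptions for illustration only, so adjust them to match your solrconfig.xml.

```python
from urllib.parse import urlencode


def build_extract_url(base_url, metadata_prefix="metadata_", extract_only=False):
    """Build the query string for an extraction request.

    base_url is a hypothetical handler URL; the two ext.* parameter
    names come from this thread and may change in later releases.
    """
    params = {
        # Prefix prepended to each extracted metadata name, so that it
        # matches a dynamic field pattern such as metadata_*.
        "ext.metadata.prefix": metadata_prefix,
        # "false" means index the document rather than only returning
        # the extracted content and metadata.
        "ext.extract.only": "true" if extract_only else "false",
    }
    return base_url + "?" + urlencode(params)


if __name__ == "__main__":
    # Assumed local Solr instance and handler path.
    print(build_extract_url("http://localhost:8983/solr/update/extract"))
```

The file itself would then be sent as the body of a POST to that URL. With the prefix set to metadata_, each extracted metadata name should land in a field covered by the metadata_* dynamic field, so the individual fields do not have to be declared one by one.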

Note, some of this is currently under review to be changed.  See


On Jul 7, 2009, at 10:49 AM, ahammad wrote:

> Hello,
> I've recently started using this handler to index MS Word and PDF files.
> When I set ext.extract.only=true, I get back all the metadata that is
> associated with that file.
> If I want to index, I need to set ext.extract.only=false. If I want to
> index all that metadata along with the contents, what inputs do I need
> to pass to the http request? Do I have to specifically define all the
> fields in the schema or can Solr dynamically generate those fields?
> Thanks.

Grant Ingersoll

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
using Solr/Lucene:
