lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From sweety <sweetyshind...@yahoo.com>
Subject Re: using extract handler: data not extracted
Date Sat, 11 Jan 2014 13:36:05 GMT
Sorry, that my question was not clear.
Initially when indexed pdf files it showed the data within this pdf in the
contents field.as follows:(this is output for initially indexed documents)
<str name="contents">
Cloud ctured As tale in size as well as complexity. We need a cloud based
system that will solve this problem.  Provide interfaces to registeP CSS
Client Measurements Benchmarkinse times by varying Number of documents
fromnds to millions Nuervers from 1 to 5 Storage and search options as
discussed abo
</str>

But for newly indexed documents, the contents field is empty, 
Actually coding.pdf is of 3mb size, but as shown in the output the contents
of this pdf are not extracted, indexing extracts the metadata,but not the
contents of the file,
the contents field is empty, <str name="contents"></str>  

what is the reason for this? Is is because of some jar missing?





--
View this message in context: http://lucene.472066.n3.nabble.com/using-extract-handler-data-not-extracted-tp4110850p4110873.html
Sent from the Solr - User mailing list archive at Nabble.com.

Mime
View raw message