lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Davis, Daniel (NIH/NLM) [C]" <daniel.da...@nih.gov>
Subject RE: Indexing PDF and MS Office files
Date Thu, 16 Apr 2015 16:09:52 GMT
Indeed.   Another solution is to purchase ABBYY or Nuance as a server, and have them do that
work.
You will even get OCR.    Both offer a Linux SDK.

-----Original Message-----
From: Allison, Timothy B. [mailto:tallison@mitre.org] 
Sent: Thursday, April 16, 2015 7:56 AM
To: solr-user@lucene.apache.org
Subject: RE: Indexing PDF and MS Office files

+1

:)

>PS: one more thing - please, tell your management that you will never 
>ever successfully all real-world PDFs and cater for that fact in your 
>requirements :-)

Mime
View raw message