lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Davis, Daniel (NIH/NLM) [C]" <>
Subject RE: Indexing PDF and MS Office files
Date Thu, 16 Apr 2015 16:09:52 GMT
Indeed.   Another solution is to purchase ABBYY or Nuance as a server, and have them do that
You will even get OCR.    Both offer a Linux SDK.

-----Original Message-----
From: Allison, Timothy B. [] 
Sent: Thursday, April 16, 2015 7:56 AM
Subject: RE: Indexing PDF and MS Office files



>PS: one more thing - please, tell your management that you will never 
>ever successfully all real-world PDFs and cater for that fact in your 
>requirements :-)

View raw message