jackrabbit-users mailing list archives

From Eliott <eliott...@gmail.com>
Subject Tiff extraction question
Date Wed, 09 Mar 2011 10:19:32 GMT
Dear Jackrabbit users!

During the final phase of a project, it came to my attention that TIFF
files can also store the image and the OCR-ed text in the same file,
just like PDFs do. Since we have many such files, we have a business
need to extract the text from these TIFFs.

Has anybody written a text extractor, or does anyone know of a library,
that can get the text layer from these files? Is there any specific
reason why JR does not support this out of the box?

