pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tilman Hausherr <THaush...@t-online.de>
Subject Re: Discrepancy between rendered and extracted characters.
Date Tue, 22 Apr 2014 14:42:10 GMT
Indeed. If you can, use 600dpi, and take care that the paper is properly 
aligned. And clean your scanner.

Tilman

Am 22.04.2014 15:26, schrieb Tres Finocchiaro:
> I've also noticed scanning the document at higher resolutions helps with
> these types of issues as well, but I've never had a "perfect" OCR scan.
>   They always have spaces or unrecognized characters in some place of the
> document. :)
>
> -Tres
>


Mime
View raw message