jackrabbit-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paco Avila <monk...@gmail.com>
Subject Re: searching in OCRed pdf
Date Mon, 26 Jan 2009 16:36:04 GMT
You can make a text extractor which perform an OCR.

On Mon, Jan 26, 2009 at 5:25 PM, Péterfi Balázs <b.peterfi@i-deal.hu> wrote:
> Hello,
>
> I'm developing an application that uses jackrabbit and have some problem
> with searching in pdf files. When I search in a pdf that was generated from
> a word document it works. When I try to search in a pdf that has a scanned
> document inside it and I can search through its contents from within Adobe
> Reader (some sort of Optical Character Recognition) but my application does
> not obtain results. I don't know how does this kind of pdf work but I need
> to search in it. Does jackrabbit support it?
>
> Thank you!
> Balazs
>
>



-- 
Paco Avila
GIT Consultors
tel: +34 971 498310
fax: +34 971496189
e-mail: pavila@git.es
http://www.git.es

Mime
View raw message