lucene-java-user mailing list archives

From Jeroen Reijn <j.re...@hippo.nl>
Subject Re: PDFBox PDFExtractor
Date Mon, 12 Sep 2005 15:58:00 GMT
Hi Rod,

PDFBox is a separate project. The PDFExtractor in Jakarta Slide uses PDFBox's
functionality to extract the information from the .pdf file.

Hope this answers your question.

Jeroen


Rod.Madden@ferguson.com wrote:
> Hi,
>
> I am new to Lucene and looking at some existing Lucene code....
>
> I am confused about the relationship (if any) between
> org.apache.slide.extractor.PDFExtractor methods and org.PDFBox.cos
> methods for the purposes of working with PDF files.
>
> I have found info on the web regarding PDFBox, however, I have found
> little regarding .PDFExtractor.
>
> I am curious since we are having some issues with indexing PDF files and
> I am wondering if PDFExtractor implements PDFBox or if it is a separate
> utility set.
>
> Rod.
