From: Jeroen Reijn
Date: Mon, 12 Sep 2005 17:58:00 +0200
To: java-user@lucene.apache.org
Subject: Re: PDFBox PDFExtractor

Hi Rod,

PDFBox is a separate project. The PDFExtractor in Jakarta Slide uses
PDFBox's functionality to extract the information from the .pdf file,
so it is built on top of PDFBox rather than being an independent
utility set.

Hope this answers your question.

Jeroen

Rod.Madden@ferguson.com wrote:
> Hi,
>
> I am new to Lucene and looking at some existing Lucene code....
>
> I am confused about the relationship ( if any ) between
> org.apache.slide.extractor.PDFExtractor methods and org.PDFBox.cos
> methods for the purposes of working with PDF files.
>
> I have found info on the web regarding PDFBox, however, I have found
> little regarding .PDFExtractor.
>
> I am curious since we are having some issues with indexing PDF files
> and I am wondering if PDFExtractor implements PDFBox or if it is a
> separate utility set.
>
> Rod.