From: "Alessandro Ilardo"
List-Id: "POI Users List"
Subject: wordExtractor cannot be found to compile with javac
Date: Fri, 6 Jan 2006 00:42:26 +0100
Message-ID: <002201c61251$b08cdb70$898c6850@Hydra>

Hello there,

I'm new to POI, and I want to use it to index Word files in Lucene. I know that something suitable for my purpose should be available at http://www.textmining.org/, but every time I try to open that address I just get this message:

Hacked Fotolog ? ITALY owned .org ? HACKED BY ITALY

so I was not able to check out that package.

In any case, I downloaded the three latest .jar files from http://encore.torchbox.com/poi-cvs-build/ and tried to compile a test class in order to test POI with Lucene. It does not compile because javac cannot find the WordExtractor class.

C:\Documents and Settings\Alessandro\Desktop\Copia di PDFBox-0.7.2\PDFBox-0.7.2> javac -classpath lucene-1.4.3.jar;lucene-demos-1.4.3.jar;PDFBox-0.7.2.jar;poi-3.0.jar;poi-scratchpad-3.0.jar src\org\pdfbox\searchengine\lucene\luceneWORDDocument.java
src\org\pdfbox\searchengine\lucene\luceneWORDDocument.java:294: cannot resolve symbol
symbol  : class WordExtractor
location: class org.pdfbox.searchengine.lucene.LuceneWORDDocument
    WordExtractor extractor = new WordExtractor();
    ^
src\org\pdfbox\searchengine\lucene\luceneWORDDocument.java:294: cannot resolve symbol
symbol  : class WordExtractor
location: class org.pdfbox.searchengine.lucene.LuceneWORDDocument
    WordExtractor extractor = new WordExtractor();
                                  ^
2 errors

This is the code I used:

    HWPFDocument wdoc = new HWPFDocument(is);
    WordExtractor extractor = new WordExtractor();
    String contents = extractor.extractText(wdoc);
    StringReader reader = new StringReader( contents );
    document.add( Field.Text( "contents", reader ) );

I'd appreciate any comments or suggestions to solve my problem.

Thanks in advance
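
A "cannot resolve symbol ... class" error from javac generally means the class is either missing from the classpath or not imported in the source file. One way to narrow this down is to check whether any of the jars on the command line actually contains a WordExtractor class, and under which package. The following is only a minimal sketch using the JDK's own java.util.jar classes; JarClassFinder and its find method are hypothetical names invented for this illustration and are not part of POI, Lucene, or PDFBox.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.List;
import java.util.jar.JarEntry;
import java.util.jar.JarFile;

// Hypothetical diagnostic helper (not part of POI): reports which jars
// contain a class with the given simple name, e.g. "WordExtractor".
class JarClassFinder {

    // Returns one "jarFileName!entryPath" string per matching .class entry.
    static List<String> find(List<Path> jars, String simpleName) throws IOException {
        String suffix = "/" + simpleName + ".class";
        List<String> hits = new ArrayList<>();
        for (Path jar : jars) {
            try (JarFile jarFile = new JarFile(jar.toFile())) {
                Enumeration<JarEntry> entries = jarFile.entries();
                while (entries.hasMoreElements()) {
                    String name = entries.nextElement().getName();
                    // Match classes inside a package as well as in the default package.
                    if (name.endsWith(suffix) || name.equals(simpleName + ".class")) {
                        hits.add(jar.getFileName() + "!" + name);
                    }
                }
            }
        }
        return hits;
    }

    // Usage: java JarClassFinder <SimpleClassName> <jar> [<jar> ...]
    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.err.println("usage: java JarClassFinder <SimpleClassName> <jar> [<jar> ...]");
            return;
        }
        List<Path> jars = new ArrayList<>();
        for (int i = 1; i < args.length; i++) {
            jars.add(Paths.get(args[i]));
        }
        List<String> hits = find(jars, args[0]);
        if (hits.isEmpty()) {
            System.out.println(args[0] + " was not found in any of the given jars");
        }
        for (String hit : hits) {
            System.out.println(hit);
        }
    }
}
```

Run against the jars from the command above, for example `java JarClassFinder WordExtractor poi-3.0.jar poi-scratchpad-3.0.jar`. If an entry is printed, its path gives the package that would need to be imported in luceneWORDDocument.java; if nothing is printed, the class is simply not in those builds.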