From general-return-139-apmail-lucene-general-archive=lucene.apache.org@lucene.apache.org Sun Sep 11 18:57:08 2005 Return-Path: Delivered-To: apmail-lucene-general-archive@www.apache.org Received: (qmail 99631 invoked from network); 11 Sep 2005 18:57:08 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 11 Sep 2005 18:57:08 -0000 Received: (qmail 5589 invoked by uid 500); 11 Sep 2005 18:57:07 -0000 Delivered-To: apmail-lucene-general-archive@lucene.apache.org Received: (qmail 5570 invoked by uid 500); 11 Sep 2005 18:57:07 -0000 Mailing-List: contact general-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@lucene.apache.org Delivered-To: mailing list general@lucene.apache.org Received: (qmail 5557 invoked by uid 99); 11 Sep 2005 18:57:06 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 11 Sep 2005 11:57:06 -0700 X-ASF-Spam-Status: No, hits=0.4 required=10.0 tests=DNS_FROM_RFC_ABUSE,HTML_30_40,HTML_MESSAGE X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [206.190.49.117] (HELO web54307.mail.yahoo.com) (206.190.49.117) by apache.org (qpsmtpd/0.29) with SMTP; Sun, 11 Sep 2005 11:57:17 -0700 Received: (qmail 5471 invoked by uid 60001); 11 Sep 2005 18:57:03 -0000 DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=Message-ID:Received:Date:From:Subject:To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=PeCBwpQQs2LPJBSv2o2+61enHZI9OxbIPouQrhQYZg+QK2IL0ZeP1bGgemTZCXCJfJzMzz1eMPd+7pN17hRlPhMSydKC/Rz6nvCD7y96mh+7Ev6kE8mgM6Tom3hlYR1A2umX46O0tNkvoO9YhDRcXnNceleGGoBR5cBy8nozYK4= ; Message-ID: <20050911185703.5469.qmail@web54307.mail.yahoo.com> Received: from [141.70.115.175] by web54307.mail.yahoo.com via HTTP; Sun, 11 Sep 2005 11:57:03 PDT Date: Sun, 11 Sep 2005 11:57:03 -0700 (PDT) From: tirupathi reddy Subject: Problem of indexing pdf files To: general@lucene.apache.org MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="0-1363397451-1126465023=:5088" Content-Transfer-Encoding: 8bit X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N --0-1363397451-1126465023=:5088 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit Hello, I am getting the following warning message when I am indexing the pdf files using Lucene Indexing. log4j:WARN No appenders could be found for logger (org.pdfbox.pdfparser.PDFParser). log4j:WARN Please initialize the log4j system properly. This is the code I am using: if(pdf.exists()) { String text = ""; try{ PDDocument document = PDDocument.load(pdf); // laden des Files PDFTextStripper pts = new PDFTextStripper(); //Extrahieren des Textes text = pts.getText(document); document.close(); } catch(IOException e){ System.out.println("File not found"); } mDocument.add(Field.Text("fulltext", text)); thanx, MTREDDY Tirupati Reddy Manyam 24-06-08, Sundugaullee-24, 79110 Freiburg GERMANY. Phone: 00497618811257 cell : 004917624649007 __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com --0-1363397451-1126465023=:5088--