Mailing-List: contact dev-help@pdfbox.apache.org; run by ezmlm
Precedence: bulk
Reply-To: dev@pdfbox.apache.org
Message-ID: <816764664.125131267971447252.JavaMail.jira@brutus.apache.org>
Date: Sun, 7 Mar 2010 14:17:27 +0000 (UTC)
From: "Andreas Lehmkühler (JIRA)" <jira@apache.org>
To: dev@pdfbox.apache.org
Subject: [jira] Created: (PDFBOX-650) Remove dependency to lucene
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable

Remove dependency to lucene
---------------------------

                 Key: PDFBOX-650
                 URL: https://issues.apache.org/jira/browse/PDFBOX-650
             Project: PDFBox
          Issue Type: Improvement
          Components: Lucene, Utilities
    Affects Versions: 1.0.0
            Reporter: Andreas Lehmkühler
            Assignee: Andreas Lehmkühler


The current PDFBox version extracts all the data it needs from a PDF document
and uses Lucene to create an index for the Lucene search engine.

To avoid the dependency on Lucene, PDFBox should only extract the data, which
can then be used to create a Lucene index outside of PDFBox. That would
decrease the number of external jars and would eliminate another potential
issue caused by changing APIs, such as those coming with Lucene 3.0.
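
As a rough illustration of that split (this is not the attached patch, and
every class and method name below is hypothetical), the extraction side could
hand back plain field/value pairs and leave index creation to the caller:

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: none of these names come from PDFBox, Lucene, or the patch.
class ExtractionSketch {

    /** Library-neutral result of the extraction step. */
    static final class ExtractedFields {
        private final Map<String, String> fields = new LinkedHashMap<>();

        void put(String name, String value) {
            if (value != null) { // skip metadata the document does not provide
                fields.put(name, value);
            }
        }

        Map<String, String> asMap() {
            return Collections.unmodifiableMap(fields);
        }
    }

    /** Implemented by the caller, outside PDFBox, for example on top of Lucene. */
    interface FieldSink {
        void accept(String name, String value);
    }

    /** Stand-in for the extraction side; real code would read these values from a PDF. */
    static ExtractedFields extract(String title, String author, String contents) {
        ExtractedFields result = new ExtractedFields();
        result.put("Title", title);
        result.put("Author", author);
        result.put("contents", contents);
        return result;
    }

    /** Hands every extracted field to the caller's sink in insertion order. */
    static void index(ExtractedFields data, FieldSink sink) {
        for (Map.Entry<String, String> entry : data.asMap().entrySet()) {
            sink.accept(entry.getKey(), entry.getValue());
        }
    }

    public static void main(String[] args) {
        ExtractedFields data = extract("Annual report", null, "full text of the document");
        // A Lucene-based caller would add each pair to its own index document here.
        index(data, (name, value) -> System.out.println(name + " -> " + value));
    }
}
```

With a shape like this, only the caller's FieldSink implementation would need
to change when the Lucene API changes.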

I've created two new classes based on existing code (one for the extraction
and one as an example of how to use that feature) and attached them as a patch.

WDYT?

If that patch is added to the trunk, the existing code will be removed,
including both Lucene jars.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.