lucene-java-user mailing list archives

From Ian Lea
Subject Re: Help wanted with Indexing PDF Documents
Date Tue, 02 Mar 2010 17:58:30 GMT
Sounds like a question for the PDFBox mailing list.  Once you've got
the relevant info out of the PDF you can index it however you like.
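
As a rough illustration of the indexing half only, here is a minimal sketch that assumes the bookmark titles and page numbers have already been extracted from the PDF. It is plain JDK code standing in for a real Lucene index. The names BookmarkIndex, Bookmark, add and pagesFor are hypothetical and are not part of PDFBox or Lucene, and the sample titles and page numbers are made up.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeMap;
import java.util.TreeSet;

// Hypothetical sketch: a tiny in-memory inverted index over bookmark titles.
// Nothing here is PDFBox or Lucene API; extraction is assumed to be done already.
public class BookmarkIndex {

    /** One bookmark after extraction: its title and a 1-based page number (assumed). */
    public static final class Bookmark {
        public final String title;
        public final int page;

        public Bookmark(String title, int page) {
            this.title = title;
            this.page = page;
        }
    }

    // Maps each lowercased title term to the sorted set of pages it points at.
    private final Map<String, SortedSet<Integer>> postings = new TreeMap<>();

    /** Splits the title on non-word characters and records the page under each term. */
    public void add(Bookmark bookmark) {
        for (String term : bookmark.title.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (term.isEmpty()) {
                continue; // split() can yield an empty leading token
            }
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(bookmark.page);
        }
    }

    /** Returns the pages whose bookmark title contains the term, in ascending order. */
    public List<Integer> pagesFor(String term) {
        SortedSet<Integer> pages = postings.get(term.toLowerCase(Locale.ROOT));
        return pages == null ? new ArrayList<>() : new ArrayList<>(pages);
    }

    public static void main(String[] args) {
        BookmarkIndex index = new BookmarkIndex();
        index.add(new Bookmark("Chapter 1: Introduction", 3));
        index.add(new Bookmark("Chapter 2: Indexing", 17));
        System.out.println(index.pagesFor("chapter"));  // [3, 17]
        System.out.println(index.pagesFor("indexing")); // [17]
    }
}
```

With Lucene the same idea would amount to one document per bookmark carrying a title field and a page field, but that wiring is not shown here.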


On Tue, Mar 2, 2010 at 4:11 PM, Ching Zheng wrote:
> Hi,
> I have about 50 PDF documents, each around 10MB in size. I am using
> PDFBox for parsing and am wondering how I can index bookmarks together
> with their corresponding page information.
> I use PDDocumentOutline to get each bookmark's title, but I only have a
> PDNamedDestination, which offers no page number info. Can someone shed some
> light on this? Thanks a lot.
