lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ching Zheng <zchin...@gmail.com>
Subject Help wanted with Indexing PDF Documents
Date Tue, 02 Mar 2010 16:11:30 GMT
Hi,
I have about 50 PDF douments with size of each is around 10MB. I am using
PDFbox for parsing, just wondering how I can index bookmarsk with its
corresponded page information?

I use PDDocumentOutline to get bookmark's title, but I only have
PDNamedDestination which offers no page number info. Can someone shed some
light on this? Thanks  a lot.

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message