lucene-java-user mailing list archives

From Ben Litchfield
Subject PDF Highlighter Package
Date Tue, 01 Mar 2005 03:50:14 GMT

For those of you who support indexing PDF documents, PDFBox now supports
Adobe's PDF Highlight specification.

PDFBox is now capable of generating an XML document that describes which
words in a PDF document to highlight.
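
To give a rough idea of what such a highlight document involves, here is a
minimal sketch in plain Java. It is not PDFBox's code and does not use the
PDFBox API: the class HighlightSketch and the method buildHighlightXml are
hypothetical names, the page text is assumed to have been extracted already,
and the element and attribute layout (Body, Highlight, loc with pg/pos/len)
is an assumed shape that should be checked against Adobe's specification.
Real character offsets must also follow the counting rules in that
specification, which plain string offsets may not match.

```java
import java.util.List;
import java.util.Locale;

public class HighlightSketch {

    /**
     * Hypothetical helper: builds a highlight description from per-page text.
     * Pages are numbered from 0; pos is the offset of a match within the
     * page text and len is the keyword length, both counted in characters.
     */
    public static String buildHighlightXml(List<String> pageTexts, List<String> keywords) {
        StringBuilder xml = new StringBuilder();
        // Assumed header layout; verify against Adobe's specification.
        xml.append("<XML>\n<Body units=characters version=2>\n<Highlight>\n");
        for (int page = 0; page < pageTexts.size(); page++) {
            // Case-insensitive match. This assumes lowercasing does not
            // change the string length, which holds for ASCII text.
            String text = pageTexts.get(page).toLowerCase(Locale.ROOT);
            for (String keyword : keywords) {
                String needle = keyword.toLowerCase(Locale.ROOT);
                if (needle.isEmpty()) {
                    continue; // an empty keyword would match everywhere
                }
                int pos = text.indexOf(needle);
                while (pos >= 0) {
                    xml.append("<loc pg=").append(page)
                       .append(" pos=").append(pos)
                       .append(" len=").append(needle.length())
                       .append(">\n");
                    pos = text.indexOf(needle, pos + needle.length());
                }
            }
        }
        xml.append("</Highlight>\n</Body>\n</XML>\n");
        return xml.toString();
    }

    public static void main(String[] args) {
        List<String> pages = List.of(
                "Lucene indexes text.",
                "PDFBox extracts text from PDF files.");
        System.out.print(buildHighlightXml(pages, List.of("text")));
    }
}
```

In a Lucene setup, the keywords would typically be the terms of the user's
query, and the resulting document would be served alongside the PDF so the
viewer can jump to the first match.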

An "in action" example can be seen at

You can enter any web-accessible PDF and any keywords.  The PDF will open
normally and, after a short pause (this is running on an old, slow server),
will jump to the first selected keyword.

Source code is available in CVS or in tonight's nightly build.

Any comments/suggestions are welcome.

Special thanks to Stephan Lagraulet, who made this possible with code

