lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ben Litchfield <...@csh.rit.edu>
Subject PDF Highlighter Package
Date Tue, 01 Mar 2005 03:50:14 GMT

For those of you that support indexing PDF documents, PDFBox now supports
Adobe's PDF Highlight specification
(http://partners.adobe.com/public/developer/en/pdf/HighlightFileFormat.pdf)

PDFBox is now capable of generating an XML document that describes words
in a PDF document to highlight.

An "in action" example can be seen at

http://pavilion.csh.rit.edu:8080/pdfbox/index.html

You can enter any web accessible PDF and any keywords.  The PDF will open
normally and after a short pause(this is running on an old slow server)
will jump to the first selected keyword.

Source code is available in CVS or in tonight's nightly build.

Any comments/suggestions are welcome.

Special thanks to Stephan Lagraulet, who made this possible with code
contributions.

Ben
http://www.pdfbox.org

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message