lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ben Litchfield <>
Subject [ANN]PDFBox-0.6.2
Date Mon, 21 Apr 2003 03:12:26 GMT

I am proud to announce the latest version of PDFBox.  This version comes
with some exciting changes to the project.  Bob Dickinson joins PDFBox the
development team and brings 20 years of programming expertise to this
project.  Bob is responsible for all changes in the 0.6.2 release.

PDFBox has also been moved to sourceforge which offers many features that
people have been requesting such as bug tracking and mailing lists.

Version 0.6.2 has fixed many issues with text extraction and it is
recommended that all Lucene users that use PDFBox upgrade to this latest

PDFBox homepage

PDFBox Sourceforge site

PDFBox-0.6.2 release notes
-Modified build so that settings are no longer required
-Added required libraries to CVS
-Added log4j logging
-Significant text extraction work
-Added automatic handling of files encrypted with the empty password
-Added automated tests and test data for text extraction
-Removed unimplemented decoders from filters test
-Fixed several LZW decode bugs introduced after 0.5.6
-Fixed bugs relating to processing out of spec PDF's with bad # escaping
    the name (" Error: expected hex number" bug)
-Fixed Lucene UID generation bug
-Fixed GetFontWidths null pointer exception bug


To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message