pdfbox-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From le...@apache.org
Subject svn commit: r1568811 - /pdfbox/cmssite/trunk/content/ideas.mdtext
Date Sun, 16 Feb 2014 19:34:53 GMT
Author: lehmi
Date: Sun Feb 16 19:34:53 2014
New Revision: 1568811

URL: http://svn.apache.org/r1568811
Log:
Added some new ideas and more information about the targeted version

Modified:
    pdfbox/cmssite/trunk/content/ideas.mdtext

Modified: pdfbox/cmssite/trunk/content/ideas.mdtext
URL: http://svn.apache.org/viewvc/pdfbox/cmssite/trunk/content/ideas.mdtext?rev=1568811&r1=1568810&r2=1568811&view=diff
==============================================================================
--- pdfbox/cmssite/trunk/content/ideas.mdtext (original)
+++ pdfbox/cmssite/trunk/content/ideas.mdtext Sun Feb 16 19:34:53 2014
@@ -3,7 +3,7 @@ Title: Ideas
 ## Ideas
 
 There are several ideas to enhance PDFBox. These are outlined below together with 
-comments and te releases they are planned for as soon as there is agreement to do the
+comments and the releases they are planned for as soon as there is agreement to do the
 implementation.
 
 ### Enhance type safety
@@ -12,19 +12,27 @@ Enhance the type safety of PDFBox and ad
 
 ### Remove all deprecated methods
 
-#### handle large pdf files
+This is an ongoing effort and most/all deprecated methods will be removed in PDFBox 2.0.0
+
+### Handle large pdf files
 
 In addition to the pdf parsing pdfbox does not always handle large pdf files well as some

 of the references are implemented as int instead of long
 
 ### Switch to Java 1.6
 
+PDFBox 2.0.0 has Java 6 as minimum requirement.
+
 ### Break PDFBox into modules
 
-In order to support different use cases and provide a minimal toolset PDFBox should be 
+In order to support different use cases and provide a minimal toolset PDFBox 2.0.0 should
be 
 separated into different modules. This goes inline with rearranging some of the code
-e.g. remove awt from PDDocument.
+e.g. remove AWT from PDDocument.
 
+### Enhance the font rendering
+
+PDFBox 2.0.0 will render most of the fonts without using AWT.
+ 
 ### Replace/enhance PDF parsing
 
 The old "classic" PDF parser in PDFBox is not in line with the PDF specification as it parses
@@ -36,10 +44,14 @@ enhanced that situation but there is a n
 - parsing according to structure
 - COS level document
 - PD level document
+- add some self healing mechanism to process corrupt files
 
 In addition handling documents which are not conforming shouldn't be part of the core parser
 but of a extentable approach e.g. by adding hooks to allow for handling parsing exceptions.
 
+### Add the ability to create pdfs using unicode encoded text
+
+The recent PDFBox version is limited to WinANSI encoded text. 2.0.0 should have unicode support
as well.
 
 ### Rearchitect the COS level objects
 



Mime
View raw message