pdfbox-commits mailing list archives

From msahy...@apache.org
Subject svn commit: r1711439 - /pdfbox/cmssite/branches/jekyll-migration/content/2.0/migration.md
Date Fri, 30 Oct 2015 10:13:32 GMT
Author: msahyoun
Date: Fri Oct 30 10:13:32 2015
New Revision: 1711439

URL: http://svn.apache.org/viewvc?rev=1711439&view=rev
Log:
PDFBOX-3030: correct Text Extraction information

Modified:
    pdfbox/cmssite/branches/jekyll-migration/content/2.0/migration.md

Modified: pdfbox/cmssite/branches/jekyll-migration/content/2.0/migration.md
URL: http://svn.apache.org/viewvc/pdfbox/cmssite/branches/jekyll-migration/content/2.0/migration.md?rev=1711439&r1=1711438&r2=1711439&view=diff
==============================================================================
--- pdfbox/cmssite/branches/jekyll-migration/content/2.0/migration.md (original)
+++ pdfbox/cmssite/branches/jekyll-migration/content/2.0/migration.md Fri Oct 30 10:13:32 2015
@@ -123,8 +123,8 @@ if (job.printDialog()) {
 Advanced use case examples can be found in th examples package under org/apache/pdfbox/examples/printing/Printing.java
 
 ### Text Extraction
-``PDFTextStripper`` no longer sets the color information in the ``PDGraphicsState ``. If you need color information for the text being processed
-you can extend ``PDFTextStripper``and add the following ``Operators`` to the constructor:
+In 1.8, to get the text colors, one method was to pass an expanded .properties file to the PDFStripper constructor. To achieve the same 
+in PDFBox 2.0 you can extend ``PDFTextStripper``and add the following ``Operators`` to the constructor:
 
 ~~~java
 addOperator(new SetStrokingColorSpace());
@@ -149,6 +149,8 @@ tree are now represented by the `PDNonTe
 With PDFBox 2.0.0 the prefered way to iterate through the fields is now
 
 ~~~java
+PDAcroForm form;
+...
 for (PDField field : form.getFieldTree())
 {
     ... (do something)


