Subject: svn commit: r970484 - /websites/production/pdfbox/content/2.0/migration.html
Date: Wed, 28 Oct 2015 10:51:37 -0000
To: commits@pdfbox.apache.org
From: msahyoun@apache.org

Author: msahyoun
Date: Wed Oct 28 10:51:37 2015
New Revision: 970484

Log:
Site checkin for project Apache PDFBox Website

Modified:
    websites/production/pdfbox/content/2.0/migration.html

Modified: websites/production/pdfbox/content/2.0/migration.html
==============================================================================
--- websites/production/pdfbox/content/2.0/migration.html (original)
+++ websites/production/pdfbox/content/2.0/migration.html Wed Oct 28 10:51:37 2015
@@ -285,6 +285,23 @@ and so on. The add method n

Advanced use case examples can be found in the examples package under org/apache/pdfbox/examples/printing/Printing.java

+

Text Extraction

+ +

PDFTextStripper no longer sets the color information in the PDGraphicsState. If you need color information for the text being processed,
+you can extend PDFTextStripper and add the following operators to the constructor:

+
addOperator(new SetStrokingColorSpace());
+addOperator(new SetNonStrokingColorSpace());
+addOperator(new SetStrokingDeviceCMYKColor());
+addOperator(new SetNonStrokingDeviceCMYKColor());
+addOperator(new SetNonStrokingDeviceRGBColor());
+addOperator(new SetStrokingDeviceRGBColor());
+addOperator(new SetNonStrokingDeviceGrayColor());
+addOperator(new SetStrokingDeviceGrayColor());
+addOperator(new SetStrokingColor());
+addOperator(new SetStrokingColorN());
+addOperator(new SetNonStrokingColor());
+addOperator(new SetNonStrokingColorN());
+

Interactive Forms

Large parts of the support for interactive forms (AcroForms) have been rewritten. The most notable change from 1.8.x is that