Return-Path: X-Original-To: apmail-pdfbox-users-archive@www.apache.org Delivered-To: apmail-pdfbox-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C1DAC1882D for ; Sat, 31 Oct 2015 01:37:20 +0000 (UTC) Received: (qmail 27279 invoked by uid 500); 31 Oct 2015 01:37:20 -0000 Delivered-To: apmail-pdfbox-users-archive@pdfbox.apache.org Received: (qmail 27253 invoked by uid 500); 31 Oct 2015 01:37:20 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 27240 invoked by uid 99); 31 Oct 2015 01:37:20 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 31 Oct 2015 01:37:20 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id CF05D1A082F for ; Sat, 31 Oct 2015 01:37:19 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.879 X-Spam-Level: ** X-Spam-Status: No, score=2.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id FzylMteRufty for ; Sat, 31 Oct 2015 01:37:19 +0000 (UTC) Received: from mail-vk0-f52.google.com (mail-vk0-f52.google.com [209.85.213.52]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id 0C780444C7 for ; Sat, 31 Oct 2015 01:37:19 +0000 (UTC) Received: by vkgs66 with SMTP id s66so57959593vkg.1 for ; Fri, 30 Oct 2015 18:37:18 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=J7T2KelEyjLguk1VS2GzwYnjMpyQzJyQTl1Z+zIKodE=; b=BFwFQNklf+DzMcSzf7Gv+HtxTWyyVGj29Yx7Y5p/+dSGGdAwsbh+60bOYi/6hdnf9h pnhygkono9FB8gXgk5edHeNceR2yNyGGmY6oVy6B9kTMa+WiLqOOtKN/B2aU3zS/KcGF amdsSOyWmmyXcu5700fXyk9Rw4VHkOp7yeT+lkJ1wschWkhtx57U6fofk+I5wTcABY4P 7h3VcdWTPNAlYNxT/620SdBcQsao6Ica5VxfLLzmF7WM0HREbzUgHOa/9OzSn7aUPctL fm3f9l7I6oixYalyyI1OmLtGAvPNEAUN+dJRLuAXvbZnKcZBJT6i8qUs6ZeZsXUQifed YpiQ== MIME-Version: 1.0 X-Received: by 10.31.132.195 with SMTP id g186mr7377331vkd.13.1446255438753; Fri, 30 Oct 2015 18:37:18 -0700 (PDT) Received: by 10.31.209.1 with HTTP; Fri, 30 Oct 2015 18:37:18 -0700 (PDT) Date: Fri, 30 Oct 2015 18:37:18 -0700 Message-ID: Subject: Strip Data out of PDF and save only skeleton. From: Sriram Varadharajan To: users@pdfbox.apache.org Content-Type: multipart/alternative; boundary=001a114418b2ae77ab05235c9593 --001a114418b2ae77ab05235c9593 Content-Type: text/plain; charset=UTF-8 We are using PDFBox to process PDF that contains sensitive data . Currently we don't store these PDF (even after encrypting) due to security compliance . If there is an ability to strip the data out of PDF we can save the file and we can use them for analytical purposes Question is Does PDF box or any other utility out there gives the ability to blank out all the Data in the PDF and just save the skeleton alone ? Please share any custom solutions or ideas if any !! Thanks --001a114418b2ae77ab05235c9593--