pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From John Hewson <j...@jahewson.com>
Subject Re: Strip Data out of PDF and save only skeleton.
Date Sat, 31 Oct 2015 04:02:19 GMT

> On 30 Oct 2015, at 20:05, Sriram Varadharajan <varadhukutty@gmail.com> wrote:
> 
> Is there any other alternative like overlaying an opaque rectangle on top
> of the rectangle box that has the data .

No. Your sensitive data will of course still be in the document even if you cover it with
an opaque square!

Do not go down this road.

-- John

> I know the coordinates as i use it
> to extract the data from the PDF at the first place .
> 
> I am also OK filling out rectangles with dark colors . At the end i need
> only the borders and no data .
> 
> 
> 
>> On Fri, Oct 30, 2015 at 7:11 PM, John Hewson <john@jahewson.com> wrote:
>> 
>> This is a very hard thing to get right, especially if you have compliance
>> needs.
>> There are just so many ways that sensitive data could remain embedded in
>> the resulting document.
>> 
>> If you want my advice, don’t attempt this.
>> 
>> — John
>> 
>>>> On 30 Oct 2015, at 18:37, Sriram Varadharajan <varadhukutty@gmail.com>
>>> wrote:
>>> 
>>> We are using PDFBox to process PDF that contains sensitive data .
>> Currently
>>> we don't store these PDF (even after encrypting) due to security
>> compliance
>>> . If there is an ability to strip the data out of PDF we can save the
>> file
>>> and we can use them for analytical purposes
>>> 
>>> Question is  Does PDF box or any other utility out there gives the
>> ability
>>> to blank out all the Data in the PDF and just save the skeleton alone ?
>>> Please share any custom solutions or ideas if any !!
>>> 
>>> Thanks
>> 
>> 
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
>> For additional commands, e-mail: users-help@pdfbox.apache.org
>> 
>> 

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: users-help@pdfbox.apache.org


Mime
View raw message