pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Neil Pitman <neil.pit...@aquaforest.com>
Subject RE: C# Version of PDFBox?
Date Wed, 30 Mar 2016 14:51:31 GMT
Our primary use case is converting Image or Partially Searchable PDFs to fully text searchable
PDFs.

This involves reading the input PDF to determine which pages are image only, extracting the
page images, running OCR processing with our own engine and then adding a hidden text layer
to the PDF.

We do also have other use cases such as text extraction / split / merge etc but the OCR use
case is the most performance critical.

-----Original Message-----
From: Maruan Sahyoun [mailto:sahyoun@fileaffairs.de] 
Sent: 30 March 2016 15:47
To: users@pdfbox.apache.org
Subject: Re: C# Version of PDFBox?

Hi,

> Am 30.03.2016 um 16:44 schrieb Neil Pitman <neil.pitman@aquaforest.com>:
> 
> Our main goal is performance and whilst IKVM is great it does add a significant overhead.
 
> 
> We would certainly rather avoid a complete rewrite so would be interested to hear thoughts
of any other options!
> 

are there specific use cases you'd like to address? Like splitting & merging, form filling,
rendering …

BR
Maruan

> -----Original Message-----
> From: Daniel Wilson [mailto:williamstonconsulting@gmail.com]
> Sent: 30 March 2016 15:14
> To: users@pdfbox.apache.org
> Subject: Re: C# Version of PDFBox?
> 
> Neat idea.  I'm not sure if I have time to contribute.
> 
> I guess ... why not IKVM?  Is it that there are gaps in IKVM that are messing you up?
 I've done a bit of my own tweaking to IKVM on a fork of the code to address one or two such
gaps.  I'd be happy to show you the Git repo.
> 
> But ... is there another reason?  And another way BESIDES rewriting all the Java code
in C#?
> 
> On Wed, Mar 30, 2016 at 10:07 AM, Neil Pitman 
> <neil.pitman@aquaforest.com>
> wrote:
> 
>> 
>> 
>> We are looking at ways of getting to a C# version of PDFBox to avoid 
>> the overhead of IKVM in our .Net-based solutions.
>> 
>> 
>> 
>> We are certainly willing to put engineering effort into this but were 
>> hoping that there  were also other interested parties who would also 
>> be willing to contribute!
>> 
>> 
>> 
>> We would be very interested to gauge the level of interest in such a 
>> project both from the user viewpoint and from the core development 
>> community.
>> 
>> 
>> 
>> Best Regards
>> 
>> 
>> 
>> Neil Pitman
>> 
>> Aquaforest
>> 
>> 
>> 
>> [image: cid:image001.png@01CF8B0A.409F4FB0][image: Description:
>> Description:
>> https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQ5uvll4GirLE2_e
>> B 6kFZfTu6QAR34oHE6RlagToVxnE0rZHObiDA]
>> 
>> Aquaforest Ltd
>> 
>> Suite 32
>> 
>> Midshires House
>> 
>> Smeaton Close
>> 
>> Aylesbury
>> 
>> Bucks HP19 8HL
>> 
>> UNITED KINGDOM
>> 
>> Email: info@aquaforest.com
>> 
>> Web: www.aquaforest.com
>> 
>> Tel:  +44 (0) 1296 768727
>> 
>> Aquaforest Ltd - England - Company No.4344383 – Registered Office: As 
>> above
>> 
>> *****EMAIL CONFIDENTIALITY NOTICE*****
>> 
>> This message is or maybe private and confidential. If you have 
>> received this message in error, please notify us and remove it from 
>> your system. The recipient of this email should not without the 
>> sender’s permission disclose or copy any of the confidential information received.
>> 
>> 
>> 
>> 
>> 
>> 
>> 
>> _____________________________________________________________________
>> _ This email has been scanned by the Symantec Email Security.cloud 
>> service.
>> For more information please visit http://www.symanteccloud.com 
>> _____________________________________________________________________
>> _
>> 
> 
> ______________________________________________________________________
> This email has been scanned by the Symantec Email Security.cloud service.
> For more information please visit http://www.symanteccloud.com 
> ______________________________________________________________________
> 
> ______________________________________________________________________
> This email has been scanned by the Symantec Email Security.cloud service.
> For more information please visit http://www.symanteccloud.com 
> ______________________________________________________________________
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
> For additional commands, e-mail: users-help@pdfbox.apache.org
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: users-help@pdfbox.apache.org

______________________________________________________________________
This email has been scanned by the Symantec Email Security.cloud service.
For more information please visit http://www.symanteccloud.com ______________________________________________________________________

______________________________________________________________________
This email has been scanned by the Symantec Email Security.cloud service.
For more information please visit http://www.symanteccloud.com
______________________________________________________________________

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: users-help@pdfbox.apache.org

Mime
View raw message