Return-Path: X-Original-To: apmail-pdfbox-users-archive@www.apache.org Delivered-To: apmail-pdfbox-users-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 714D410C50 for ; Tue, 11 Jun 2013 09:23:21 +0000 (UTC) Received: (qmail 38285 invoked by uid 500); 11 Jun 2013 09:23:20 -0000 Delivered-To: apmail-pdfbox-users-archive@pdfbox.apache.org Received: (qmail 37764 invoked by uid 500); 11 Jun 2013 09:23:13 -0000 Mailing-List: contact users-help@pdfbox.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@pdfbox.apache.org Delivered-To: mailing list users@pdfbox.apache.org Received: (qmail 37734 invoked by uid 99); 11 Jun 2013 09:23:10 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Jun 2013 09:23:10 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW X-Spam-Check-By: apache.org Received-SPF: error (nike.apache.org: local policy) Received: from [74.125.83.47] (HELO mail-ee0-f47.google.com) (74.125.83.47) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 11 Jun 2013 09:23:04 +0000 Received: by mail-ee0-f47.google.com with SMTP id e49so3508911eek.20 for ; Tue, 11 Jun 2013 02:22:23 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:x-gm-message-state; bh=AcGTtf37H+OFkFlCJu9d/+zviTBBWdKokZOfnXJWuv8=; b=muMG67soXzQgX677Fm5806DWHv0eO+cMbN3skclcg9/T0BIdPRjSIhxyHhREZEC7Ra U2XTHAJIh+7LahYWYWYoCWbE1X3dsOLe7xP0cyQJS88dmUMKgMzOMqNfcqPp6hdijOzJ jbj4q2ojfCxfvg8ZYCL5DazOZzKljl1pteMQh/Md23cNT0zIg8iELZKUtV7uChVblc9x rszQyTFNnxwoseuP/MSJcIoTBC+TjnzxpfkQhpJLjwKv7kaSkEQH0bAbzLu1HldbNeBU phd2P4dVUwiNmQ1slRwSGl1mFTOacjr2rJQ0APexigLBVonYNr9beQ4aAi8ZAXF1uWae blRw== MIME-Version: 1.0 X-Received: by 10.15.35.71 with SMTP id f47mr15639548eev.15.1370942543508; Tue, 11 Jun 2013 02:22:23 -0700 (PDT) Received: by 10.14.185.12 with HTTP; Tue, 11 Jun 2013 02:22:23 -0700 (PDT) In-Reply-To: <58B51A3B-9F5F-420B-939B-F5EF56420086@fileaffairs.de> References: <4544C14C-8FE5-4227-9D74-C737A8E5E60A@fileaffairs.de> <113DDBBA-E583-4884-BEEC-C51F812DEA24@fileaffairs.de> <58B51A3B-9F5F-420B-939B-F5EF56420086@fileaffairs.de> Date: Tue, 11 Jun 2013 14:52:23 +0530 Message-ID: Subject: Re: PDF to Image conversion From: Ankur Tripathi To: users@pdfbox.apache.org Content-Type: multipart/alternative; boundary=089e016282b850716a04dedd6f37 X-Gm-Message-State: ALoCoQmQ/8BDB4RVV72BCOWLMkNtWFy45wk47dSBn5Ijc+jbl5tRU2n8z7dQZ5rBnSKztn2zK8gF X-Virus-Checked: Checked by ClamAV on apache.org --089e016282b850716a04dedd6f37 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Hi Maruan, Any update on this ? On Thu, Jun 6, 2013 at 10:05 PM, Maruan Sahyoun wro= te: > Hi Ankur, > > unfortunately there are some issues with rendering these files. It needs > some further analysis to find out why that happens. > > BR > Maruan Sahyoun > > Am 06.06.2013 um 17:05 schrieb Ankur Tripathi : > > > Maruan, > > > > I had added pdf's to dropbox you can download it from here > > https://www.dropbox.com/sh/ruwz0l5hya2l4tp/9-zLVv96uw > > > > Let me know if you cannot access this link. > > > > > > On Thu, Jun 6, 2013 at 8:22 PM, Maruan Sahyoun >wrote: > > > >> Hi Ankur, > >> > >> due to limitations of the mailing list the attachment didn't make it - > >> could you upload to a public site? > >> > >> BR > >> Maruan > >> > >> Am 06.06.2013 um 15:57 schrieb Ankur Tripathi = : > >> > >>> Hi Maruan, > >>> > >>> I have successfully converted pdf to images using pdfbox. However the= re > >> are couple of attached pdf which does not shows up filled in details. > >>> > >>> In my use case i need to merge different pdf's and than convert to > >> series of images. Since these pdf are not converted to image properly = i > >> would like to avoid merge or fix image generation. Can you please poin= t > me > >> to right direction. > >>> > >>> Converting this pdf with imagemagick works fine but i would like to > >> avoid os level tool. > >>> > >>> Really appreciate your help > >>> > >>> Thanks > >>> -Ankur Tripathi > >>> > >>> > >>> On Thu, Jun 6, 2013 at 4:01 PM, Maruan Sahyoun > > >> wrote: > >>>> Hi, > >>>> > >>>> the question if a PDF can be rendered successfully is not dependent = on > >> the fact that it's a PDF/A file. In general PDFBox does a good job in > >> converting a PDF to image and it supports PDF/A as well as not PDF/A > >> compliant files. There are some limitations though which may or may no= t > >> apply to you. > >>>> > >>>> - there are limitations in PDFs render mode i.e. not all possible > >> render modes are supported > >>>> - there are limitations in PDFs shading i.e. not all shadings are > >> supported > >>>> - font rendering is dependent on awt i.e. PDFBox generates a font fr= om > >> embedded font which is passed to awt. This works in most but not all > cases. > >>>> - always test if e.g. Adobe Reader can display the file > >>>> =85. > >>>> > >>>> If while rendering PDFBox hit's an (yet) unsupported feature it will > be > >> reported. If you come across such a limitation please log an enhanceme= nt > >> request in Jira (you should search first if someone else already had a > >> similar issue and add to that) so we can look into removing the > limitation. > >> Of course if you are able to contribute that's even better. > >>>> > >>>> For PDF/A (PDF/A-1b) PDFBox passes several test suites. So if you > think > >> you have a valid PDF/A file and PDFBox complains we are very intereste= d > in > >> finding out why this is the case. But it's very likely that your file > might > >> not be PDF/A-1b compliant. > >>>> > >>>> If you have specific questions/issues please feel free to ask. > >>>> > >>>> Bottom line - I think PDFBox will help you doing the conversion. You= r > >> milage will vary dependent on the files content. > >>>> > >>>> > >>>> BR > >>>> Maruan Sahyoun > >>>> > >>>> Am 06.06.2013 um 12:12 schrieb Ankur Tripathi >: > >>>> > >>>>> Hi, > >>>>> > >>>>> I have a use case in my project where i want to convert every page = of > >> pdf > >>>>> to image. I have tried different opensource libraries like > >> pdfrenderere, > >>>>> open source version of jpedal etc but with each of them we have > >> problems > >>>>> with PDF/A pdf's. Before trying pdfbox i would like to know that if > >> support > >>>>> for embedded font and pdf/a is available in pdfbox api. If not is > >> there any > >>>>> way to identify if a particular pdf can not be converted to image, = I > >>>>> already tried > http://pdfbox.apache.org/cookbook/pdfavalidation.htmlto > >>>>> validate uploaded pdf's but it fails for all of our pdfs but we are > >> able to > >>>>> convert them into image properly. There are only few formed filled > pdf > >>>>> which have not been converted to image properly. > >>>>> > >>>>> Thanks for help. > >>>>> > >>>>> > >>>>> Thanks > >>>>> -Ankur Tripathi > >>> > >> > > --089e016282b850716a04dedd6f37--