From: Peter Murray-Rust
To: Andrey Kuznetsov
Cc: users@pdfbox.apache.org
Date: Mon, 7 May 2012 10:21:25 +0100
Subject: Re: Extracting vector graphics from PDF

On Mon, May 7, 2012 at 10:10 AM, Andrey Kuznetsov wrote:

> Hi Peter,
>
> did you try to trace where setFont() gets called from?
>
> Best Regards
>
> Andrey

Andrey - this is very helpful of you - I hope you have time to comment on
the stack trace.

PDFSVGGraphics2D is my extended SVGGraphics engine.
PDF2SVGReader.writePage(PDF2SVGReader.java:115) is given below the trace.

    at org.xmlcml.graphics.pdf2svg.PDFSVGGraphics2D.setFont(PDFSVGGraphics2D.java:81)
    at org.apache.pdfbox.pdmodel.font.PDSimpleFont.writeFont(PDSimpleFont.java:304)
    at org.apache.pdfbox.pdmodel.font.PDSimpleFont.drawString(PDSimpleFont.java:114)
    at org.apache.pdfbox.pdfviewer.PageDrawer.processTextPosition(PageDrawer.java:194)
    at org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEngine.java:494)
    at org.apache.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.java:62)
    at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:551)
    at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:274)
    at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:251)
    at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:225)
    at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(PageDrawer.java:107)
    at org.xmlcml.graphics.pdf2svg.PDF2SVGReader.writePage(PDF2SVGReader.java:115)
    at org.xmlcml.graphics.pdf2svg.PDF2SVGReader.readPDFFile(PDF2SVGReader.java:99)
    at org.xmlcml.graphics.pdf2svg.PDF2SVGReader.main(PDF2SVGReader.java:210)
    at org.xmlcml.graphics.pdf.PDFReaderTest.testBMC(PDFReaderTest.java:16)

    // writePage is hacked from PDFReader
    private void writePage(int pageNumber) {
        try {
            // PMR
            PageDrawer drawer = new PageDrawer();
            PageWrapper wrapper = new PageWrapper( this );
            PDPage page = (PDPage)pages.get(pageNumber);
            wrapper.displayPage( page );
            // size the drawing area from the page's crop box
            PDRectangle cropBox = page.findCropBox();
            Dimension drawDimension = cropBox.createDimension();
            // render the page into the SVG graphics engine, then write it out
            svgGraphics2D = this.createSVGGraphics();
            drawer.drawPage( svgGraphics2D, page, drawDimension );
            writeSVG(pageNumber);
        } catch (IOException exception) {
            exception.printStackTrace();
        }
    }

-- 
Peter Murray-Rust
Reader in Molecular Informatics
Unilever Centre, Dep. Of Chemistry
University of Cambridge
CB2 1EW, UK
+44-1223-763069
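For anyone who wants to reproduce this kind of trace, here is a minimal sketch of one way to record who calls setFont(). It is not the code in PDFSVGGraphics2D: the class TracingFontSink, its String-based setFont(), and the helper callerFrames() are hypothetical names made up for illustration, and the sketch uses only the JDK (no PDFBox or Batik classes). In a real Graphics2D subclass the same capture would presumably sit in the overridden setFont(Font), next to a call to super.setFont(font).

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for a graphics class whose setFont() we want to trace. */
public class TracingFontSink {
    private String fontName;
    private List<String> lastTrace = new ArrayList<>();

    /** Returns up to maxFrames stack frames, starting with the method that called callerFrames(). */
    public static List<String> callerFrames(int maxFrames) {
        // A Throwable records the current call stack when it is constructed; it is never thrown.
        StackTraceElement[] stack = new Throwable().getStackTrace();
        List<String> frames = new ArrayList<>();
        // stack[0] is callerFrames() itself, so start at index 1.
        for (int i = 1; i < stack.length && frames.size() < maxFrames; i++) {
            frames.add(stack[i].toString());
        }
        return frames;
    }

    /** Stores the font name and records the call stack that led here. */
    public void setFont(String fontName) {
        this.fontName = fontName;
        lastTrace = callerFrames(10);
        for (String frame : lastTrace) {
            System.err.println("    at " + frame);
        }
    }

    public String getFontName() {
        return fontName;
    }

    public List<String> getLastTrace() {
        return lastTrace;
    }

    public static void main(String[] args) {
        new TracingFontSink().setFont("Helvetica");
    }
}
```

The first frame printed is setFont() itself and the frames after it are its callers, which is the same layout as the trace above. Capping the number of frames keeps the output readable when the method is called once per text run.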