pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From European Neuroscience Center <mnachev.nscenter...@gmail.com>
Subject Re: Extract embedded SVG image from PDF file
Date Thu, 07 Mar 2019 06:32:42 GMT
The Copyright is not an issue, because we have a contract and have
rights. The main goal is to automate a process which takes a while and
repeat many time. So, using Adobe Illustrator or free Inkscape editor is
not option. We have access to the sources (Website), but this is time
consuming. Partly, there are web services, which we can use, but not for
all tasks. The PDF files are generated automatically by schedule, so this
way can be fully automated.

I will try to find a way to convert this PDF to SVG, but most importantly
for me is to be able to take some parts of each page, not whole page. Do
you know how this can be done? Are there any examples?

Thank you.

On Thu, Mar 7, 2019 at 1:10 AM Jan Tosovsky <j.tosovsky@email.cz> wrote:

> On 2019-03-06 European Neuroscience Center wrote:
> > On 2019-03-06 Tilman Hausherr wrote:
> > >
> > > I looked at your PDF file with PDFDebugger, no SVG there.
> > > Just PDF vector graphics.
> >
> > What is the way to extract this PDF vector graphics in XML or
> > SVG format?
> During a typical conversion the SVG syntax is translated to the PDF
> syntax. Not all features can be translated 1:1 so the result can be
> degraded. It applies also for the reverse process.
> You can extract the graphics via many vector editors like Adobe
> Illustrator or free Inkscape editor.
> But it is always better to have the original source (SVG chart) or even
> better the original data so you can generate that chart as you like.
> Btw seeing the copyright notice in the footer I suppose any reverse data
> parsing sounds like intellectual property infringement.
> Jan
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscribe@pdfbox.apache.org
> For additional commands, e-mail: users-help@pdfbox.apache.org

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message