incubator-ooo-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bugzi...@apache.org
Subject DO NOT REPLY [Bug 119312] PDF import has very poor layout accuracy
Date Tue, 08 May 2012 20:43:59 GMT
https://issues.apache.org/ooo/show_bug.cgi?id=119312

Dave Fisher <wave@apache.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|UNCONFIRMED                 |CONFIRMED
                 CC|                            |wave@apache.org
     Ever Confirmed|0                           |1

--- Comment #13 from Dave Fisher <wave@apache.org> 2012-05-08 20:43:59 UTC ---
In your case I have verified that AOO 3.4 does render page 2 and page 3
imperfectly. I tested a Mac version.

In looking at the attached PDF I see that the original is a Word document and
the file was produced by a Mac OS X 10.5.8 Quartz PDFContext and is version 1.6
PDF. There are embedded font subsets of Windows standard fonts.

I extracted the awful page 3 to a separate page Acrobat and the import in AOO
3.4 was just as bad.

It is very much a non-trivial task to re-assemble the text strings from a PDF
into usable text blocks. Remember that the PDF file format was designed as
digital paper.

With your example the next developer who attempts to fix PDF import will have
another example to use.

Meanwhile if you have the original Word document, how does Writer handle that?

-- 
Configure bugmail: https://issues.apache.org/ooo/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.

Mime
View raw message