pdfbox-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Soocheon Kim (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (PDFBOX-4275) Can't extract slanted text through the parsers of the PDFBox
Date Tue, 31 Jul 2018 14:50:00 GMT

     [ https://issues.apache.org/jira/browse/PDFBOX-4275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Soocheon Kim updated PDFBOX-4275:
---------------------------------
    Attachment: rotation.pdf

> Can't extract slanted text through the parsers of the PDFBox
> ------------------------------------------------------------
>
>                 Key: PDFBOX-4275
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4275
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing, Text extraction
>    Affects Versions: 2.0.10
>         Environment: I tested that in the overried showGlyph() method of my class extending
 PDFStreamEngine, PDFGraphicsStreamEngine or PDFTextStripper.
>            Reporter: Soocheon Kim
>            Priority: Major
>         Attachments: rotation.pdf
>
>
> The PDFBox (StreamEngine) extracts only texts that are rotated by 0, 90, 180 or -90
degrees.
> For example, it can't extract texts rotated by 45 or 60 degrees.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org


Mime
View raw message