pdfbox-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Tilman Hausherr (JIRA)" <j...@apache.org>
Subject [jira] [Closed] (PDFBOX-4311) Unable to parse some pdf's using pdfbox.
Date Tue, 04 Sep 2018 18:57:00 GMT

     [ https://issues.apache.org/jira/browse/PDFBOX-4311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Tilman Hausherr closed PDFBOX-4311.
-----------------------------------
    Resolution: Not A Bug

Closing because this isn't a bug. You can still comment or reopen.

> Unable to parse some pdf's using pdfbox.
> ----------------------------------------
>
>                 Key: PDFBOX-4311
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4311
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 2.0.9
>         Environment: Pdfbox -2.0.9
> Pdfbox-tools - 2.0.9
> Java - 1.7
> Scala - 2.10.6
>            Reporter: Krishna Dheeraj
>            Priority: Major
>         Attachments: upload_user4024353_claimnr283909709_healthpartners_2018-06-17.pdf
>
>
> When I tried to convert the PDF file into HTML for parsing the content in the body is
empty and there are no errors or exceptions thrown. It is happening for only few files, others
are are working as expected. I am attaching the file which we are unable to parse. Let us
know know in case of any resolutions are avilable.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org


Mime
View raw message