pdfbox-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Andreas Lehmkühler (JIRA) <j...@apache.org>
Subject [jira] [Comment Edited] (PDFBOX-4569) Implement an ondemand Parser
Date Tue, 11 Jun 2019 18:09:00 GMT

    [ https://issues.apache.org/jira/browse/PDFBOX-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861293#comment-16861293
] 

Andreas Lehmkühler edited comment on PDFBOX-4569 at 6/11/19 6:08 PM:
---------------------------------------------------------------------

I've adjusted the existing big bang parser to work as ondemand parser. It is still in an alpha
state but all tests are working. It stops loading the objects after the initial parse. All
remaining objects are loaded on demand. I've created a separate [branch|https://svn.apache.org/repos/asf/pdfbox/branches/issue4569/]
of the trunk for the changes.

I'd like to have some early feedback:
- are there any pdfs which won't work any more
- are there any use cases which are broken: merging, multi threading etc.
- are there any performance improvements and if so are those as big as expected
- are there any resource improvements and if so are those as big as expected
- the code isn't ready, but are there any hints to improve it



was (Author: lehmi):
I've adjusted the existing big bang parser to work as ondemand parser. It is still in an alpha
state but all tests are working. It stops loading the objects after the initial parse. All
remaining objects are loaded on demand. I've create a separate [branch|https://svn.apache.org/repos/asf/pdfbox/branches/issue4569/]
for the changes.

I'd like to have some early feedback:
- are there any pdfs which won't work any more
- are there any use cases which are broken: merging, multi threading etc.
- are there any performance improvements and if so are those as big as expected
- are there any resource improvements and if so are those as big as expected
- the code isn't ready, but are there any hints to improve it


> Implement an ondemand Parser
> ----------------------------
>
>                 Key: PDFBOX-4569
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4569
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 3.0.0 PDFBox
>            Reporter: Andreas Lehmkühler
>            Assignee: Andreas Lehmkühler
>            Priority: Major
>             Fix For: 3.0.0 PDFBox
>
>
> There is a need to replace the big bang parser with an ondemand parser



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org


Mime
View raw message