pdfbox-dev mailing list archives

From "Alexandre (JIRA)" <j...@apache.org>
Subject [jira] [Created] (PDFBOX-4215) Get pages from a HTTP stream of a large pdf file
Date Wed, 09 May 2018 16:19:00 GMT
Alexandre created PDFBOX-4215:

             Summary: Get pages from a HTTP stream of a large pdf file
                 Key: PDFBOX-4215
                 URL: https://issues.apache.org/jira/browse/PDFBOX-4215
             Project: PDFBox
          Issue Type: Wish
          Components: Parsing
    Affects Versions: 2.0.9
            Reporter: Alexandre

Hi Apache contributors,

Suppose I have a very large PDF file that I want to split into smaller files. I cannot
load the entire file into memory, and I cannot use the computer's hard disk... too bad :(.

I read that it is not possible to get pages in order from a stream, but that it is feasible
to load random pages by reading line by line and looking for page breaks.

Is this implemented in PDFBox?

Hagd, A.
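
For context on the approach described above: PDF is not a line-oriented format, and pages are normally located through the cross-reference table, whose byte offset is written after the `startxref` keyword at the very end of the file. The Java sketch below is one possible illustration, not a PDFBox feature. It assumes the server honours HTTP Range requests, and the class and method names (`PdfTailReader`, `fetchTail`, `parseStartXref`) are hypothetical. It fetches only the last bytes of the remote file and extracts that offset, using nothing beyond the JDK.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of PDFBox.
final class PdfTailReader {

    /**
     * Fetches the last {@code lastBytes} bytes of a remote file using an
     * HTTP suffix byte range ("Range: bytes=-N").
     * Assumption: the server supports Range requests and answers 206.
     */
    static byte[] fetchTail(String url, int lastBytes) throws IOException {
        HttpURLConnection conn = (HttpURLConnection) new URL(url).openConnection();
        conn.setRequestProperty("Range", "bytes=-" + lastBytes);
        try {
            if (conn.getResponseCode() != HttpURLConnection.HTTP_PARTIAL) {
                throw new IOException("Server did not honour the Range request");
            }
            try (InputStream in = conn.getInputStream()) {
                ByteArrayOutputStream out = new ByteArrayOutputStream();
                byte[] buf = new byte[8192];
                int n;
                while ((n = in.read(buf)) != -1) {
                    out.write(buf, 0, n);
                }
                return out.toByteArray();
            }
        } finally {
            conn.disconnect();
        }
    }

    /**
     * Returns the number that follows the last "startxref" keyword in the
     * given bytes, or -1 if the keyword or the number is missing.
     */
    static long parseStartXref(byte[] tail) {
        // ISO-8859-1 maps every byte to exactly one char, so indexes stay aligned.
        String text = new String(tail, StandardCharsets.ISO_8859_1);
        int key = text.lastIndexOf("startxref");
        if (key < 0) {
            return -1;
        }
        int i = key + "startxref".length();
        while (i < text.length() && Character.isWhitespace(text.charAt(i))) {
            i++;
        }
        int start = i;
        while (i < text.length() && Character.isDigit(text.charAt(i))) {
            i++;
        }
        return i > start ? Long.parseLong(text.substring(start, i)) : -1;
    }
}
```

A call such as `PdfTailReader.parseStartXref(PdfTailReader.fetchTail(url, 1024))` would give the offset of the cross-reference data. Further Range requests could then fetch that data and individual objects; whether this is practical for a given file (for example one that uses cross-reference streams or object streams) is left open here.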

This message was sent by Atlassian JIRA

