pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Rachel Arbit <rac...@citypath.com>
Subject Re: Page number discrepancy
Date Tue, 31 Jul 2012 07:18:52 GMT
Indeed it works :)
Thanks so much!

On 31 July 2012 09:43, "Andreas Lehmkühler" <andreas@lehmi.de> wrote:

> Hi,
>
>
> Leleu Eric <eric.leleu.dev@gmail.com> hat am 30. Juli 2012 um 22:50
> geschrieben:> Hi,
> >
> > PageLabels may help you.
> > See section §8.3.1 in the PDF Reference v1.7. [1]
> >
> > PageLabels are accessible through the Document Catalog.
> [1] contains a simple example on how to do that.
>
>
> BR
> Andreas Lehmkühler
>
> [1]
>
> http://svn.apache.org/repos/asf/pdfbox/trunk/pdfbox/src/test/java/org/apache/pdfbox/pdmodel/TestPDDocumentCatalog.java
>
>
> > BR,
> > Eric
> >
> > [1] http://www.adobe.com/devnet/pdf/pdf_reference_archive.html
> >
> > 2012/7/30 Rachel Arbit <rachel@citypath.com>
> >
> > > Hi all,
> > > I'm using PDFbox on an book in PDF format, I'm trying to map between
> terms
> > > mentioned in the index and the chapters.
> > >
> > > As I understand it, there are actually two page numbering schemes. One
> is
> > > the straight page number in the pdf, which counts the cover page, all
> the
> > > introductory pages, etc. The second is the numbering as it appears on
> the
> > > pages of the book, so that all the introductory page numbers are roman
> > > numerals, and page 1 is only on page 25 of the PDF.
> > > When I look at the PDF using a reader it shows me both numbers, e.g.:
> xi
> > > (12 / 841) or 18 (43 / 841)
> > >
> > > I'm only managing to get the number of the page in the PDF, and not the
> > > number as it's written on the page. I need to be able to get that
> number
> > > because the index uses these numbers, to map terms to pages. E.g. page
> 15
> > > in the index actually means page 40 of the pdf.
> > >
> > > Anyone have any idea how to get the numbers on the pages? Is that info
> part
> > > of the PDF at all?
> > >
> > > Thanks in advance!
> > > Rachel
> > >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message