pdfbox-users mailing list archives

From "Andreas Lehmkühler" <andr...@lehmi.de>
Subject Re: Page number discrepancy
Date Tue, 31 Jul 2012 06:43:35 GMT
Hi,


Leleu Eric <eric.leleu.dev@gmail.com> wrote on 30 July 2012 at 22:50:
> Hi,
>
> PageLabels may help you.
> See section 8.3.1 in the PDF Reference v1.7. [1]
>
> PageLabels are accessible through the Document Catalog.
The test class at [1] below contains a simple example of how to read them.
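
To make the idea concrete, here is a minimal sketch of how page label ranges turn a 0-based PDF page index into the label printed on the page. It does not use PDFBox at all: PageLabelSketch, addRange and labelFor are hypothetical names, and the three ranges in main are only an assumption inferred from the examples in the original question (xi on PDF page 12, 18 on PDF page 43). In a real document the ranges would come from the PageLabels entry of the Document Catalog. The style codes "r" (lowercase roman) and "D" (decimal) follow the /S values described in the PDF Reference.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not part of PDFBox: maps 0-based PDF page
// indices to printed page labels using a table of label ranges.
final class PageLabelSketch {

    // One label range: the numbering style and the number given to
    // the first page of the range.
    private static final class Range {
        final String style; // "r" = lowercase roman, "D" = decimal, "" = no number
        final int start;    // numeric value of the first page in the range

        Range(String style, int start) {
            this.style = style;
            this.start = start;
        }
    }

    // Key: 0-based index of the first page of each range.
    private final TreeMap<Integer, Range> ranges = new TreeMap<>();

    void addRange(int firstPageIndex, String style, int start) {
        ranges.put(firstPageIndex, new Range(style, start));
    }

    String labelFor(int pageIndex) {
        // The range that applies is the one with the greatest start index <= pageIndex.
        Map.Entry<Integer, Range> entry = ranges.floorEntry(pageIndex);
        if (entry == null) {
            // No range covers this page: fall back to the physical page number.
            return String.valueOf(pageIndex + 1);
        }
        Range range = entry.getValue();
        int number = range.start + (pageIndex - entry.getKey());
        switch (range.style) {
            case "r": return toLowerRoman(number);
            case "D": return String.valueOf(number);
            default:  return ""; // unnumbered range, e.g. a cover page
        }
    }

    static String toLowerRoman(int n) {
        int[] values = {1000, 900, 500, 400, 100, 90, 50, 40, 10, 9, 5, 4, 1};
        String[] symbols = {"m", "cm", "d", "cd", "c", "xc", "l", "xl", "x", "ix", "v", "iv", "i"};
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < values.length; i++) {
            while (n >= values[i]) {
                sb.append(symbols[i]);
                n -= values[i];
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Assumed layout: unnumbered cover, roman front matter from
        // PDF page 2, decimal numbering from PDF page 26.
        PageLabelSketch labels = new PageLabelSketch();
        labels.addRange(0, "", 1);
        labels.addRange(1, "r", 1);
        labels.addRange(25, "D", 1);

        System.out.println(labels.labelFor(11)); // PDF page 12
        System.out.println(labels.labelFor(42)); // PDF page 43
    }
}
```

Going the other way (from the "15" in the index to PDF page 40) is the same table read in reverse: find the decimal range and add the offset.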


BR
Andreas Lehmkühler

[1]
http://svn.apache.org/repos/asf/pdfbox/trunk/pdfbox/src/test/java/org/apache/pdfbox/pdmodel/TestPDDocumentCatalog.java


> BR,
> Eric
>
> [1] http://www.adobe.com/devnet/pdf/pdf_reference_archive.html
>
> 2012/7/30 Rachel Arbit <rachel@citypath.com>
>
> > Hi all,
> > I'm using PDFBox on a book in PDF format, and I'm trying to map between terms
> > mentioned in the index and the chapters.
> >
> > As I understand it, there are actually two page numbering schemes. One is
> > the straight page number in the pdf, which counts the cover page, all the
> > introductory pages, etc. The second is the numbering as it appears on the
> > pages of the book, so that all the introductory page numbers are roman
> > numerals, and page 1 is only on page 25 of the PDF.
> > When I look at the PDF using a reader it shows me both numbers, e.g.: xi
> > (12 / 841) or 18 (43 / 841)
> >
> > I'm only managing to get the number of the page in the PDF, and not the
> > number as it's written on the page. I need to be able to get that number
> > because the index uses these numbers, to map terms to pages. E.g. page 15
> > in the index actually means page 40 of the pdf.
> >
> > Anyone have any idea how to get the numbers on the pages? Is that info part
> > of the PDF at all?
> >
> > Thanks in advance!
> > Rachel
> >