pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Matthew Sheppard <mshepp...@funnelback.com>
Subject Accessing "alternate text" for an image via PDFBox?
Date Fri, 21 Sep 2012 07:14:23 GMT
Is there some way to extract "alternate text" for a specific image using

I have a PDF file which, as described at
http://www.w3.org/WAI/GL/2011/WD-WCAG20-TECHS-20110621/pdf.html#PDF1, has
had alternate text added to an image. Using PDFBox I can find my way
through the object model to the image itself (a PDXObjectImage)
through PDFDocument.getDocumentCatalog().getAllPages() [iterator]
.getResources.getImages() but I can not see any way to get from the image
itself to the alternate text for it.

A small sample PDF (with a single image which has some alternate text
specified) can be found at

Many thanks in advance to anyone who is able to point me in the right
Matt Sheppard

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message