pdfbox-users mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alessandro Bellini <a.bellini.2...@gmail.com>
Subject Re: International characters only show correctly when form field is selected
Date Fri, 09 Mar 2018 14:21:39 GMT
Oh0u8e

Il 08/Mar/2018 20:42, "Scott Tomer" <scott@tomer.cc> ha scritto:

> All,
>
> I’m new to the list, but tried to search pdfbox-users.markmail.org <
> http://pdfbox-users.markmail.org/> before asking with no luck.
>
> We are using pdfbox to fill in some form fields in an Adobe generated
> template but getting odd results when certain international characters are
> used (some, not all).  When the pdf is first opened, the characters shown
> are basically garbage.  Here is an example: þÿB D Aóz
>
> However, when you click into the field (or in certain readers like Okular
> on Linux choose “Show Forms”), the correct characters are shown.  Here is
> what is inserted into the field and shown when field is selected: ł ń Ł ó ź
>
> It is almost like the PDF has one font selected for a read-only view and
> the correct font for the view when editing a field.
>
> This is happening with Polish, Russian, Chinese and other languages.
>
> This is how I am populating the fields:
>
> PDDocument pdfDoc = LoadPDF.load(cs, document);
> PDDocumentCatalog docCatalog = pdfDoc.getDocumentCatalog();
> PDAcroForm acroForm = docCatalog.getAcroForm();
>
> if (acroForm != null) {
>         for (PDField field : acroForm.getFieldTree()) {
>                 for (PdfField pdfField : pdfFields) {
>                         if (field.getPartialName() != null &&
> field.getPartialName().equalsIgnoreCase(pdfField.getName())) {
>                                 field.setValue(pdfField.getValue());
>                         }
>                 }
>         }
> }
> pdfDoc.save(tempPdf);
> pdfDoc.close();
>
>
> Thanks for any help,
> Scott

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message