pdfbox-users mailing list archives

From Javier García Sánchez <javig...@gmail.com>
Subject PDFBox PDAcroForm Fields
Date Thu, 10 Jul 2014 12:06:54 GMT

I'm developing a tool that reads a PDF file with PDFBox and returns the
names of all the inputs of a PDF form.

PDDocument doc = PDDocument.load( "C:\\2\\NDA.pdf", false );
PDDocumentCatalog docCatalog = doc.getDocumentCatalog();
PDAcroForm acroForm = docCatalog.getAcroForm();
List fields = acroForm.getFields();
The entries in fields are not really the fields of the form. Their names
are split on '.', so to get a field's full name you have to recurse through
the kids until kids == null.
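
To illustrate the recursion described above, here is a minimal sketch that
does not call PDFBox at all. FormNode, partialName, and collectNames are
hypothetical names invented for this example; they only model a field tree
in which each node carries one dot-free name segment. The sketch also
assumes a terminal field has an empty list of kids, whereas the text above
describes the check as kids == null.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class FieldNames {
    // Hypothetical stand-in for a form field node; this is NOT a PDFBox class.
    static class FormNode {
        final String partialName;                        // one dot-free name segment
        final List<FormNode> kids = new ArrayList<>();   // empty for a terminal field

        FormNode(String partialName, FormNode... kids) {
            this.partialName = partialName;
            this.kids.addAll(Arrays.asList(kids));
        }
    }

    // Returns the dot-joined names of all terminal fields below the given roots.
    static List<String> collectNames(List<FormNode> roots) {
        List<String> names = new ArrayList<>();
        for (FormNode root : roots) {
            collect(root, "", names);
        }
        return names;
    }

    // Depth-first walk: extend the prefix with this node's segment, then
    // either record the name (no kids) or descend into each kid.
    private static void collect(FormNode node, String prefix, List<String> out) {
        String fullName = prefix.isEmpty()
                ? node.partialName
                : prefix + "." + node.partialName;
        if (node.kids.isEmpty()) {
            out.add(fullName);
        } else {
            for (FormNode kid : node.kids) {
                collect(kid, fullName, out);
            }
        }
    }

    public static void main(String[] args) {
        List<FormNode> roots = Arrays.asList(
                new FormNode("form1",
                        new FormNode("name"),
                        new FormNode("address",
                                new FormNode("street"),
                                new FormNode("city"))),
                new FormNode("signature"));
        for (String name : collectNames(roots)) {
            System.out.println(name);
        }
    }
}
```

With the made-up tree in main, the names collected are form1.name,
form1.address.street, form1.address.city, and signature: one entry per
input, with the intermediate nodes contributing only name segments.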

Why does PDAcroForm.getFields() return a list of fields in which each
input's name has been split on '.' (period)? It forces the caller to
recurse through the fields down to the final children (kids) just to get
the IDs of the form's inputs.

I mean, the inputs of a form are atomic units; they don't need to be
split on anything. Does this design make sense?

Thanks in advance.

Regards,
Javier García Sánchez
