poi-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Angelo zerr <angelo.z...@gmail.com>
Subject POI XWPF converters : docx->xhtml and docx->pdf converters
Date Fri, 12 Oct 2012 09:09:30 GMT
Hi POI Team,

I contact you because we have developped 2 docx converters based on POI (on
other words XWPFDocument converter)  in our
XDocReport<http://code.google.com/p/xdocreport/>project :

   1. *docx->xhtml* converter : this converter loads a docx in the
   POI XWPFDocument and loop for each structures of the docuiment
   (XWPFParagraph, XWPFTable etc ) and generates html content with SAX (and
   not with DOM like you have done with your doc->html converter). Using SAX
   gives you the capability to merge several docx converted in html in the
   same page by using some SAX pipelines.
   2. *docx->pdf* converter : I'm not sure that you will be interest with
   this converter because it is based on iText and not FOP. Why iText? Because
   it's more fast to create PDF structures directly although to generate FO
   content and parse it to generate PDF structures with FOP. Our goal was to
   provide a very fast docx->pdf converter.

Those converters are not finished (we are improving it) but I think they
can be used. we have managed complex styles too (ex: indentation paragraph
linked to StyleA definied is retrieved from StyleB where StyleA extends
StyleB, manage tblLook to for set style for firstRow, lastRow etc).
Today it exists the 0.9.8 release but the result is very bad. The 1.0.0
will improve a lot the converters.

If you want you can test our converter 1.0.0 in our live demo at

If you want see sources :

   - XWPF core converter :
   - XWPF xhtml converter :
   - XWPF pdf converter:

Hope you will like it our docx converters.

Regards Angelo

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message