poi-user mailing list archives

From "Andrew C. Oliver" <acoli...@apache.org>
Subject Re: MS Word Text extracting
Date Tue, 15 Jul 2003 17:02:24 GMT
There is nothing "ready" yet, but there is a new package called hwpf, which
will be what we use going forward.  It's a rethought approach, renamed to
prevent confusion with something else called HDF.


On 7/15/03 12:24 PM, "Information" <info@prec-it.com> wrote:

> Hi all
> I'm about to create a very simple app that parses a Word doc and extracts
> all text paragraphs in sequence, writing the content to an XML text file.
> This sounds trivial with POIFS and the existing HDF methods - I've looked at
> WordDocument.java in the src/org/apache/poi/hdf/extractor folder of the
> scratchpad that more or less does the basis of what I need.
> Before I start coding, in the interest of not reinventing the wheel, has
> anyone coded something newer/simpler that I could use?
> Marc
> -----------
> Marc Barrot
> Precision IT Management, Inc

Andrew C. Oliver
Custom enhancements and Commercial Implementation for Jakarta POI

For Java and Excel, Got POI?
