poi-user mailing list archives

From "Andrew C. Oliver" <acoli...@apache.org>
Subject Re: MS Word Text extracting
Date Tue, 15 Jul 2003 17:02:24 GMT
There is nothing "ready" yet, but there is a new package called hwpf, which
is what we will use going forward.  It's a rethought approach, renamed to
prevent confusion with something else called HDF.

-Andy

On 7/15/03 12:24 PM, "Information" <info@prec-it.com> wrote:

> Hi all
> 
> I'm about to create a very simple app that parses a Word doc and extracts
> all text paragraphs in sequence, writing the content to an XML text file.
> 
> This sounds trivial with POIFS and the existing HDF methods - I've looked at
> WordDocument.java in the src/org/apache/poi/hdf/extractor folder of the
> scratchpad, which more or less does the basics of what I need.
> 
> Before I start coding, in the interest of not reinventing the wheel, has
> anyone coded something newer/simpler that I could use?
> 
> TIA
> 
> Marc
> 
> 
> -----------
> Marc Barrot
> Precision IT Management, Inc
> 
> 
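
For the XML half of the task described above, a minimal sketch using only
the JDK's StAX writer might look like the following. It assumes the
paragraph strings have already been pulled out of the document by some
other means (HDF, hwpf, or anything else); that extraction step is not
shown. The class name, method names and the <document>/<paragraph>
element names are hypothetical, picked purely for illustration.

```java
import java.io.StringWriter;
import java.util.Arrays;
import java.util.List;
import javax.xml.stream.XMLOutputFactory;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamWriter;

// Hypothetical helper: turns already-extracted paragraph strings into XML.
// Nothing here touches POI; the Word parsing is assumed to happen elsewhere.
class ParagraphXmlWriter {

    // Drop control characters that XML 1.0 does not allow. Word text can
    // carry marks such as 0x07 (cell end) or 0x13-0x15 (field delimiters).
    static String stripControlChars(String text) {
        StringBuilder sb = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c >= 0x20 || c == '\t' || c == '\n' || c == '\r') {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    // Write one <paragraph> element per input string, in sequence.
    static String toXml(List<String> paragraphs) {
        StringWriter out = new StringWriter();
        try {
            XMLStreamWriter xml =
                XMLOutputFactory.newInstance().createXMLStreamWriter(out);
            xml.writeStartDocument("1.0");
            xml.writeStartElement("document");
            for (int i = 0; i < paragraphs.size(); i++) {
                xml.writeStartElement("paragraph");
                xml.writeAttribute("index", String.valueOf(i + 1));
                // writeCharacters escapes &, < and > for us.
                xml.writeCharacters(stripControlChars(paragraphs.get(i)));
                xml.writeEndElement();
            }
            xml.writeEndElement();
            xml.writeEndDocument();
            xml.close();
        } catch (XMLStreamException e) {
            throw new IllegalStateException("could not write XML", e);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(
            toXml(Arrays.asList("First paragraph.", "Tom & Jerry <draft>")));
    }
}
```

Letting the stream writer do the escaping avoids hand-rolled string
concatenation, which tends to break on the first ampersand in the text.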

-- 
Andrew C. Oliver
http://www.superlinksoftware.com/poi.jsp
Custom enhancements and Commercial Implementation for Jakarta POI

http://jakarta.apache.org/poi
For Java and Excel, Got POI?

