poi-dev mailing list archives

From Janick Bernet <jaBer...@swissasp.ch>
Subject Re: HWPF
Date Wed, 08 Oct 2003 08:34:28 GMT
Ryan Ackley wrote:

Thanks for your fast answer!

>http://www.textmining.org has a Word text extractor that uses POIFS
>
>
>----- Original Message ----- 
>From: "Janick Bernet" <jabernet@gmx.ch>
>To: <poi-dev@jakarta.apache.org>
>Sent: Tuesday, October 07, 2003 5:08 PM
>Subject: HWPF
>
>>I wanted to start implementing a .DOC-parser for the Nutch-Project
>>(www.nutch.org) and wanted to use POI for this purpose, but the
>>Word-format-implementation seems not to be ready yet. Now we only need
>>to be able to extract the text without formatting or anything. Is this
>>already possible? If so, could you provide an example how to do this
>>using POI?
>>
>>If not, I'll have to write a parser on my own, and I would gladly provide my
>>work to POI afterwards.
>>
>>Regards
>>
>>Janick <jaBernet@gmx.ch>
>>------------------------
>>http://zap.to/jabernet
>>http://www.swissasp.ch
>>ICQ# 32896520
>>
>>
>>---------------------------------------------------------------------
>>To unsubscribe, e-mail: poi-dev-unsubscribe@jakarta.apache.org
>>For additional commands, e-mail: poi-dev-help@jakarta.apache.org


-- 

Janick Bernet
SwissASP AG

~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~
home: www.swissasp.ch
tel.: +41 (0)52 364 19 43
fax.: +41 (0)52 364 19 93

