lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ryan Ackley" <>
Subject IS NOT POI (was Re: worddoucments search)
Date Tue, 24 Aug 2004 13:56:54 GMT
Go to for a platform independent library to
extract text from Word documents. I wrote 99.99% of the Word component of
POI and all of the library.

 I have seen several discussions and web pages that point to
that say I simply wrap POI classes (For example, the JGuru GAQ ) This is totally false.

* The library is optimized for extracting text. POI is not.
* The libraries supports extracting text from Word 6/95. POI
does not.
* The libraries do not extract deleted text that is still in
the document for the purposes of revision marking. POI does not handle this.

-Ryan Ackley

----- Original Message ----- 
From: "Chandan Tamrakar" <>
To: "Lucene Users List" <>
Sent: Tuesday, August 24, 2004 7:31 AM
Subject: Re: worddoucments search

> please look at Apache POI project.
> Words documents can be extracted using POI apis and later can be indexed.
> regards
> ----- Original Message ----- 
> From: "Santosh" <>
> To: "Lucene Users List" <>
> Sent: Tuesday, August 24, 2004 6:00 PM
> Subject: worddoucments search
> Can lucene be able to search word documents? if so please give me
> information about it
> regards
> Santosh kumar
> -----------------------SOFTPRO DISCLAIMER------------------------------
> Information contained in this E-MAIL and any attachments are
> confidential being  proprietary to SOFTPRO SYSTEMS  is 'privileged'
> and 'confidential'.
> If you are not an intended or authorised recipient of this E-MAIL or
> have received it in error, You are notified that any use, copying or
> dissemination  of the information contained in this E-MAIL in any
> manner whatsoever is strictly prohibited. Please delete it immediately
> and notify the sender by E-MAIL.
> In such a case reading, reproducing, printing or further dissemination
> of this E-MAIL is strictly prohibited and may be unlawful.
> SOFTPRO SYSYTEMS does not REPRESENT or WARRANT that an attachment
> hereto is free from computer viruses or other defects.
> The opinions expressed in this E-MAIL and any ATTACHEMENTS may be
> those of the author and are not necessarily those of SOFTPRO SYSTEMS.
> ------------------------------------------------------------------------
> ---------------------------------------------------------------------
> To unsubscribe, e-mail:
> For additional commands, e-mail:

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message