lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ryan Ackley" <sack...@cfl.rr.com>
Subject Textmining.org IS NOT POI (was Re: worddoucments search)
Date Tue, 24 Aug 2004 13:56:54 GMT
Go to http://www.textmining.org for a platform independent library to
extract text from Word documents. I wrote 99.99% of the Word component of
POI and all of the textmining.org library.

 I have seen several discussions and web pages that point to textmining.org
that say I simply wrap POI classes (For example, the JGuru GAQ
http://www.jguru.com ) This is totally false.

* The textmining.org library is optimized for extracting text. POI is not.
* The textmining.org libraries supports extracting text from Word 6/95. POI
does not.
* The textmining.org libraries do not extract deleted text that is still in
the document for the purposes of revision marking. POI does not handle this.

-Ryan Ackley

----- Original Message ----- 
From: "Chandan Tamrakar" <chandan@ccnep.com.np>
To: "Lucene Users List" <lucene-user@jakarta.apache.org>
Sent: Tuesday, August 24, 2004 7:31 AM
Subject: Re: worddoucments search


> please look at Apache POI project.
> http://jakarta.apache.org
>
> Words documents can be extracted using POI apis and later can be indexed.
>
> regards
>
> ----- Original Message ----- 
> From: "Santosh" <santosh.s@softprosys.com>
> To: "Lucene Users List" <lucene-user@jakarta.apache.org>
> Sent: Tuesday, August 24, 2004 6:00 PM
> Subject: worddoucments search
>
>
> Can lucene be able to search word documents? if so please give me
> information about it
>
> regards
> Santosh kumar
>
>
> -----------------------SOFTPRO DISCLAIMER------------------------------
>
> Information contained in this E-MAIL and any attachments are
> confidential being  proprietary to SOFTPRO SYSTEMS  is 'privileged'
> and 'confidential'.
>
> If you are not an intended or authorised recipient of this E-MAIL or
> have received it in error, You are notified that any use, copying or
> dissemination  of the information contained in this E-MAIL in any
> manner whatsoever is strictly prohibited. Please delete it immediately
> and notify the sender by E-MAIL.
>
> In such a case reading, reproducing, printing or further dissemination
> of this E-MAIL is strictly prohibited and may be unlawful.
>
> SOFTPRO SYSYTEMS does not REPRESENT or WARRANT that an attachment
> hereto is free from computer viruses or other defects.
>
> The opinions expressed in this E-MAIL and any ATTACHEMENTS may be
> those of the author and are not necessarily those of SOFTPRO SYSTEMS.
> ------------------------------------------------------------------------
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
>


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message