lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Massimo Mannino" <mmann...@csc.com>
Subject RE: How to index a Word document
Date Fri, 31 Jan 2003 11:18:25 GMT

POI it' s correct, but use a OLE
If your application running under unix POI it' s incorrect...



                                                                                         
                                             
                      "Ronnie                                                            
                                             
                      Kolehmainen"             To:      "Lucene Users List" <lucene-user@jakarta.apache.org>
                          
                      <ronnie                  cc:                                    
                                                
                      @sunstone.se>            Subject: RE: How to index a Word document
                                              
                                                                                         
                                             
                      31/01/03 11.17                                                     
                                             
                      Please respond                                                     
                                             
                      to "Lucene Users                                                   
                                             
                      List"                                                              
                                             
                                                                                         
                                             
                                                                                         
                                             




I've been using the POI-scratchpad package with a slightly altered (only
interested in the text stuff) WordDocument class for a while.

Results show that approx 50% of the Word documents are parsable with this
package. This is not very good, but imo better than nothing, and yet the
best(?) Java solution.


/Ronnie




> -----Ursprungligt meddelande-----
> Från: Nellai [mailto:ngomathinayagam@eforceglobal.com]
> Skickat: den 31 januari 2003 04:50
> Till: lucene-user@jakarta.apache.org
> Ämne: How to index a Word document
>
>
> Hi!
>
> Can anyone tell me how to include word document for indexing. Is
> there any parser available for that.
>
> Thanks in advance
>
> Nellai...
>


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org






---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message