lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Victor Hadianto <vict...@nuix.com.au>
Subject Re: PDF / Word document parsers
Date Fri, 19 Apr 2002 07:10:41 GMT
> >To parse word
> > document you can have a look for OpenOffice. You can start OpenOffice to
> > receive a socket connection. From your Java app, you open a connection to
> > OpenOffice (using OpenOffice SDK), send the word document and it will
> convert it to text.
>
> That's actually quite a novel idea. I haven't tried it, is it complicated
> to communicate with OpenOffice?
>

It's a bit finnicky but fortunately there are examples how to do this. 

Java-OpenOffice page: http://udk.openoffice.org/java/man/index.html
OpenOffice API: http://api.openoffice.org/
Samples: 
http://api.openoffice.org/unbranded-source/browse/~checkout~/api/odk/examples/examples.html


-- 
Victor Hadianto
---------------
God, I ask for patience -- and I want it right now!

--
To unsubscribe, e-mail:   <mailto:lucene-user-unsubscribe@jakarta.apache.org>
For additional commands, e-mail: <mailto:lucene-user-help@jakarta.apache.org>


Mime
View raw message