lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stephane James Vaucher <vauch...@cirano.qc.ca>
Subject Using Runtime.exec to extract text [Was: Bridge with OO]
Date Tue, 20 Apr 2004 16:08:25 GMT
In case you don't know. Using Runtime.exec() on windows, you need to 
consume the output streams of the application will block. This is not the 
case on linux.

http://www.javaworld.com/javaworld/jw-12-2000/jw-1229-traps.html

In short:
Because some native platforms only provide limited buffer size for 
standard input and output streams, failure to promptly write the input 
stream or read the output stream of the subprocess may cause the 
subprocess to block, and even deadlock.

HTH,
sv

On Tue, 20 Apr 2004, Argyn wrote:

> I've the same requirement. I used antiword, xlhtml and ppthtml on win2k. I 
> called them with Runtime.exec(). There are still problems: all three hang 
> up sometimes. Otherwise, it worked. I indexed several hunderds of 
> thousands files in development mode. I never got into production.
> 
> Argyn
> 
> 
> On Mon, 19 Apr 2004 16:53:41 -0400 (EDT), Stephane James Vaucher 
> <vauchers@cirano.qc.ca> wrote:
> 
> > Actually, the objective would be to use OO to extract text from MSOffice
> > formats. If I read your code correctly, your code should only work with 
> > OO
> > as the docs are in xml.
> >
> > Thanks for the code for OO docs through,
> > sv
> >
> > On Mon, 19 Apr 2004, Mario Ivankovits wrote:
> >
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-user-help@jakarta.apache.org
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org


Mime
View raw message