jackrabbit-dev mailing list archives

From Felix Meschberger <fmesc...@gmail.com>
Subject Re: Remove pooling for text extraction
Date Fri, 12 Sep 2008 11:52:06 GMT
Hi,

-1, because:

  * Thread creation _is_ expensive (mostly due to the
     large memory requirements)
  * Without a pool of threads, limiting the number
     of threads actually created is a problem, and
     this number should definitely be limited.

Thus keep the pools.
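
A minimal sketch of the second point, assuming a modern JDK with java.util.concurrent. This is not the PooledTextExtractor code from JCR-390; the class name BoundedExtractionDemo, the method peakConcurrency and the sleep that stands in for document parsing are all made up for illustration. It only shows that a fixed-size pool caps the number of threads doing extraction work, however many tasks are submitted.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical demo class, not part of Jackrabbit.
class BoundedExtractionDemo {

    /**
     * Runs taskCount dummy "extraction" tasks on a fixed-size pool and
     * returns the highest number of tasks observed running at once.
     */
    static int peakConcurrency(int poolSize, int taskCount) {
        // At most poolSize worker threads are ever created; extra tasks queue up.
        ExecutorService pool = Executors.newFixedThreadPool(poolSize);
        AtomicInteger running = new AtomicInteger();
        AtomicInteger peak = new AtomicInteger();

        for (int i = 0; i < taskCount; i++) {
            pool.execute(() -> {
                int now = running.incrementAndGet();
                peak.accumulateAndGet(now, Math::max); // record the high-water mark
                try {
                    Thread.sleep(20); // assumption: stand-in for parsing a document
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    running.decrementAndGet();
                }
            });
        }

        // Stop accepting new tasks and wait for the queued ones to finish.
        pool.shutdown();
        try {
            if (!pool.awaitTermination(30, TimeUnit.SECONDS)) {
                pool.shutdownNow();
            }
        } catch (InterruptedException e) {
            pool.shutdownNow();
            Thread.currentThread().interrupt();
        }
        return peak.get();
    }

    public static void main(String[] args) {
        System.out.println("peak concurrent tasks = " + peakConcurrency(4, 20));
    }
}
```

With one new thread per task, the same 20 tasks could all run at once, and nothing short of extra bookkeeping would stop that number from growing with the workload.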

Regards
Felix

Jukka Zitting wrote:
> Hi,
> 
> In JCR-390 we added support for text extraction in background threads.
> This was done with the PooledTextExtractor class that maintains a pool
> of threads for this purpose. Do we need that pool, or could we simply
> start a new thread for each new extraction task? That would
> simplify the indexing code.
> 
> The time to start a new thread is probably minimal compared to that of
> parsing a document. And when you're parsing a lot of large documents,
> much of the time is spent waiting for IO so the more concurrent
> threads you have the better throughput you get.
> 
> BR,
> 
> Jukka Zitting
> 
