Date: Tue, 28 Sep 2010 10:32:29 -0400
Subject: Re: Indexing and threads
From: Michael McCandless <lucene@mikemccandless.com>
To: dev@lucene.apache.org

I can't speak to how Solr handles threads, but in Lucene the two docs
are indexed concurrently.

Internally, Lucene's IndexWriter has separate thread states that hold
the RAM buffer of the inverted docs.  Multiple threads work on
separate thread states concurrently.

The one big exception is flushing a new segment, which is
currently single-threaded and can be quite a bottleneck (I wrote about
this problem at
http://chbits.blogspot.com/2010/09/lucenes-indexing-is-fast.html).
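
To make the two paragraphs above concrete, here is a toy model in plain
Java. It is not Lucene's code: the class ThreadStateSketch and its
methods addDocument, flush and run are hypothetical names invented for
illustration, and the "inversion" is faked with a string prefix. It only
shows the shape of the design, assuming one private buffer per thread
and one serialized flush step.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Toy model only, NOT Lucene's implementation. All names are hypothetical.
class ThreadStateSketch {
    // One private buffer per indexing thread, so adds never share a list.
    private final Map<Thread, List<String>> threadStates = new ConcurrentHashMap<>();
    // Stand-in for the segments that flush() would write to disk.
    private final List<List<String>> segments = new ArrayList<>();

    // Safe to call from many threads: each thread touches only its own buffer.
    void addDocument(String doc) {
        threadStates
            .computeIfAbsent(Thread.currentThread(), t -> new ArrayList<>())
            .add("inverted:" + doc);
    }

    // The serial step: one caller at a time drains every buffer into a segment.
    // In this sketch it must only be called once the indexing threads are done.
    synchronized int flush() {
        List<String> segment = new ArrayList<>();
        for (List<String> buffer : threadStates.values()) {
            segment.addAll(buffer);
        }
        threadStates.clear();
        segments.add(segment);
        return segment.size();
    }

    // Indexes docsPerThread toy documents on each thread, then flushes once.
    // Returns the number of documents in the flushed segment.
    static int run(int threads, int docsPerThread) {
        ThreadStateSketch writer = new ThreadStateSketch();
        List<Thread> workers = new ArrayList<>();
        for (int t = 0; t < threads; t++) {
            final int id = t;
            Thread worker = new Thread(() -> {
                for (int d = 0; d < docsPerThread; d++) {
                    writer.addDocument("doc-" + id + "-" + d);
                }
            });
            workers.add(worker);
            worker.start();
        }
        try {
            // join() also makes each worker's buffered docs visible to flush().
            for (Thread worker : workers) {
                worker.join();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while indexing", e);
        }
        return writer.flush();
    }

    public static void main(String[] args) {
        System.out.println(run(2, 1000) + " docs flushed into one segment");
    }
}
```

Calling ThreadStateSketch.run(2, 1000) buffers 1000 toy documents on
each of two threads and returns the size of the single flushed segment.
The adds never contend with each other; the synchronized flush is the
one step every thread would have to wait on.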

Mike

On Tue, Sep 28, 2010 at 9:32 AM, Jan Høydahl / Cominvent
<jan.asf@cominvent.com> wrote:
> Hi,
>
> How are threads being used when indexing?
>
> Let's say documents A and B are ingested in parallel to XMLUpdateRequestHandler in two separate threads.
> How far down the chain is the processing of these done in the two separate threads?
> Is the full UpdateRequestChain run in the same thread as the incoming request?
> Is analysis done in the request thread or in a single indexing thread?
> Are ADDs added to the same "commit queue", and is everything from COMMIT down to Lucene segment building then single-threaded?
>
> --
> Jan Høydahl, search solution architect
> Cominvent AS - www.cominvent.com
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
> For additional commands, e-mail: dev-help@lucene.apache.org