In-Reply-To: <4A6746F1.7080906@informatics.jax.org>
References: <84fb38e30907220959n40398f04rab899b3bb4ad5a18@mail.gmail.com> <4A6746F1.7080906@informatics.jax.org>
Date: Wed, 22 Jul 2009 12:41:39 -0500
Message-ID: <84fb38e30907221041o2c02ed7co686eb0d24e1dceb3@mail.gmail.com>
Subject: Re: Batch searching
From: tsuraan
To: java-user@lucene.apache.org

> If you did this, wouldn't you be binding the processing of the results
> of all queries to that of the slowest performing one within the collection?

I would imagine it would, but I haven't seen too much variance between
lucene query speeds in our data.

> I'm guessing you are trying for some sort of performance benefit by
> batch processing, but I question whether or not you will actually get
> more performance by performing your queries in a threaded type
> environment, and then processing their results as they come in.
>
> Could you give a bit more description about what you are actually trying
> to accomplish, I'm sure this list could help better if we had more
> information.

What I'd like to do is build lots of small indices (a few thousand
documents per index) and put them into HDFS for search distribution.
We already have our own map-reduce framework for searching, but HDFS
seems to be a really good fit for an actual storage mechanism.  My
concern is that when we have one searcher using thousands of
HDFS-backed indices, the seeking might get a bit nasty.
HDFS apparently has pretty good seeking performance, but it really
looks like it was designed for streaming, so if I could make my
searches use sequential index access, I would expect better
performance than having a ton of simultaneous searches making HDFS
seek all over the place.
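
A minimal sketch of the access pattern described above, assuming a batch
of simple term queries: each small index is visited exactly once, in
order, and every query in the batch is evaluated against it before the
next index is touched. Everything here is hypothetical and in-memory.
SmallIndex and searchBatch are illustrative names, not Lucene or HDFS
APIs, and substring matching stands in for a real query.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: none of these names come from Lucene or HDFS.
class BatchSearchSketch {

    /** Stand-in for one small index; a real one would wrap an index reader. */
    static final class SmallIndex {
        final String name;
        final List<String> docs; // each document is just its text here

        SmallIndex(String name, List<String> docs) {
            this.name = name;
            this.docs = docs;
        }
    }

    /**
     * Visits every index once, in list order, and runs the whole batch of
     * terms against each document before moving on. Returns, per term,
     * the matching documents as "indexName:docId".
     */
    static Map<String, List<String>> searchBatch(List<SmallIndex> indices,
                                                 List<String> terms) {
        Map<String, List<String>> hits = new LinkedHashMap<>();
        for (String term : terms) {
            hits.put(term, new ArrayList<>());
        }
        for (SmallIndex index : indices) {                 // one sequential pass per index
            for (int docId = 0; docId < index.docs.size(); docId++) {
                String text = index.docs.get(docId);
                for (Map.Entry<String, List<String>> e : hits.entrySet()) {
                    if (text.contains(e.getKey())) {       // all queries share this read
                        e.getValue().add(index.name + ":" + docId);
                    }
                }
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        List<SmallIndex> indices = List.of(
            new SmallIndex("a", List.of("lucene index", "hdfs streaming read")),
            new SmallIndex("b", List.of("hdfs seek cost", "map reduce")));
        // prints {hdfs=[a:1, b:0], seek=[b:0]}
        System.out.println(searchBatch(indices, List.of("hdfs", "seek")));
    }
}
```

The trade-off raised in the quoted reply still applies: no query in the
batch is finished until the last index has been read, so the batch is
only as fast as its slowest member.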