Message-ID: <4579C959.3070109@apache.org>
Date: Fri, 08 Dec 2006 12:21:45 -0800
From: Doug Cutting <cutting@apache.org>
To: java-dev@lucene.apache.org
Subject: Re: Spliting the Lucene

howard chen wrote:
> Can you suggest if using Hadoop + Lucene, how to make a simple
> distributed indexing & searching program, i.e. what are the mapping /
> reducing processes involved in both indexing and searching?

There is not yet a universal best practice for this.

Nutch provides an example of how to use Lucene for distributed indexing.
Nutch's current distributed search implementation builds on Hadoop's
RPC mechanism, but is not based on Hadoop's MapReduce.

http://lucene.apache.org/nutch/apidocs/org/apache/nutch/searcher/DistributedSearch.html
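
To illustrate the general shape of an RPC-style (non-MapReduce)
distributed search, here is a minimal sketch: the query is sent to every
index shard, each shard returns its local top hits, and the client
merges them into a global top-K list. This is not Nutch's code. The
names ScatterGatherSketch, Shard, Hit and search are hypothetical, and
the Shard interface merely stands in for a remote search server.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch: none of these names come from Nutch, Hadoop or Lucene.
class ScatterGatherSketch {

    /** One scored hit, tagged with the shard that produced it. */
    static final class Hit {
        final String shard;
        final int doc;
        final float score;

        Hit(String shard, int doc, float score) {
            this.shard = shard;
            this.doc = doc;
            this.score = score;
        }
    }

    /** Stand-in for a remote search server that would be reached over RPC. */
    interface Shard {
        List<Hit> search(String query, int topK);
    }

    /** Ask every shard for its local top K, then keep the global top K by score. */
    static List<Hit> search(List<Shard> shards, String query, int topK) {
        List<Hit> all = new ArrayList<>();
        for (Shard shard : shards) {
            // Sequential here for simplicity; a real client would query shards in parallel.
            all.addAll(shard.search(query, topK));
        }
        // Highest score first.
        all.sort(Comparator.comparingDouble((Hit h) -> h.score).reversed());
        return new ArrayList<>(all.subList(0, Math.min(topK, all.size())));
    }
}
```

Merging raw scores like this only makes sense if the shards score
documents comparably, which is an assumption of the sketch.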

There has been some discussion of MapReduce-based distributed search on 
the Nutch lists, e.g.:

http://mail-archives.apache.org/mod_mbox/lucene-nutch-user/200604.mbox/%3C4448063D.8050406@apache.org%3E

I think Andrzej Bialecki has explored this approach to some extent.

Another approach is to build a non-MapReduce-based system specifically 
for supporting distributed search and indexing.  I started a discussion 
about this a few months ago and hope to start work on this project 
before long.

http://www.nabble.com/-PROPOSAL--index-server-project-tf2469695.html

Doug

