lucene-solr-user mailing list archives

From Yuval Feinstein
Subject RE: Distributed Indexing
Date Thu, 08 Jul 2010 13:22:33 GMT
As far as I know, you still have to do this part yourself.
A possible way to shard is to number the shards from 0 to numShards-1,
compute hash(uniqueKey) % numShards for each document,
and send the document to the shard with the resulting number.
The mapping is deterministic, so an updated document always goes to the
shard that already holds it, and with a well-mixing hash the documents
are spread roughly uniformly across the shards.
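
The routing rule above can be sketched in a few lines of Python. This is only
an illustration, not Solr code: the function name shard_for and the choice of
MD5 as the hash are assumptions made for the example. A digest is used instead
of Python's built-in hash() because the latter is randomized per process for
strings, which would break the "same key, same shard" property.

```python
import hashlib


def shard_for(unique_key: str, num_shards: int) -> int:
    """Map a document's uniqueKey to a shard number in [0, num_shards).

    Hypothetical helper for illustration; MD5 is used only as a stable,
    well-mixing hash, not for security.
    """
    if num_shards <= 0:
        raise ValueError("num_shards must be positive")
    digest = hashlib.md5(unique_key.encode("utf-8")).digest()
    # Interpret the first 8 bytes as an unsigned integer, then reduce
    # modulo the shard count. The result is never negative.
    return int.from_bytes(digest[:8], "big") % num_shards


if __name__ == "__main__":
    # Example URLs are made up; each one always maps to the same shard.
    for url in ["http://example.com/a", "http://example.com/b"]:
        print(url, "-> shard", shard_for(url, 4))
```

Note that changing numShards changes the mapping for most keys, so with this
simple scheme adding a shard generally means reindexing.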
-- Yuval

-----Original Message-----
From: Li Li
Sent: Thursday, July 08, 2010 2:44 PM
Subject: Distributed Indexing

    Are there any tools for "Distributed Indexing"? It refers to
KattaIntegration and ZooKeeperIntegration in
    But it seems that they are more concerned with error handling and
replication. I need a dispatcher that routes docs by
uniqueKey (such as url) to different machines, so that when a doc is
updated, it is sent to the machine that already holds that url. I also
need the docs to be spread randomly across all the machines, so that
when I do a distributed search the idfs on the different machines are
similar, because the current distributed search uses local idfs.
