lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jason Rutherglen <jason.rutherg...@gmail.com>
Subject Re: Is Lucene a good choice for PB scale mailbox search?
Date Tue, 24 Nov 2009 05:41:06 GMT
A sharded architecture (i.e. smaller indexes) used by Google for
example and implemented by open source in the Katta project may be
best for scaling to sizable levels.  Katta is also useful for
redundancy and fault tolerance.

On Mon, Nov 23, 2009 at 6:35 PM, fulin tang <tangfulin@gmail.com> wrote:
> We are going to add full-text search for our mailbox service .
>
> The problem is we have more than 1 PB mails there , and obviously we
> don't want to add another PB storage for search service , so we hope
> the index data will be small enough for storage while the search keeps
> fast .
>
> The lucky is that every user just search with mails of their own , so
> we can split the data into a lot of indexes instead of keeping them in
> a big one .
>
> So, after all these concerns ,  the question is , is lucene a good
> choice for this ? or which is the right way to do this ? Does anyone
> have done this  before ?
>
> All opinions and comments are welcome !
>
> fulin
>
>
> --
> 梦的开始挣扎于城市的边缘
> 心的远方执着在脚步的瞬间
> 我的宿命埋藏了寂寞的永远
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Mime
View raw message