incubator-couchdb-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paul Davis <paul.joseph.da...@gmail.com>
Subject Re: Best way to store 2^32 IPs in CouchDB
Date Mon, 01 Feb 2010 19:07:56 GMT
On Mon, Feb 1, 2010 at 1:50 PM, Santi Saez <santisaez@woop.es> wrote:
> El 01/02/10 17:56, Paul Davis escribió:
>
> Dear Paul,
>
>> Well, 2^32 of anything is 4GiB per byte stored. So, minimum of four
>> bytes and you're at 16GiB. Even with just 1KiB overhead you're at
>> 4TiB.
>>
>> I'm left wondering why you would want to store a list of numbers in
>> the first place.
>
> Imagine a service like Netcraft.
>
> I know that there aren't 2^32 active servers, but I wanted to test it with
> 4.3 billion documents and stress/benchamark CouchDB with other DBs.
>
> Regards,
>
> --
> Santi Saez
> http://woop.es
>

If you're looking for benchmark data I'd also suggest something like
the Enron email dataset. I can't imagine 4.3 billion integer documents
is going to be very informative about real world usage. The Wikipedia
abstracts data set another candidate as well.

HTH,
Paul Davis

Mime
View raw message