Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 7949 invoked from network); 20 May 2010 17:17:53 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 20 May 2010 17:17:53 -0000 Received: (qmail 59809 invoked by uid 500); 20 May 2010 17:17:52 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 59784 invoked by uid 500); 20 May 2010 17:17:52 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 59776 invoked by uid 99); 20 May 2010 17:17:52 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 May 2010 17:17:52 +0000 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=AWL,FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of unclemantis@gmail.com designates 74.125.82.172 as permitted sender) Received: from [74.125.82.172] (HELO mail-wy0-f172.google.com) (74.125.82.172) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 May 2010 17:17:47 +0000 Received: by wyb33 with SMTP id 33so3352wyb.31 for ; Thu, 20 May 2010 10:17:26 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=YreQqnkXYEWfLpFS97LGKEM1Z1LH2nSdFa/3u0cYzC8=; b=S5QlCS7OAK1XStsEuoFalp0WEmSTWFHvP9953Q8h0ZCqxOENmId6nBqEHT/HojPw86 VDjVLI4y7TiANGbRs19kkgAYPbsMubvmmoYXOQFbH246juyE/b9IRjxU9p+Ykto0xWqS Scz+SuAK7yxWftPuBoK+r4akc29voqC2ybwr8= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=MCRLf+daWdYRIxSEKsYt3ynU703vlUKJ+RuD6rab5oO3tBSQCQ8f9FYs+AjY74GvI5 WXv6E+tHak/nSeyCxIKmOF1eRA7DmqiWhqotwSPNHTFUmscsCplIm5DJYZ1coWQg+eJI vfhGZe+MOwEoOlqCrKPs5kKOEInu/K483sMTs= MIME-Version: 1.0 Received: by 10.216.85.195 with SMTP id u45mr103256wee.72.1274375521358; Thu, 20 May 2010 10:12:01 -0700 (PDT) Received: by 10.216.11.196 with HTTP; Thu, 20 May 2010 10:12:01 -0700 (PDT) In-Reply-To: References: Date: Thu, 20 May 2010 12:12:01 -0500 Message-ID: Subject: Re: real-world dataset from social network? From: uncle mantis To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=0016e6d96d9042d436048709ac7a --0016e6d96d9042d436048709ac7a Content-Type: text/plain; charset=UTF-8 MIne is under developement. Sorry I can't help you at the moment :( Regards, Michael On Thu, May 20, 2010 at 12:09 PM, Valerio Schiavoni < valerio.schiavoni@gmail.com> wrote: > Not strictly Facebook. > Any online social network is ok to me, as long as it has a reasonable > number of users and that it's built on top of a schema-less storage system. > > > Are you looking for Facebook stuff? Good luck on getting a data set from >> any real world model. >> >> >> Hello everyone, >>> i'm a phd student looking for some real-world dataset of any social >>> networks built on top of some schema-less storage system. >>> The dataset should at least provide a mean to reconstruct the graph of >>> users. >>> Due to possible sensible informations in the dataset, the dataset can be >>> very possibly anonymized if required, it's not important for my research. >>> >>> Someone on #cassandra provided some dataset of reddit votes : >>> http://www.reddit.com/r/redditdev/comments/bubhl/csv_dump_of_reddit_voting_data/ >>> . >>> This dataset is interesting, but it doesn't provide informations about >>> the graph of users. >>> >> > --0016e6d96d9042d436048709ac7a Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable MIne is under developement. Sorry I can't help you at the moment :(

Regards,

Michael


On Thu, May 20, 2010 at 12:09 PM, Valerio Schiav= oni <va= lerio.schiavoni@gmail.com> wrote:
Not strictly Facebook.=C2=A0=20
Any online social network is ok to me, as long as it has a reasonable = number of users and that it's built on top of a schema-less storage sys= tem.


Are you looking for Facebook stuff? Good luck on getting = a data set from any real world model.
=C2=A0

Hello everyone,= =20
i'm a phd student looking for some real-world dataset of any socia= l networks built on top of some schema-less storage system.=C2=A0
The dataset should at least provide a mean to reconstruct the graph of= users.
Due to possible sensible informations in the dataset, the dataset can = be very possibly anonymized if required, it's not important for my rese= arch.

Someone on #cassandra provided some dataset of reddit votes :=C2=A0http://www.r= eddit.com/r/redditdev/comments/bubhl/csv_dump_of_reddit_voting_data/.
This dataset is interesting, but it doesn't provide informations a= bout the graph of users.
<= /blockquote>


--0016e6d96d9042d436048709ac7a--