Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 2640 invoked from network); 20 May 2010 17:08:00 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 20 May 2010 17:08:00 -0000 Received: (qmail 41690 invoked by uid 500); 20 May 2010 17:07:59 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 41573 invoked by uid 500); 20 May 2010 17:07:59 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 41565 invoked by uid 99); 20 May 2010 17:07:59 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 May 2010 17:07:59 +0000 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=AWL,FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of unclemantis@gmail.com designates 74.125.82.44 as permitted sender) Received: from [74.125.82.44] (HELO mail-ww0-f44.google.com) (74.125.82.44) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 May 2010 17:07:54 +0000 Received: by wwb24 with SMTP id 24so23914wwb.31 for ; Thu, 20 May 2010 10:07:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=Y+USAl8xjeSXmBc+OtskjF/LTYVm2UbrKS+fF0sWQ0g=; b=dCQZXbCf3x3sP7u1m9owUWi1fWX59L4f2QQ6vkgTCT+gkus+5Pu5nDnOqRpEBWOziI 4Ni9cDS8qz7x4Kf8nBVNkVxqCAR26B0aGu2GnPWakGzXBujqNBQ6gmay7L8I2dYVOeQZ vvJXeQH/oW9901/0sEh00K/RX74woP2PDjitw= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=AWsb/YMLQMHR0MxDGe4CEnnfUkTqNxYtvIzGieWPOD6eZHnYWc+FGJaJI0JV3O05Ec n9yNVbvWcsfvWvM8ez3EN33KeWPHTLE8JXm/PmLxYIAfarn5F0UNrLV+XSQleJSLEnpX jM/ofUE9raRQb+9H70rRBAP31S/XPW/UlN/JM= MIME-Version: 1.0 Received: by 10.216.187.131 with SMTP id y3mr107226wem.34.1274375252808; Thu, 20 May 2010 10:07:32 -0700 (PDT) Received: by 10.216.11.196 with HTTP; Thu, 20 May 2010 10:07:32 -0700 (PDT) In-Reply-To: References: Date: Thu, 20 May 2010 12:07:32 -0500 Message-ID: Subject: Re: real-world dataset from social network? From: uncle mantis To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=0016364ef17641153f0487099c70 --0016364ef17641153f0487099c70 Content-Type: text/plain; charset=UTF-8 Are you looking for Facebook stuff? Good luck on getting a data set from any real world model. Regards, Michael On Thu, May 20, 2010 at 11:53 AM, Valerio Schiavoni < valerio.schiavoni@gmail.com> wrote: > Hello everyone, > i'm a phd student looking for some real-world dataset of any social > networks built on top of some schema-less storage system. > The dataset should at least provide a mean to reconstruct the graph of > users. > Due to possible sensible informations in the dataset, the dataset can be > very possibly anonymized if required, it's not important for my research. > > Someone on #cassandra provided some dataset of reddit votes : > http://www.reddit.com/r/redditdev/comments/bubhl/csv_dump_of_reddit_voting_data/ > . > This dataset is interesting, but it doesn't provide informations about the > graph of users. > > Thanks for any help you might provide. > > Best Regards. > > Valerio Schiavoni > PhD student > > University of Neuchatel > Switzerland > > --0016364ef17641153f0487099c70 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Are you looking for Facebook stuff? Good luck on getting a data set from an= y real world model.

Regards,

Michael

On Thu, May 20, 2010 at 11:53 AM, Valerio Schiav= oni <va= lerio.schiavoni@gmail.com> wrote:
Hello everyone,= =20
i'm a phd student looking for some real-world dataset of any socia= l networks built on top of some schema-less storage system.=C2=A0
The dataset should at least provide a mean to reconstruct the graph of= users.
Due to possible sensible informations in the dataset, the dataset can = be very possibly anonymized if required, it's not important for my rese= arch.

Someone on #cassandra provided some dataset of reddit votes :=C2=A0http://www.r= eddit.com/r/redditdev/comments/bubhl/csv_dump_of_reddit_voting_data/.
This dataset is interesting, but it doesn't provide informations a= bout the graph of users.

Thanks for any help you might provide.

Best Regards.

Valerio Schiavoni
PhD student

University of Neuchatel
Switzerland


--0016364ef17641153f0487099c70--