Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 22698 invoked from network); 20 May 2010 19:58:08 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 20 May 2010 19:58:08 -0000 Received: (qmail 20665 invoked by uid 500); 20 May 2010 19:58:07 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 20647 invoked by uid 500); 20 May 2010 19:58:07 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 20639 invoked by uid 99); 20 May 2010 19:58:07 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 May 2010 19:58:07 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of valerio.schiavoni@gmail.com designates 74.125.82.44 as permitted sender) Received: from [74.125.82.44] (HELO mail-ww0-f44.google.com) (74.125.82.44) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 20 May 2010 19:58:01 +0000 Received: by wwb24 with SMTP id 24so158476wwb.31 for ; Thu, 20 May 2010 12:57:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type; bh=k1Xhd8+dm43sZlw8ZrfTl1zSl7jwEku77enl3dxtViU=; b=hnVsK52v151CBJDy02Fu8vIOsVFwoHhzBUNEVFtsgj5P3cUuZ43tGLzLd9hv4D37GW xO3RIuFgjeq80O9Z/JkWw1Ayrb57aobkFtAeY5kUmPFsNior66uGqpIgpsn+m+PUmou7 sW3ybeDT2lqdkdqg0aSLKa13VAb+zKpmwJSfE= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; b=uQneQE5NFu4rltFKNEvdTACfIPkSC2rjSWyil/5B2SJHEIZqMGU1Y9OHu3lOBH/tLV Uu/O0lwITXNJJxRuu4HowlnMYroXEdGwxMz7FIwwnrumhgCusAV5qqw+ovx+9VuBnXmi YXiu7zx+sMYwu91N14X0WmAFlBLSVB629SmeU= Received: by 10.216.87.201 with SMTP id y51mr225808wee.80.1274385461137; Thu, 20 May 2010 12:57:41 -0700 (PDT) MIME-Version: 1.0 Received: by 10.216.25.71 with HTTP; Thu, 20 May 2010 12:57:21 -0700 (PDT) In-Reply-To: <753D56AD-466A-47B8-9814-94F76DE8C039@gmail.com> References: <753D56AD-466A-47B8-9814-94F76DE8C039@gmail.com> From: Valerio Schiavoni Date: Thu, 20 May 2010 21:57:21 +0200 Message-ID: Subject: Re: real-world dataset from social network? To: user@cassandra.apache.org Content-Type: multipart/alternative; boundary=0016e6d564c2b7d02504870bfc46 X-Virus-Checked: Checked by ClamAV on apache.org --0016e6d564c2b7d02504870bfc46 Content-Type: text/plain; charset=ISO-8859-1 Hello, It's unclear if you're looking for data that can be stored in Cassandra or > an example of someone using Cassandra to store a network; I'm assuming the > former. > You're assuming incorrectly. I'm looking an example of someone using Cassandra to store a graph. > You will have a hard time finding a social network dataset with > relationships already well-defined for free. I have seen crawls of Twitter > before, but IIRC they go for thousands (in USD). > There are free dumps of Twitter datasets ( http://www.public.asu.edu/~mdechoud/datasets.html). >From there, I should somehow 'reverse engineer' those datas to store them in cassandra as they were originally stored. I could synthesize fake datas, store them in cassandra (or similar) and from there continue. But the results would be less attracting than similar ones originated by real datas. thanks for the suggestions. valerio --0016e6d564c2b7d02504870bfc46 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hello,

It's unclear if you're looki= ng for data that can be stored in Cassandra or an example of someone using = Cassandra to store a network; I'm assuming the former.

You're assuming incorrectly. I&#= 39;m looking an example of someone using Cassandra to store a graph.
<= div>=A0
You will have a hard time finding = a social network dataset with relationships already well-defined for free. = =A0I have seen crawls of Twitter before, but IIRC they go for thousands (in= USD).

There are free dumps of Twitter data= sets (http://= www.public.asu.edu/~mdechoud/datasets.html).=A0

>From there, I should somehow 'reverse engineer' those datas to stor= e them in cassandra as they were originally stored.=A0

=
I could synthesize fake datas, store them in cassandra (or similar) an= d from there continue. But the results would be less attracting than simila= r ones originated by real datas.
=A0
thanks for the suggestions.
valerio
--0016e6d564c2b7d02504870bfc46--