Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 29622 invoked from network); 3 Mar 2011 21:03:17 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 3 Mar 2011 21:03:17 -0000 Received: (qmail 80768 invoked by uid 500); 3 Mar 2011 21:03:14 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 80732 invoked by uid 500); 3 Mar 2011 21:03:14 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 80694 invoked by uid 99); 3 Mar 2011 21:03:14 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 Mar 2011 21:03:14 +0000 X-ASF-Spam-Status: No, hits=2.8 required=5.0 tests=FREEMAIL_FROM,HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL,URI_HEX X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of dan.kuebrich@gmail.com designates 209.85.213.44 as permitted sender) Received: from [209.85.213.44] (HELO mail-yw0-f44.google.com) (209.85.213.44) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 03 Mar 2011 21:03:08 +0000 Received: by ywi6 with SMTP id 6so621731ywi.31 for ; Thu, 03 Mar 2011 13:02:47 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc:content-type; bh=jdSKiflFIOAcxfmD7+rj7VWNTbeK5BTVcFszXlBOo0A=; b=nWv5hfSluerARN7TWndGM71WakhCDLHdhXs7z/IQqXl901HkZUnvXaCGnTnRAG70/P r5fYSfhWG2+8JKWgZ6hyXesI1Xumv1UXi7xGVr0Hc6QrRPT25HbPqVxM4XVKx4UGgnYc R7rV4dtt9NykG/s+WIyznGa7wVB68cb/CbEfg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-type; b=DX7GHxfzmGy4GoSIH0AHDyaYeJHMwy5aDhj0ijSrVgmWnkxQ40szhKf+pxVm9MkT3A KB/NkWSH463sp9WZI62IHz0j2uBEYAXmHgyE7LitxBWMZV0b6EeCQcg8fA0PpNvzQ9Xf vOuXXcGMtXlMktniAsNiDwWa7czH1BrImKEDI= Received: by 10.151.84.8 with SMTP id m8mr2367827ybl.258.1299186167509; Thu, 03 Mar 2011 13:02:47 -0800 (PST) MIME-Version: 1.0 Received: by 10.151.26.8 with HTTP; Thu, 3 Mar 2011 13:02:24 -0800 (PST) In-Reply-To: <1299185050415-6086307.post@n2.nabble.com> References: <1299005018456-6078278.post@n2.nabble.com> <1299010248308-6078542.post@n2.nabble.com> <1299018977921-6078983.post@n2.nabble.com> <1299088026161-6081940.post@n2.nabble.com> <1299181763520-6086135.post@n2.nabble.com> <1299185050415-6086307.post@n2.nabble.com> From: Dan Kuebrich Date: Thu, 3 Mar 2011 16:02:24 -0500 Message-ID: Subject: Re: Storing photos, images, docs etc. To: user@cassandra.apache.org Cc: mcasandra , cassandra-user@incubator.apache.org Content-Type: multipart/alternative; boundary=000e0cd59cb802f369049d9a5a39 X-Virus-Checked: Checked by ClamAV on apache.org --000e0cd59cb802f369049d9a5a39 Content-Type: text/plain; charset=ISO-8859-1 It's still maintained: https://github.com/mogilefs/ . I don't have a good sense of the community, though we did use it at my last job. On Thu, Mar 3, 2011 at 3:44 PM, mcasandra wrote: > Well it's not just metadata that I need to store but also Username, > profiles, > followers etc. What I meant was store the location of the images along with > other information that I described above. And when user queries them then > pull it from the file sytem. > > Most of the high volume sites (facebook, flickr, Digg etc.) currently seem > to be storing location/URI in the DB and actual blobs(images/docs) etc. on > the distributed file system. Some have written on their own and some are > using MogileFS, Lustre etc. > > Can't use S3 since the requirement is to keep everything within the network > so that it's secure and under control. > > Initially I thought Cassandra could be used for both row data and as well > as > large files. But from what I've read and suggestions that I've got it looks > like I need to look at distributed file system which is fault taulrent and > also scales well. After reading online I can come up with only few options > like lustre, glusterfs and mogileFS to store large files. As you mentioned > lustreFS needs kernel tweaking/volume creation etc. Still trying to read > more about glusterFS. And mogileFS last updated date on the site is back in > 2010 so not sure if it's still widely supported (in case of issues :)). > > > -- > View this message in context: > http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Storing-photos-images-docs-etc-tp6078278p6086307.html > Sent from the cassandra-user@incubator.apache.org mailing list archive at > Nabble.com. > --000e0cd59cb802f369049d9a5a39 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable It's still maintained:=A0https= ://github.com/mogilefs/ . =A0I don't have a good sense of the commu= nity, though we did use it at my last job.

On Thu, Mar 3, 2011 at 3:44 PM, mcasandra <mohitanchlia@gmail.com> wr= ote:
Well it's not just metadata that I need= to store but also Username, profiles,
followers etc. What I meant was store the location of the images along with=
other information that I described above. And when user queries them then pull it from the file sytem.

Most of the high volume sites (facebook, flickr, Digg etc.) currently seem<= br> to be storing location/URI in the DB and actual blobs(images/docs) etc. on<= br> the distributed file system. Some have written on their own and some are using MogileFS, Lustre etc.

Can't use S3 since the requirement is to keep everything within the net= work
so that it's secure and under control.

Initially I thought Cassandra could be used for both row data and as well a= s
large files. But from what I've read and suggestions that I've got = it looks
like I need to look at distributed file system which is fault taulrent and<= br> also scales well. After reading online I can come up with only few options<= br> like lustre, glusterfs and mogileFS to store large files. As you mentioned<= br> lustreFS needs kernel tweaking/volume creation etc. Still trying to read more about glusterFS. And mogileFS last updated date on the site is back in=
2010 so not sure if it's still widely supported (in case of issues :)).=


--
View this message in context: http://cassandra-user-incubator-apache-org.306= 5146.n2.nabble.com/Storing-photos-images-docs-etc-tp6078278p6086307.html
Sent from the
cassan= dra-user@incubator.apache.org mailing list archive at Nabble.com.

--000e0cd59cb802f369049d9a5a39--