Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F22C39548 for ; Tue, 3 Apr 2012 22:57:31 +0000 (UTC) Received: (qmail 83122 invoked by uid 500); 3 Apr 2012 22:57:29 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 83079 invoked by uid 500); 3 Apr 2012 22:57:29 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 83065 invoked by uid 99); 3 Apr 2012 22:57:29 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Apr 2012 22:57:29 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jbellis@gmail.com designates 209.85.214.44 as permitted sender) Received: from [209.85.214.44] (HELO mail-bk0-f44.google.com) (209.85.214.44) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Apr 2012 22:57:25 +0000 Received: by bkuw5 with SMTP id w5so228182bku.31 for ; Tue, 03 Apr 2012 15:57:03 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:content-transfer-encoding; bh=CQ+0o97IdprffLKXY1z7AQOuUblH6/T85DOBPtL0oL0=; b=d9ap3xSqrSv/hpqA4iyivUEH0LS3NhljiVylEVoPkneCwUcbBySQCdbJdrCaBRoGXu 026oh08F++szDRKtuALu5ttstfSEOAQ/yN+Tk+ahRdieHUiBJjhDGgiL2A+BxHLvZlt/ 31M2aWil/XdYuNrMYeUAbMi3U2K8zjbmOlZcXLLfDkhX2hZadDiJMmLMbsccKEBMl9ok mie+huF0iWFVxu5Wp0O4K5xT+pmNg9wmPwMa8Zokz2MoNybGJxJx6ua4qFkiQiOCG/ul vpvGdmTeDSZ0l8dQNLhN995u7zhljBq6ef01PR8mEwfv/gDfTi5NzIQVWowRXpUIWMzQ O3jw== Received: by 10.205.129.4 with SMTP id hg4mr6468896bkc.16.1333493823854; Tue, 03 Apr 2012 15:57:03 -0700 (PDT) MIME-Version: 1.0 Received: by 10.204.57.210 with HTTP; Tue, 3 Apr 2012 15:56:43 -0700 (PDT) In-Reply-To: References: From: Jonathan Ellis Date: Tue, 3 Apr 2012 17:56:43 -0500 Message-ID: Subject: Re: Largest 'sensible' value To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org We use 2MB chunks for our CFS implementation of HDFS: http://www.datastax.com/dev/blog/cassandra-file-system-design On Mon, Apr 2, 2012 at 4:23 AM, Franc Carter wr= ote: > > Hi, > > We are in the early stages of thinking about a project that needs to stor= e > data that will be accessed by Hadoop. One of the concerns we have is arou= nd > the Latency of HDFS as our use case is is not for reading all the data an= d > hence we will need custom RecordReaders etc. > > I've seen a couple of comments that you shouldn't put large chunks in to = a > value - however 'large' is not well defined for the range of people using > these solutions ;-) > > Doe anyone have a rough rule of thumb for how big a single value can be > before we are outside sanity? > > thanks > > -- > > Franc Carter | Systems architect | Sirca Ltd > > franc.carter@sirca.org.au=A0|=A0www.sirca.org.au > > Tel:=A0+61 2 9236 9118 > > Level 9, 80 Clarence St, Sydney=A0NSW 2000 > > PO Box H58, Australia Square, Sydney NSW 1215 > > --=20 Jonathan Ellis Project Chair, Apache Cassandra co-founder of DataStax, the source for professional Cassandra support http://www.datastax.com