Return-Path: X-Original-To: apmail-cassandra-user-archive@www.apache.org Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A6BDBD875 for ; Mon, 1 Oct 2012 13:28:07 +0000 (UTC) Received: (qmail 86117 invoked by uid 500); 1 Oct 2012 13:28:05 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 86088 invoked by uid 500); 1 Oct 2012 13:28:05 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 86078 invoked by uid 99); 1 Oct 2012 13:28:05 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Oct 2012 13:28:05 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of lewis.mcgibbney@gmail.com designates 209.85.216.51 as permitted sender) Received: from [209.85.216.51] (HELO mail-qa0-f51.google.com) (209.85.216.51) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 01 Oct 2012 13:27:58 +0000 Received: by qabj40 with SMTP id j40so1621593qab.10 for ; Mon, 01 Oct 2012 06:27:37 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=1njpBShBTpMFnH0GaBt5079P+E4NCEN6Kir8dw+2nxw=; b=l4YWci1VTNOo2HmnoVryQhiOn0SrRhagN0RLYsNdOaGP2KA3mgxukNSnIFQphcjp8T lEpHnUh3iWzDFRhquZfQkY299VLAipYh13GCzXE+jJ4Vd06HYeF9uGGP66elmCn6DZRJ H8rh6j7pgk4LdQ3L/yLjv7nUX1VwD9qSC77ROMx8HJji6peIvhw5I7vW3YJVb19EUZmv /J8Dq0iJWM0OaPfhgO76z6YAWZaBiQNVotGmufR7ohQk0Pp7nUDSy+keqqQQOB+0bZh/ tERdf107JtLtoVsmK6yX1PKBfGOTcoH1KwoZ4xrA+b+S58Zp+X8TOh5LFSEdt6UB0ntS iOog== MIME-Version: 1.0 Received: by 10.229.171.221 with SMTP id i29mr10049773qcz.15.1349098057875; Mon, 01 Oct 2012 06:27:37 -0700 (PDT) Received: by 10.49.110.101 with HTTP; Mon, 1 Oct 2012 06:27:37 -0700 (PDT) Date: Mon, 1 Oct 2012 14:27:37 +0100 Message-ID: Subject: Advice on correct storage configuration From: Lewis John Mcgibbney To: user@cassandra.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org Hi, I wish to confirm whether the current mapping (storage) configuration I have is suited to store data commonly extracted field data from Web Pages. My mapping can be seen here [0] which basically specifies three column families e.g. parse (p), fetch (f) and super columns (sc) within the webpage keyspace. Each column family subsequently includes several fields which for clarity include comments. Current CF configuration is as follows: - fetch CF includes 11 columns - parse CF including 4 - super column CF including 7 I am trying to ascertain why the 7 super column fields are currently configured to be super columns as oppose to standard columns! I therefore wonder if someone can please clarify if such a configuration is suited to storing data of this nature. Thank you in advance. if this is too vague an explanation the please say so and I will be happy to expand on any aspect in an attempt to fully understand the data model and the configuration. Thank you Lewis [0] http://svn.apache.org/viewvc/nutch/branches/2.x/conf/gora-cassandra-mapping.xml?view=markup -- Lewis