Return-Path: Delivered-To: apmail-cassandra-user-archive@www.apache.org Received: (qmail 28838 invoked from network); 4 Oct 2010 13:51:46 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 4 Oct 2010 13:51:46 -0000 Received: (qmail 8734 invoked by uid 500); 4 Oct 2010 13:51:43 -0000 Delivered-To: apmail-cassandra-user-archive@cassandra.apache.org Received: (qmail 7712 invoked by uid 500); 4 Oct 2010 13:51:40 -0000 Mailing-List: contact user-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@cassandra.apache.org Delivered-To: mailing list user@cassandra.apache.org Received: (qmail 7704 invoked by uid 99); 4 Oct 2010 13:51:40 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 Oct 2010 13:51:40 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jbellis@gmail.com designates 209.85.161.44 as permitted sender) Received: from [209.85.161.44] (HELO mail-fx0-f44.google.com) (209.85.161.44) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 04 Oct 2010 13:51:34 +0000 Received: by fxm6 with SMTP id 6so3798406fxm.31 for ; Mon, 04 Oct 2010 06:51:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=+Uxl/R/iOWFT6fB4tw1iJkP91F5OmbImCMfrPZ9/kaU=; b=UURmMBwTd59DjWg8tplSDQI65PaHk0YE6Pc7qKbxwSIU3D5yc+BOYb62xupTad011r gWvPZbMhVIr+A8zdTQZUhKRKopntOW2nA15K28aD6HthuNGlbaRL+qnKmSEGq4DkOTFz odH//YGYNgSuFi/SN3PO6LR2mUkf6g0rdbe5M= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=Yc9EcvfEA4pmKygvpl8qm1xCXLUIYuj4mZx9d9dex36nNqD2VQvjVRmWs0oNpcsneZ 2eSIe9g1C5Zf75iFpf8AyG1OWCxul2AHY+iDzpom35X+bSdJ5FvU3hJRKnbY775cEmog yFY3OX7wpEIUx/w7KQKNkY8aYBUODS05JoxCM= MIME-Version: 1.0 Received: by 10.239.140.5 with SMTP id v5mr776628hbv.120.1286200268997; Mon, 04 Oct 2010 06:51:08 -0700 (PDT) Received: by 10.220.186.132 with HTTP; Mon, 4 Oct 2010 06:51:08 -0700 (PDT) In-Reply-To: References: Date: Mon, 4 Oct 2010 08:51:08 -0500 Message-ID: Subject: Re: A proposed use case, any comments and experience is appreciated From: Jonathan Ellis To: user Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Expiring columns are 0.7 only. An expired column behaves like a deleted column until it is compacted away. On Mon, Oct 4, 2010 at 8:48 AM, Utku Can Top=E7u wrote: > Hi Jonathan, > > Thank you for mentioning about the expiring columns issue. I didn't know > that it had existed. > That's really great news. > First of all, does the current 0.6 branch support it? If not so, is the > patch available for the 0.6.5 somehow? > And about the deletion issue, if all the columns in a row expire? When th= e > row will be deleted, will I be seeing the row in my map inputs somehow, a= nd > for how long? > > Regards, > Utku > > On Mon, Oct 4, 2010 at 3:30 PM, Jonathan Ellis wrote: >> >> A simpler approach might be to insert expiring columns into a 2nd CF >> with a TTL of one hour. >> >> On Mon, Oct 4, 2010 at 5:12 AM, Utku Can Top=E7u wro= te: >> > Hey All, >> > >> > I'm planning to run Map/Reduce on one of the ColumnFamilies. The keys >> > are >> > formed in such a fashion that, they are indexed in descending order by >> > time. >> > So I'll be analyzing the data for every hour iteratively. >> > >> > Since the current Hadoop integration does not support partial >> > columnfamily >> > analysis. I feel that, I'll need to dump the data of the last hour and >> > put >> > it to the hadoop cluster and do my analysis on the flat text file. >> > Do you think of any other "better" way of getting the data of a keyran= ge >> > into a hadoop cluster for analysis? >> > >> > Regards, >> > >> > Utku >> > >> > >> > >> >> >> >> -- >> Jonathan Ellis >> Project Chair, Apache Cassandra >> co-founder of Riptano, the source for professional Cassandra support >> http://riptano.com > > --=20 Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com