Return-Path: X-Original-To: apmail-accumulo-user-archive@www.apache.org Delivered-To: apmail-accumulo-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id E816F17EFC for ; Thu, 24 Sep 2015 13:04:04 +0000 (UTC) Received: (qmail 88357 invoked by uid 500); 24 Sep 2015 13:03:59 -0000 Delivered-To: apmail-accumulo-user-archive@accumulo.apache.org Received: (qmail 88311 invoked by uid 500); 24 Sep 2015 13:03:59 -0000 Mailing-List: contact user-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@accumulo.apache.org Delivered-To: mailing list user@accumulo.apache.org Received: (qmail 88301 invoked by uid 99); 24 Sep 2015 13:03:59 -0000 Received: from Unknown (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 24 Sep 2015 13:03:59 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 4E5B2C0FAC for ; Thu, 24 Sep 2015 13:03:59 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.021 X-Spam-Level: X-Spam-Status: No, score=-0.021 tagged_above=-999 required=6.31 tests=[RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H4=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id jAyjFp9W-QLH for ; Thu, 24 Sep 2015 13:03:58 +0000 (UTC) Received: from smtp89.ord1c.emailsrvr.com (smtp89.ord1c.emailsrvr.com [108.166.43.89]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id 3C0E942B22 for ; Thu, 24 Sep 2015 13:03:58 +0000 (UTC) Received: from smtp28.relay.ord1c.emailsrvr.com (localhost.localdomain [127.0.0.1]) by smtp28.relay.ord1c.emailsrvr.com (SMTP Server) with ESMTP id 53B5D1802A0; Thu, 24 Sep 2015 09:03:52 -0400 (EDT) Received: by smtp28.relay.ord1c.emailsrvr.com (Authenticated sender: shweta.agrawal-AT-orkash.com) with ESMTPSA id ACDE21800B1 for ; Thu, 24 Sep 2015 09:03:51 -0400 (EDT) X-Sender-Id: shweta.agrawal@orkash.com Received: from [192.168.0.119] ([UNAVAILABLE]. [14.141.49.198]) (using TLSv1 with cipher DHE-RSA-CAMELLIA256-SHA) by 0.0.0.0:465 (trex/5.4.2); Thu, 24 Sep 2015 13:03:52 GMT Message-ID: <5603F4B0.3040703@orkash.com> Date: Thu, 24 Sep 2015 18:33:44 +0530 From: "shweta.agrawal" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130330 Thunderbird/17.0.5 MIME-Version: 1.0 To: user@accumulo.apache.org Subject: Time based aggregation problem on storing data in D4M schema Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Hi all, I have stored twitter graph data in the form of D4M schema. As in D4M schema we have tweet id in rowid. But I want to aggregate fields on the basis of time. If I apply timestamp filter for this query it will work slow the query, as data is large. And also if I want to check condition also before aggregation. I have 10 years of tweets data and want to run second level aggregations on two months data. Like I want to aggregate all location field of tweets having hashtag modi and tweets of 2 months. I can create reverse index on time but cannot apply any additional conditions on it with the help of index like hashtag modi condition. So can anyone tell me how to aggregate fields with some condition on the basis of time on D4M style data? Thanks and Regards Shweta