Message-ID: <565BCC17.9030504@orkash.com>
Date: Mon, 30 Nov 2015 09:39:59 +0530
From: "shweta.agrawal"
To: user@accumulo.apache.org
Subject: Re: how to maintain versioning in D4M schema?
References: <5656D04A.5010002@orkash.com> <5657D3BF.1000502@orkash.com> <5657D7AD.4000302@orkash.com> <5657DFCD.5040507@orkash.com> <56581F71.1000607@orkash.com> <20151127134505.GA14781@ll.mit.edu>
In-Reply-To: <20151127134505.GA14781@ll.mit.edu>

The example I am working with is:

rowid  colf  colq          value
id           field|value1  1
id           field|value2  1
id           field|value3  1
id           field|value4  1
id           field|value5  1
id           field|value6  1

This is my schema in D4M style. Here one field has multiple values.
I want to keep only the latest 3 values and have the other values
deleted automatically, as happens with the versioning iterator.

So after versioning my table should look like this:

rowid  colf  colq          value
id           field|value1  1
id           field|value2  1
id           field|value3  1

Thanks
Shweta

On Friday 27 November 2015 07:15 PM, Jeremy Kepner wrote:
> Can you provide a made up specific example? I think that will
> make the discussion easier.
>
> On Fri, Nov 27, 2015 at 02:46:33PM +0530, shweta.agrawal wrote:
>> Thanks for the answer.
>> But I am asking about versioning in D4M style. How can I use the
>> versioning iterator in D4M style, where the id is stored in the
>> row ID and field|value is stored in the column qualifier?
>> So, as the value is stored in the column qualifier, I cannot
>> maintain versions through the versioning iterator. How can I
>> maintain versioning in D4M style?
>>
>> Thanks
>> Shweta
>>
>> On Friday 27 November 2015 12:45 PM, Dylan Hutchison wrote:
>>> In order to store five versions of a key but return only one of
>>> them during a scan, set the minc and majc VersioningIterator to 5
>>> and set the scan VersioningIterator to 1. You can set scanning
>>> iterators on a per-scan basis if this helps.
>>>
>>> It is not necessary to put the timestamp in the column family if
>>> you are going with the VersioningIterator approach.
>>>
>>> There are many ways to achieve versioning in Accumulo. As the
>>> designer/programmer, you must choose one that fits your
>>> application, of which we do not know the full details. It sounds
>>> like you have narrowed your choice to (1) putting the timestamp in
>>> the column family, or (2) not putting the timestamp anywhere else
>>> but instead changing the VersioningIterator such that Accumulo
>>> stores more versions than the latest version of a
>>> (row,colfam,colqual,colvis) key.
>>>
>>> On Thu, Nov 26, 2015 at 8:45 PM, mohit.kaushik wrote:
>>>
>>> David,
>>>
>>> But this is the case when we store versions based on the timestamp
>>> field. The point is that in the D4M schema we cannot achieve it
>>> this way. In this case we are considering using the CF to store
>>> the timestamp in reverse order, as described by Dylan. Then how
>>> can we configure Accumulo to return only the latest version and
>>> store only 5 versions?
>>>
>>> Thanks
>>> Mohit Kaushik
>>>
>>> On 11/27/2015 09:54 AM, David Medinets wrote:
>>>> From the user manual:
>>>>
>>>> user@myinstance mytable> config -t mytable -s table.iterator.scan.vers.opt.maxVersions=5
>>>> user@myinstance mytable> config -t mytable -s table.iterator.minc.vers.opt.maxVersions=5
>>>> user@myinstance mytable> config -t mytable -s table.iterator.majc.vers.opt.maxVersions=5
>>>>
>>>> On Thu, Nov 26, 2015 at 11:10 PM, shweta.agrawal wrote:
>>>>
>>>> I want to maintain only 5 versions. The user can enter any
>>>> number of versions, but I want to keep only the 5 latest versions.
>>>>
>>>> On Friday 27 November 2015 09:38 AM, David Medinets wrote:
>>>>> Do you want five versions of every entry or will the number
>>>>> of versions vary?
>>>>>
>>>>> On Thu, Nov 26, 2015 at 10:53 PM, shweta.agrawal wrote:
>>>>>
>>>>> Thanks Dylan and David.
>>>>> I can store version information in the column family. But my
>>>>> problem is how to manage many versions of the same key.
>>>>> With Accumulo versioning I can specify how many versions
>>>>> I want to keep.
>>>>>
>>>>> Suppose I have 10 versions and I want to store only 5 of them;
>>>>> how do I manage this in a big table?
>>>>>
>>>>> Thanks
>>>>> Shweta
>>>>>
>>>>> On Thursday 26 November 2015 10:22 PM, David Medinets wrote:
>>>>>> What are the query patterns? If you are versioning for
>>>>>> auditing then changing the VersioningIterator seems the
>>>>>> easiest approach. You could also store
>>>>>> application-specific version information in the column
>>>>>> family. One of the reasons that D4M does not use it is
>>>>>> to allow application-specific uses. Using the CF means
>>>>>> that any applications that understand D4M would not
>>>>>> need to change their queries to adjust for the version
>>>>>> information.
>>>>>>
>>>>>> On Thu, Nov 26, 2015 at 4:26 AM, shweta.agrawal wrote:
>>>>>>
>>>>>> Hi,
>>>>>>
>>>>>> I have my data stored in D4M style.
>>>>>> I also want to
>>>>>> maintain versions of different values on the basis
>>>>>> of time. In D4M style, data is stored only in the row ID
>>>>>> and the column qualifier.
>>>>>>
>>>>>> Is there any way to achieve versioning in the D4M schema?
>>>>>>
>>>>>> Thanks
>>>>>> Shweta
>>>
>>> --
>>> *Mohit Kaushik*
>>> Software Engineer
>>> A Square, Plot No. 278, Udyog Vihar, Phase 2, Gurgaon 122016, India
>>> *Tel:* +91 (124) 4969352 | *Fax:* +91 (124) 4033553
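
The retention rule asked for at the top of this thread (keep only the latest 3 values of a field when each value lives in its own field|value column qualifier) can be modelled in a few lines of Python. This is only a sketch of the logic, not Accumulo code: Entry, versioning_model, latest_per_field and the timestamps are hypothetical names and values introduced for illustration, and the column family is assumed to be empty. It shows why per-key versioning keeps all six entries (every field|value is a distinct key) and what grouping by the field prefix would keep instead.

```python
from collections import defaultdict
from typing import NamedTuple


class Entry(NamedTuple):
    row: str
    colf: str       # assumed empty, as in the example table
    colq: str       # D4M style: "field|value"
    timestamp: int  # hypothetical timestamps, larger means newer
    value: str


def _keep_newest(entries, limit, group_key):
    """Group entries by group_key and keep the `limit` newest per group."""
    groups = defaultdict(list)
    for e in entries:
        groups[group_key(e)].append(e)
    kept = []
    for group in groups.values():
        group.sort(key=lambda e: e.timestamp, reverse=True)
        kept.extend(group[:limit])
    return sorted(kept, key=lambda e: (e.row, e.colf, e.colq))


def versioning_model(entries, max_versions):
    """Per-key versioning: newest max_versions per identical (row, colf, colq)."""
    return _keep_newest(entries, max_versions,
                        lambda e: (e.row, e.colf, e.colq))


def latest_per_field(entries, max_values):
    """Newest max_values per (row, colf, field); field is the text before '|'."""
    return _keep_newest(entries, max_values,
                        lambda e: (e.row, e.colf, e.colq.split("|", 1)[0]))


# Six values of one field; value1 is assumed to be the newest (timestamp 6).
table = [Entry("id", "", f"field|value{i}", 7 - i, "1") for i in range(1, 7)]

# Every colq is a different key, so per-key versioning drops nothing.
print(len(versioning_model(table, 3)))
# Grouping by the field prefix keeps only the three newest values.
print([e.colq for e in latest_per_field(table, 3)])
```

In a real table this grouping would presumably have to be done by a custom server-side iterator or by the client; the thread does not settle which.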