Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 34153DEC5 for ; Wed, 21 Nov 2012 16:18:48 +0000 (UTC) Received: (qmail 6616 invoked by uid 500); 21 Nov 2012 16:18:46 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 6385 invoked by uid 500); 21 Nov 2012 16:18:45 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 6350 invoked by uid 99); 21 Nov 2012 16:18:45 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 21 Nov 2012 16:18:45 +0000 X-ASF-Spam-Status: No, hits=1.0 required=5.0 tests=FREEMAIL_REPLY,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of michael_segel@hotmail.com designates 65.55.111.109 as permitted sender) Received: from [65.55.111.109] (HELO blu0-omc2-s34.blu0.hotmail.com) (65.55.111.109) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 21 Nov 2012 16:18:38 +0000 Received: from BLU0-SMTP198 ([65.55.111.71]) by blu0-omc2-s34.blu0.hotmail.com with Microsoft SMTPSVC(6.0.3790.4675); Wed, 21 Nov 2012 08:18:17 -0800 X-Originating-IP: [173.15.87.37] X-EIP: [mfKeQsxGbsgSn/a4LSuE0OJaXzdTWp9g] X-Originating-Email: [michael_segel@hotmail.com] Message-ID: Received: from [192.168.0.104] ([173.15.87.37]) by BLU0-SMTP198.blu0.hotmail.com over TLS secured channel with Microsoft SMTPSVC(6.0.3790.4675); Wed, 21 Nov 2012 08:18:15 -0800 Content-Type: text/plain; charset="GB2312" MIME-Version: 1.0 (Mac OS X Mail 6.2 \(1499\)) Subject: Re: delete rows without writing HLog may be appear in the future? From: Michael Segel In-Reply-To: Date: Wed, 21 Nov 2012 10:18:13 -0600 Content-Transfer-Encoding: quoted-printable References: To: user@hbase.apache.org X-Mailer: Apple Mail (2.1499) X-OriginalArrivalTime: 21 Nov 2012 16:18:15.0748 (UTC) FILETIME=[CFA0A040:01CDC803] X-Virus-Checked: Checked by ClamAV on apache.org Ok,=20 First, I am a firm believer of not bypassing the WAL period. I'm not sure why you would be seeing the data after a delete. If what = you said is true, then either the delete got lost or the delete happened = before the insert (which doesn't make sense because the delete should = have thrown an exception...) I am also confused by what you mean that the delete has to be low = latency.=20 What's the timing difference between writing a delete to the WAL or = bypassing the WAL.=20 Also I am concerned by your statement that you do the delete, it looks = like it was deleted, only a week or two later, its back.=20 That doesn't make sense because the data written to the WAL would have = long since been flushed and then if you delete, then the flag should = have still remained.=20 Can you check the timestamp of the cell?=20 Something isn't right.=20 On Nov 21, 2012, at 9:37 AM, Bing Jiang = wrote: > In our apps, deletes will be frequent, and it occurs to each records = every > time, if write hlog, the performance and response will be low. In = fact,we > can bear with some records with delete fail, but recently I have found = more > records delete some time ago, for example, one week , they reappear > again.Then, that makes me curious about what should do next., delete = with > writing hlog, or put without hlog.... > On Nov 21, 2012 11:19 PM, "Kevin O'dell" = wrote: >=20 >> Bing, >>=20 >> I am curious to hear more about Mike's question. Why are you not = using >> the WAL for your deletes? >>=20 >> On Wed, Nov 21, 2012 at 10:17 AM, Bing Jiang = >> wrote: >>=20 >>> yes,hbase has made a compaction between batch-put and deletes. any = ideas? >>>=20 >>> On Nov 21, 2012 11:10 PM, "Michael Segel" = >>> wrote: >>>>=20 >>>> Some time later? >>>>=20 >>>> Time of course is relative, so I have to ask what occurred between = the >>> write and the delete? >>>> How much time? Did you have any compactions in between the write = and >> the >>> delete? >>>>=20 >>>> Why are you not consistent in your use of the WAL ? >>>>=20 >>>>=20 >>>> On Nov 21, 2012, at 6:37 AM, Bing Jiang >>> wrote: >>>>=20 >>>>> hi=A3=ACall. >>>>> I want to describe a phenomenon that happens to our hbase cluster. >>>>> I use puts(List) to insert many records with writing hlog >> enable, >>>>> and some time later I delete all of these records with writing = hlog >>> disable. >>>>> When one week later, i scan the table, I found some records I have >>> delete >>>>> reappear again. >>>>> It is an interesting case. In my opinion, if we delete data = without >>> enable >>>>> writing hlog, when regionserver fails, the log will replay in = another >>>>> regionserver. >>>>> Can anyone tell me if I persist on deleting records without enable >>> writing >>>>> hlog, is there a way to prevent these records from reappearing = again >>> some >>>>> time later? >>>>>=20 >>>>> Cheers! >>>>> -- >>>>> Bing Jiang >>>>> weibo: http://weibo.com/jiangbinglover >>>>> BLOG: http://blog.sina.com.cn/jiangbinglover >>>>> BLOG: http://www.binospace.com >>>>> National Research Center for Intelligent Computing Systems >>>>> Institute of Computing technology >>>>> Graduate University of Chinese Academy of Science >>>>=20 >>>=20 >>=20 >>=20 >>=20 >> -- >> Kevin O'Dell >> Customer Operations Engineer, Cloudera >>=20