Return-Path: Delivered-To: apmail-hbase-dev-archive@www.apache.org Received: (qmail 52279 invoked from network); 22 Jul 2010 23:38:46 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 22 Jul 2010 23:38:46 -0000 Received: (qmail 45864 invoked by uid 500); 22 Jul 2010 23:38:46 -0000 Delivered-To: apmail-hbase-dev-archive@hbase.apache.org Received: (qmail 45725 invoked by uid 500); 22 Jul 2010 23:38:45 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 45709 invoked by uid 99); 22 Jul 2010 23:38:45 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Jul 2010 23:38:45 +0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_NONE,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of jdcryans@gmail.com designates 209.85.212.41 as permitted sender) Received: from [209.85.212.41] (HELO mail-vw0-f41.google.com) (209.85.212.41) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 22 Jul 2010 23:38:40 +0000 Received: by vws16 with SMTP id 16so830279vws.14 for ; Thu, 22 Jul 2010 16:38:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:sender:received :in-reply-to:references:date:x-google-sender-auth:message-id:subject :from:to:content-type:content-transfer-encoding; bh=rtSRAxnky9DFIL1RTjhdoPCPUERKtsLgM/xoeG3x3bk=; b=ZXftbAaysScKWzOJMnuCMsfNI82yL3GuuArm8sMOCdyBG3SiOg2bT8pvxe7I7ZfUcj sPxCE446wdiZWJQuVG9vwB1X0prrZ0gHxzVuWHO+WGuolsMzwos2RLSa3JZAGlpC9jQK Bd6f9wbAAf2JXehEERrScPf2eBQLDaqtKfxKo= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:content-type :content-transfer-encoding; b=n5bVP2sgglqThSBRMde+Ccxi86V1L4X6o+/HkAQ1Bci2fANOWpC7DXfwl0+BA6CZoJ dI16SbOLVtEQHZVq3SHqX6D2m08Cuk70+etxNxPycmFXOh1uXHFU0xxPBnSwpHMRF0Vl 0doaj69qwp8xJ+PQvPzFrVmInqks/AjsMPeu4= MIME-Version: 1.0 Received: by 10.224.20.78 with SMTP id e14mr1790935qab.135.1279841898995; Thu, 22 Jul 2010 16:38:18 -0700 (PDT) Sender: jdcryans@gmail.com Received: by 10.229.78.217 with HTTP; Thu, 22 Jul 2010 16:38:18 -0700 (PDT) In-Reply-To: References: Date: Thu, 22 Jul 2010 16:38:18 -0700 X-Google-Sender-Auth: 4ASGVXEHF7acly4YIH_qHAtcd8Q Message-ID: Subject: Re: Data disappears and re-appears again after HBase cluster restart From: Jean-Daniel Cryans To: dev@hbase.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable I would guess clock skew, all the machines have approx the same time? A few seconds is acceptable, but not more. J-D On Thu, Jul 22, 2010 at 4:34 PM, Vladimir Rodionov wrote: > Have anybody encountered this particular bug before? > We have been having this intermittently in our QA small cluster. > > We run a flow =A0which is basically custom ETL process over data stored i= n hdfs. Yes it is a bunch of M/R jobs. > One of the jobs stores data into HBase (0.20.3), the next one loads data = from HBase (using scan) performs additional transformations > and stores data finally into RDBMS. > > Flow works fine (most of the time). It means that new HBase tables are cr= eated, data is loaded and can be read after that during the next M/R job > > After flow finishes , data from tables (but not tables itself), sometimes= , mysteriously disappear. This is not deterministic and to get data back we= need to RESTART HBase cluster. > So HBase restart fixes the problem. > > Cluster is small (3 servers). RAM is limited - 8GB. Only 2 CPU cores per = server but input data size is small as well and the average size of disappe= aring tables is several 1000s rows- > they are small. Hadoop is from CHD2. I can not get you any additional hel= pful information at the time (no log files), but may be somebody has encoun= tered this > before and has idea how to fix it. > > > Best regards, > Vladimir Rodionov > Principal Platform Engineer > Carrier IQ, www.carrieriq.com > e-mail: vrodionov@carrieriq.com >