Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id AA0E3200B9B for ; Wed, 12 Oct 2016 17:02:40 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id A884F160AD4; Wed, 12 Oct 2016 15:02:40 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id EEA1A160AD3 for ; Wed, 12 Oct 2016 17:02:39 +0200 (CEST) Received: (qmail 55258 invoked by uid 500); 12 Oct 2016 15:02:37 -0000 Mailing-List: contact dev-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hbase.apache.org Delivered-To: mailing list dev@hbase.apache.org Received: (qmail 55246 invoked by uid 99); 12 Oct 2016 15:02:37 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 Oct 2016 15:02:37 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 08AA1C0118 for ; Wed, 12 Oct 2016 15:02:37 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.898 X-Spam-Level: ** X-Spam-Status: No, score=2.898 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=2, KAM_LOTSOFHASH=0.25, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_SORBS_SPAM=0.5, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id fNE1PQrdvrtX for ; Wed, 12 Oct 2016 15:02:35 +0000 (UTC) Received: from mail-oi0-f48.google.com (mail-oi0-f48.google.com [209.85.218.48]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTPS id E5F955FAF7 for ; Wed, 12 Oct 2016 15:02:34 +0000 (UTC) Received: by mail-oi0-f48.google.com with SMTP id t73so63858935oie.1 for ; Wed, 12 Oct 2016 08:02:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:from:date:message-id:subject:to; bh=wIdoygkwdQLy1Rcuw4bbzv/Eoz/SGo8yRZ+euXPQWmA=; b=z2p/lxM9e05rhLnF9ELNa5ltJTs0zMK+z3D/CtVzX6VfqHYnJnfZqQEgDMNMP0hYud fs3cDXHxQaSXX8y/KU3I3uqpSRajIIvko9w2hCiE/gllXPsH73UrE/9QL7+LqQJh6YyO qUlAlz4RK7RhhThBRbJKCXHaWef/66PlPpvoTeXqKe01AgFF7phug7o4EurSZ7duPWXr 2awWOJ9guhGZlf26VKbeXJmGPpARw/C9R8LrQhgqvmER34xb4LlHNdicS5NRJsr8bWIj NPapIveVwjBb6PVbeF6xWv2nygCmnFFVWSXLWZc/ckP1Pl5ahA2K9NHGUHbu9Qhu02D1 z0dg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=wIdoygkwdQLy1Rcuw4bbzv/Eoz/SGo8yRZ+euXPQWmA=; b=TQLplo4Zh9KokrNBBnsEEwO1D1LtxPD8YX4/YIfdwq59OdZgeeoTrFZ+NFhOgjxFRO p7c20ElC9qdHGoQHuZHKW6dza0XO+snE3K37poH3pHy4hpyjTO3cpWvNlAnbLZAUsS0v O1DEEb8sMdN81BHuHms2jH74jmM+g47+PjV4svBsMRmNXkaPTZ5X4tgfdQzc8T6vzKVG zekGVZWrZS39DLeFvKctFRO++kgRzGjB7mi1vGrVV4+83EqCwv7zdqlfaHccVRn8+HNO xszq2gql06uwZkXrM/AZCIcVaQeBB2BI8yGIAphi0x8/pcjkU83Y68DPIrNVK0Rn5I0F ON4A== X-Gm-Message-State: AA6/9RkXRzC0TqmvaTVF4QLN56SczMNtsD435zztSkA8xnLvJbx/DDCT3uyOLwWC+zki0NEcj5s0k4shC5adjw== X-Received: by 10.157.17.49 with SMTP id g46mr785500ote.25.1476284551962; Wed, 12 Oct 2016 08:02:31 -0700 (PDT) MIME-Version: 1.0 Received: by 10.157.16.118 with HTTP; Wed, 12 Oct 2016 08:02:31 -0700 (PDT) From: Tim Robertson Date: Wed, 12 Oct 2016 17:02:31 +0200 Message-ID: Subject: Data loss in MOB snapshot and clone? To: dev@hbase.apache.org Content-Type: multipart/alternative; boundary=001a11352ee64ea69a053eac486d archived-at: Wed, 12 Oct 2016 15:02:40 -0000 --001a11352ee64ea69a053eac486d Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Hi devs, [Had a quick chat with Lars G. about this and before opening a Jira I thought I'd raise it here first] We have just experienced data loss in HBase 1.0.0-cdh5.4.10. Before I dig into this further, I'd like to just ask if anyone has seen this before? The initial state was a table (tim_test) built with MOB support and a few 10's million rows and 10's billions of cells. I wanted to rename the table to get this into production and did so as follows: snapshot 'tim_test', 'tim_test-snapshot' clone_snapshot 'tim_test-snapshot', 'prod_b_map' At this stage the application all looked good, and so I continued with: delete_snapshot 'tim_test-snapshot' disable 'tim_test' drop =E2=80=98tim_test=E2=80=99 Then things went... awry and data just started dropping out in the app. Before long, all MOB data seemingly is gone. The references in the new table MOB folder appear to point to the source table (e.g. /hbase/mobdir/data/default/prod_b_map/ba42a2e8e9b669d9fc85bdfeed2f5f2a/EPSG= _4326/tim_test=3D14bf5f1737ac65c34615ed97c0b7de06-d41d8cd98f00b204e9800998e= cf8427e20161006ff8baa70d21f408caefe8ae6318dfba2). The RS logs full of ERROR like: 2016-10-12 15:19:14,640 ERROR org.apache.hadoop.hbase.regionserver.HStore: The mob file d41d8cd98f00b204e9800998ecf8427e20161006b59865f80e604781a79ebfa2ddd66b48 could not be found in the locations [hdfs://ha-nn/hbase/mobdir/data/default/tim_test/14bf5f1737ac65c34615ed97c0= b7de06/EPSG_4326, hdfs://ha-nn/hbase/archive/data/default/tim_test/14bf5f1737ac65c34615ed97c0= b7de06/EPSG_4326] What I don't know is: 1) was this running a background task to copy the MOB data when the snapshot was cloned and I just deleted the source before the copy was complete? - or 2) when running "snapshot and clone" it just references the source MOB data until a (?) change? 3) snapshot and clone just doesn't support MOB? Can anyone shed some light on this easily before I dig into it please? While this situation exists (at least in 1.0.0) might it be good to get info about data loss for MOB tables into the snapshot clone docs? Thanks, Tim --001a11352ee64ea69a053eac486d--