From: rahul gidwani
To: dev@hbase.apache.org
Subject: Re: hbase hdfs snapshots
Date: Fri, 10 Jul 2015 15:46:11 -0700

Hi Matteo,

We do SKIP_FLUSH. We have 1200+ regionservers with a single table with 60k
regions and 4 column families. It takes around 30 minutes to snapshot this
table using manifests compared to just seconds doing this with hdfs. Cloning
this table takes considerably longer.

For cases where someone would want to run Map/Reduce over snapshots this
could be much faster as we could take an hdfs snapshot and bypass the clone.

rahul

On Thu, Jul 9, 2015 at 12:20 PM, Matteo Bertozzi wrote:

> On Thu, Jul 9, 2015 at 12:12 PM, rahul gidwani wrote:
>
> > Even with manifests (Snapshot V2) for our larger tables it can take hours
> > to Snapshot and Clone a table.
>
> on snapshot time the only thing that can take hours, is "flush".
> if you don't need that (which is what you get with hdfs snapshots) you can
> specify SKIP_FLUSH => true
>
> Matteo
>
> On Thu, Jul 9, 2015 at 12:12 PM, rahul gidwani wrote:
>
> > HBase snapshots are a very useful feature. but it was implemented back
> > before there was the ability to snapshot via HDFS.
> >
> > Newer versions of Hadoop support HDFS snapshots. I was wondering if the
> > community would be interested in something like a Snapshot V3 where we use
> > HDFS to take these snapshots.
> >
> > Even with manifests (Snapshot V2) for our larger tables it can take hours
> > to Snapshot and Clone a table.
> >
> > Would this feature be of use to anyone?
> >
> > thanks
> > rahul