Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id B3859200ACC for ; Mon, 2 May 2016 10:26:02 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id B1E921609B0; Mon, 2 May 2016 10:26:02 +0200 (CEST) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 07E4E1609A1 for ; Mon, 2 May 2016 10:26:01 +0200 (CEST) Received: (qmail 20843 invoked by uid 500); 2 May 2016 08:26:00 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 20831 invoked by uid 99); 2 May 2016 08:26:00 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd4-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 May 2016 08:26:00 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd4-us-west.apache.org (ASF Mail Server at spamd4-us-west.apache.org) with ESMTP id 17835C04B9 for ; Mon, 2 May 2016 08:26:00 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd4-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 1.43 X-Spam-Level: * X-Spam-Status: No, score=1.43 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_ENVFROM_END_DIGIT=0.25, HTML_MESSAGE=2, HTML_OBFUSCATE_05_10=0.001, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd4-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-lw-us.apache.org ([10.40.0.8]) by localhost (spamd4-us-west.apache.org [10.40.0.11]) (amavisd-new, port 10024) with ESMTP id LMfCv2uOtxIM for ; Mon, 2 May 2016 08:25:58 +0000 (UTC) Received: from mail-wm0-f41.google.com (mail-wm0-f41.google.com [74.125.82.41]) by mx1-lw-us.apache.org (ASF Mail Server at mx1-lw-us.apache.org) with ESMTPS id 693985F24F for ; Mon, 2 May 2016 08:25:57 +0000 (UTC) Received: by mail-wm0-f41.google.com with SMTP id e201so97735472wme.0 for ; Mon, 02 May 2016 01:25:57 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to; bh=bTKW8YpJTPL+PSG5G8v9MUVOG0vv3IKczBCjvyxSgJM=; b=YUBsO90pAkbPPQeuL3fATn776J6NFBt9nDujt90jnJ5PCR+Yl7I9gbGIGZwTkNGDS5 1mMw7Wr7mt4KIpMEJNuYNEjcBB/Uje7gt+ZKzDnGEf1fhtTDAYV4xcAUPzwHcxPZWU6O xMCCnYdDa/uKgYvbTJMyk8vINggNOtEmwpWhuvoOct7LRLDoAYvVVXGsKXWf0X0nbygR VhpH+UVQxhN2SdUdwqk7QBkpIsD0sQ40D3PmdsMIqG+Y0lWpTCC4cpVoqnNx6IwnBQ6Z tg786BqshLxk5ynRRoqKGoN5p0Stc1KRbUdzIoigBdmloZqmBm/Fe2WQhGjltH/MkNYW Svjg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:date :message-id:subject:from:to; bh=bTKW8YpJTPL+PSG5G8v9MUVOG0vv3IKczBCjvyxSgJM=; b=mcEMQTP/8CsDD8hPl7npynqli66YEQRdLpt3O704uILsrLUBa2WZPQ0zpVaiSBiarO Nj0VoJUyNtq1pLPHUhrDBRTrFI+cF1FJgny2ie5KHOlao4thY5Djcn5vntPoLty2EyCW C1Lcr7LSnRdYD0jKDXAU+nsCSTvCqOyCKfzbo3FEaAAS9CQTXh3815EI0H1c0ajxacEB X+B/YIA+xFau8/KrfVQepofcUVWqnYuDRRJooPG1NpA9R130wUEk2sLmavLlKAVmxsTF dek8jYo/UXBUDz9B1g3ouvZHUKLBcenS6q27t1ZCjezFVdITaH2DXXTwjThGilFV6sfX AJ7Q== X-Gm-Message-State: AOPr4FV9jeFZV2PeYvROI37f1kBTsahE3kJcX2UZtsZLWr9FMpaCil2MFQJbvPNWdJ5GsMtH2shOp47mn3ryRg== MIME-Version: 1.0 X-Received: by 10.194.11.97 with SMTP id p1mr34652144wjb.159.1462177555901; Mon, 02 May 2016 01:25:55 -0700 (PDT) Received: by 10.28.170.81 with HTTP; Mon, 2 May 2016 01:25:55 -0700 (PDT) In-Reply-To: References: Date: Mon, 2 May 2016 13:55:55 +0530 Message-ID: Subject: Re: hbase architecture doubts From: Shushant Arora To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=047d7b4508e4d1783e0531d7bd42 archived-at: Mon, 02 May 2016 08:26:02 -0000 --047d7b4508e4d1783e0531d7bd42 Content-Type: text/plain; charset=UTF-8 Thanks! Few doubts; 1.LSM tree comprises two tree-like structures, called C0 and C1 and If the insertion causes the C0 component to exceed a certain size threshold, a contiguous segment of entries is removed from C0 and merged into C1 on disk But in Hbase when C0 which is memstore I guess? is exceeded the threshold size its dumped on to HDFS as HFIle(c1 I guess?) - and does compaction is the process which here means as merging of C0 and C1 ? 2.Moves current, active Map aside as a snapshot (while a write lock is held for a short period of time), and then creates a new CSLS instances. In background, the snapshot is then dumped to disk. We get an Iterator on CSLS. We write a block at a time. When we exceed configured block size, we start a new one. -- Does write lock is held till the time complete CSLS is dumpled on disk.And read is allowed using snapshot. Thanks! On Mon, May 2, 2016 at 11:39 AM, Stack wrote: > On Sun, May 1, 2016 at 3:36 AM, Shushant Arora > wrote: > > > 1.Does Hbase uses ConcurrentskipListMap(CSLM) to store data in memstore? > > > > Yes (We use a CSLS but this is implemented over a CSLM). > > > > 2.When mwmstore is flushed to HDFS- does it dump the memstore > > Concurrentskiplist as Hfile2? Then How does it calculates blocks out of > > CSLM and dmp them in HDFS. > > > > > Moves current, active Map aside as a snapshot (while a write lock is held > for a short period of time), and then creates a new CSLS instances. > > In background, the snapshot is then dumped to disk. We get an Iterator on > CSLS. We write a block at a time. When we exceed configured block size, we > start a new one. > > > > 3.After dumping the inmemory CSLM of memstore to HFILe does memstore > > content is discarded > > > Yes > > > > > and if while dumping memstore any read request comes > > will it be responded by copy of memstore or discard of memstore will be > > blocked until read request is completed? > > > > We will respond using the snapshot until it has been successfully dumped. > Once dumped, we'll respond using the hfile. > > No blocking (other than for the short period during which the snapshot is > made and the file is swapped into the read path). > > > > > 4.When a read request comes does it look in inmemory CSLM and then in > > HFile? > > > Generally, yes. > > > > > And what is LogStructuredMerge tree and its usage in Hbase. > > > > > Suggest you read up on LSM Trees ( > https://en.wikipedia.org/wiki/Log-structured_merge-tree) and if you still > can't see the LSM tree in the HBase forest, ask specific questions and > we'll help you out. > > St.Ack > > > > > > Thanks! > > > --047d7b4508e4d1783e0531d7bd42--