Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id F0E769253 for ; Sat, 16 Jun 2012 14:18:21 +0000 (UTC) Received: (qmail 43119 invoked by uid 500); 16 Jun 2012 14:18:20 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 43050 invoked by uid 500); 16 Jun 2012 14:18:20 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 43042 invoked by uid 99); 16 Jun 2012 14:18:20 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 16 Jun 2012 14:18:20 +0000 X-ASF-Spam-Status: No, hits=-0.1 required=5.0 tests=FSL_RCVD_USER,HTML_MESSAGE,RCVD_IN_DNSWL_MED,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of amits@infolinks.com designates 207.126.144.137 as permitted sender) Received: from [207.126.144.137] (HELO eu1sys200aog114.obsmtp.com) (207.126.144.137) by apache.org (qpsmtpd/0.29) with SMTP; Sat, 16 Jun 2012 14:18:14 +0000 Received: from mail-ob0-f175.google.com ([209.85.214.175]) (using TLSv1) by eu1sys200aob114.postini.com ([207.126.147.11]) with SMTP ID DSNKT9yVkJAqYcx0Ymv3jtAqMV5IRsp50TQL@postini.com; Sat, 16 Jun 2012 14:17:53 UTC Received: by obhx4 with SMTP id x4so6326842obh.34 for ; Sat, 16 Jun 2012 07:17:51 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:x-gm-message-state; bh=H7vmLp4x+yEKeKK2v+uCwu0SXvGoDUB9riKpmmCwBas=; b=YA2f+q1vkoy3M+S6OFJo8Y5wC/PPVhccdlOUubz6fGFcaqNDQQ3nC+GOW7UQMDX2tJ A1LfUOsECacz3jGzizqvOchdV1xC0BkJc9M1UH/ai9D5enUzuhCFDC2U/71ZtRY9ZvcC u68YB2OjAkNYFDXuk0TOLMRt9utYLnHE0VNaOW/JG5fBsCZQPkPc8ozn7jbddCpDv0Uo ocVA4Fq4+0MJkLjpg5LgSA/W2b55oXTSnaoXkz4LYsORdxeS6uH6R3ss7FYa2HFIM3Cz 2wfqIdc6CieISpo1G+7iTwR38+Ftn3pMq9/melkwb0oh6C5qBeiD6onoU8RJqgWKHmlW 5pSw== MIME-Version: 1.0 Received: by 10.50.219.197 with SMTP id pq5mr4518906igc.21.1339856270809; Sat, 16 Jun 2012 07:17:50 -0700 (PDT) Received: by 10.50.151.197 with HTTP; Sat, 16 Jun 2012 07:17:50 -0700 (PDT) In-Reply-To: References: Date: Sat, 16 Jun 2012 17:17:50 +0300 Message-ID: Subject: Re: The write process in the Region Server From: Amit Sela To: user@hbase.apache.org Content-Type: multipart/alternative; boundary=14dae93405d7125b1204c2979985 X-Gm-Message-State: ALoCoQmhBZGS+vCHGzPn26yRJynm4QXB0UfTSGGqz012TTWRRtjQbP/7kfsFpYDpmNuWIlYBRKFE X-Virus-Checked: Checked by ClamAV on apache.org --14dae93405d7125b1204c2979985 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Thanks Doug, I read the regions section from the book like you recommended but I still have some questions left. When running a massive write job, the regionserver log show the memsize that is flushed. The problem is that most of the time the memsize is either much smaller then the memstore.flush.size configured (resulting in writing more files, which leads to frequent compactions) or bigger than memstore.flush.size * memstore.block.multiplier (resulting in Blocking updates for 'IPC Server handler # on ...). In some cases I also see HBaseServer throwing a ClosedChannelException: "WARN org.apache.hadoop.ipc.HBaseServer: IPC Server handler on caught: java.nio.channels.ClosedChannelException" I guess these problems are also the cause for long (few minutes) pauses and in extreme cases Full GC during the write jobs. Any ideas anyone ? In general, I did some digging and couldn't find much about the write process in HBase from a "memory usage" point of view... besides the configurations description - maybe worth adding to the book. Thank you for all your help, Amit. On Mon, Jun 11, 2012 at 3:22 PM, Doug Meil w= rote: > > Hi there- > > Your understanding is on track. > > > You probably want to read this section.. > > http://hbase.apache.org/book.html#regions.arch > > =C5=A0 as it covers those topics in more detail. > > > > > On 6/10/12 1:02 PM, "Amit Sela" wrote: > > >Hi all, > > > >I'm trying to better understand what's going on in the region server > >during > >write to HBase. > > > >As I understand the process: > > > >1. Data is written to memstore. > >2. Once the memstore has reached hbase.hregion.memstore.flush.size -> > >memstore executes flush and writes a new StoreFile. > >3. The number of StoreFiles increases until a compaction is triggered. > > > >To my understanding, the compaction is triggered after a compaction chec= k > >is done by either CheckCompaction thread running in the background or by > >the flush memstore executed. > >The compaction triggered will be a minor compaction BUT it could promote > >to > >major if it includes all store files. > >When will it NOT include all store files ? say I set compactionThreshld = to > >3, then when the 3rd (or 4th) flush is executed, a compaction wiil be > >triggered and will promote to major since it includes all store files. > > > >Is this right ? can anyone elaborate ? > > > --14dae93405d7125b1204c2979985--