Return-Path: X-Original-To: apmail-hbase-user-archive@www.apache.org Delivered-To: apmail-hbase-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 61DB51868E for ; Wed, 6 Jan 2016 10:28:11 +0000 (UTC) Received: (qmail 32686 invoked by uid 500); 6 Jan 2016 10:28:09 -0000 Delivered-To: apmail-hbase-user-archive@hbase.apache.org Received: (qmail 32617 invoked by uid 500); 6 Jan 2016 10:28:09 -0000 Mailing-List: contact user-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hbase.apache.org Delivered-To: mailing list user@hbase.apache.org Received: (qmail 32595 invoked by uid 99); 6 Jan 2016 10:28:09 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 06 Jan 2016 10:28:09 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id A620E1A0902 for ; Wed, 6 Jan 2016 10:28:08 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 3.879 X-Spam-Level: *** X-Spam-Status: No, score=3.879 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_REPLY=1, HTML_MESSAGE=3, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id gT_u4AtfWMtq for ; Wed, 6 Jan 2016 10:28:06 +0000 (UTC) Received: from mail-yk0-f173.google.com (mail-yk0-f173.google.com [209.85.160.173]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTPS id 124C2429C4 for ; Wed, 6 Jan 2016 10:28:06 +0000 (UTC) Received: by mail-yk0-f173.google.com with SMTP id x67so311019760ykd.2 for ; Wed, 06 Jan 2016 02:28:06 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=bYKsFTwoHLDuJ1Mg25zG1rlZ/wn1fQFxbb2oZgoz87I=; b=HpdfE88GBSZ9Hj89Q0c6e3jrCCzFeTRrbV6uKyFqRTFBXe4GYmK+in1DcEclieWEO7 SkD1V2dMQXkSHtLyk8FDV9BlzezN2aLj4KFvxN6x+iXFCk1PWXzC5n3Qzqt5mEOqFzSv L9MlZB5UbG3HaVkB1hue41s8lgtTKO62HvpKRZAQZwxNKDLBmSNgdYHMG3wRR6dTqoav 9alMRm5Ul8duNAwoIV4aSVLQdZ9XF4wYOWz3PWRGqDBZaR0tWSlAb+kFTCxQ12Leht0/ zy31ifPR8VWKQyYMewZzxSSHeXjkqlvk468qG5DWXxnSBnAwQ/WTQqkBodPbAjjSE1me fp+Q== MIME-Version: 1.0 X-Received: by 10.13.218.68 with SMTP id c65mr73716239ywe.315.1452076085684; Wed, 06 Jan 2016 02:28:05 -0800 (PST) Received: by 10.37.207.5 with HTTP; Wed, 6 Jan 2016 02:28:05 -0800 (PST) In-Reply-To: References: Date: Wed, 6 Jan 2016 02:28:05 -0800 Message-ID: Subject: Re: When compactions become major ones From: Ted Yu To: "user@hbase.apache.org" Content-Type: multipart/alternative; boundary=94eb2c07eaa046301a0528a7cf32 --94eb2c07eaa046301a0528a7cf32 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable This parameter is not listed in hbase-default.xml >From hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/wal/FS= HLog.java (of branch-1.1): this.maxLogs =3D conf.getInt("hbase.regionserver.maxlogs", 32); You don't need to adjust its value given the information below. FYI On Wed, Jan 6, 2016 at 2:22 AM, Mehdi Ben Haj Abbes wrote: > Hi, > I couldn't find the hbase.regionserver.maxlogs property in > hbase-default.xml neither in the 0.98 nor in the 0.94 nor the 1.XX > documentation. So how could I know for sure the default value for this > property and how could I change is ? is it a property to put in the > hbase-site.xml ? > > And if the default for this property is 32 I'm good according to the guid= e > you posted Ted. > I have my XMX set to 6144 MB and my upper limit to 0.4 and the DFS blocks > are set to 128MB. > So I'm good with 6144 * 0.4 < 128 * 32 * 095 > > Best regards, > Mehdi > > On Tue, Jan 5, 2016 at 9:36 PM, Mehdi Ben Haj Abbes > > wrote: > > > Thanks guys for your feedbacks. I will check the WAL property and make > > other tests and let u know. > > Best regards, > > Mehdi > > Le 5 janv. 2016 9:09 PM, "Vladimir Rodionov" a > > =C3=A9crit : > > > >> >>And I still dont understand how the store files resulting after > memstore > >> >>flushes are having a size of 40MB. Does it hove smth to do with > memstore > >> >>upper limit and these 42MB are the result of forcing the memstore to > be > >> >>flushed? The problem is that all the newly store files added to HDFS > are > >> >>starting with this size (42MB) I did not mention that my CF is > >> in-memory. > >> > >> Its due to Java object overhead, so 3x is normal (128MB in memory -> > 42MB > >> on disk) > >> Another aspect to take into account: flush can happen not only when we > >> reach memstore size limit, > >> there are other triggers as well: > >> > >> 1. maximum WAL files reached (hbase.regionserver.maxlogs) > >> 2. periodic memstore flusher (once an 1h) can trigger flushes a s well > >> > >> -Vlad > >> > >> > >> > >> On Tue, Jan 5, 2016 at 9:37 AM, Ted Yu wrote: > >> > >> > For #1, > >> > bq. would this minor which becomes major take care of deleted rows > >> > > >> > Yes. > >> > > >> > For #2, please consider the following guide: > >> > > >> > dfs.blocksize (value: ${propdata["dfs.blocksize"]}) * 0.95 * > >> > hbase.regionserver.maxlogs (value: > >> > ${propdata["hbase.regionserver.maxlogs"]}) should be greater than > >> > hbase.regionserver.global.memstore.upperLimit * HBASE_HEAPSIZE (the > >> value > >> > for -Xmx) > >> > > >> > Cheers > >> > > >> > On Tue, Jan 5, 2016 at 8:39 AM, Mehdi Ben Haj Abbes < > >> mehdi.abbes@gmail.com > >> > > > >> > wrote: > >> > > >> > > Thanks Ted for the clarification about the major compactions. So i= f > I > >> did > >> > > understand well when a minor compaction is triggered and the polic= y > >> > selects > >> > > all the store files, this compaction becomes a major one. But woul= d > >> this > >> > > minor which becomes major take care of deleted rows as a major one > >> would > >> > do > >> > > or at the end it is just a minor that happened and selected all th= e > >> store > >> > > files ? > >> > > > >> > > About disabling splitting I have already hbase.hregion.max.filesiz= e > >> set > >> > to > >> > > 10GB besides I pre splitted my table. > >> > > > >> > > And I still dont understand how the store files resulting after > >> memstore > >> > > flushes are having a size of 40MB. Does it hove smth to do with > >> memstore > >> > > upper limit and these 42MB are the result of forcing the memstore = to > >> be > >> > > flushed? The problem is that all the newly store files added to HD= FS > >> are > >> > > starting with this size (42MB) I did not mention that my CF is > >> in-memory. > >> > > > >> > > Best regards, > >> > > > >> > > On Tue, Jan 5, 2016 at 4:04 PM, Ted Yu wrote= : > >> > > > >> > > > For #1, when all store files are selected for compaction, the > >> > compaction > >> > > > becomes major > >> > > > > >> > > > see 'Determine the Optimal Number of Pre-Split Regions' under: > >> > > > http://hbase.apache.org/book.html#disable.splitting > >> > > > > >> > > > See also http://hbase.apache.org/book.html#managed.compactions > >> > > > > >> > > > Cheers > >> > > > > >> > > > On Tue, Jan 5, 2016 at 6:52 AM, Mehdi Ben Haj Abbes < > >> > > mehdi.abbes@gmail.com > >> > > > > > >> > > > wrote: > >> > > > > >> > > > > Hi folks, > >> > > > > > >> > > > > I'm using hbase 0.98. I have heavy writes workload. I'm writin= g > to > >> > one > >> > > > > table with one CF compressed with GZ. My table is pre splitted > to > >> 27 > >> > > > > regions. As I start writing to this table I start seeing HFile= s > of > >> > the > >> > > > size > >> > > > > of 2-4 MB across the regions. I have the default hbase > >> configuration > >> > > for > >> > > > > compaction properties. The compactions start as soon as I star= t > >> > writing > >> > > > to > >> > > > > HBase but many of these compaction are major ones. I can see > this > >> > > through > >> > > > > HBase master UI on the table details view. So I wanted to > >> understand > >> > > > when a > >> > > > > compaction becomes major. > >> > > > > > >> > > > > Another question, If I'm not wrong we have a memstore per > region, > >> so > >> > > > when a > >> > > > > memstore is flushed I will have a HFile with 128MB but I only > see > >> > files > >> > > > > with 42MB (without compression and 2.5MB when compressed with > GZ). > >> > > > > > >> > > > > Any explanation ? > >> > > > > > >> > > > > Thanks in advance. > >> > > > > -- > >> > > > > Mehdi BEN HAJ ABBES > >> > > > > > >> > > > > >> > > > >> > > > >> > > > >> > > -- > >> > > Mehdi BEN HAJ ABBES > >> > > > >> > > >> > > > > > -- > Mehdi BEN HAJ ABBES > --94eb2c07eaa046301a0528a7cf32--