hbase-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From bijieshan <bijies...@huawei.com>
Subject Re: Closed parent region present in Hlog.lastSeqWritten
Date Wed, 25 Jan 2012 06:44:24 GMT
Ted:

Below is the search info for "20312224" "20312223" "20312225". We also analyzed this but got
nothing:(

[20312224]
00:31:42,529 DEBUG org.apache.hadoop.hbase.regionserver.Store: loaded hdfs://192.168.1.103:9000/hbase/Htable_UFDR_031/52ff3c7c6df6e0337876bbca29cee84a/value/973789709483406123.2acaf8e3acfd2e8a5825a1f6f0aca4a8,
isReference=true, isBulkLoadResult=false, seqid=20312224, majorCompaction=false
00:31:42,541 INFO org.apache.hadoop.hbase.regionserver.HRegion: Onlined Htable_UFDR_031,00332,1325809872607.259d0c620c9105928e52713f4a5a252e.;
next sequenceid=20312224
00:31:42,595 INFO org.apache.hadoop.hbase.regionserver.Store: Started compaction of 10 file(s)
in cf=value, hasReferences=true, into hdfs://192.168.1.103:9000/hbase/Htable_UFDR_031/52ff3c7c6df6e0337876bbca29cee84a/.tmp,
seqid=20312224, totalSize=1.6g
00:34:48,061 DEBUG org.apache.hadoop.hbase.regionserver.wal.HLog: Found 1 hlogs to remove
out of total 4; oldest outstanding sequenceid is 20312224 from region 2acaf8e3acfd2e8a5825a1f6f0aca4a8
01:39:14,494 DEBUG org.apache.hadoop.hbase.regionserver.Store: loaded hdfs://192.168.1.103:9000/hbase/Htable_UFDR_031/52ff3c7c6df6e0337876bbca29cee84a/value/1634833527842694834,
isReference=false, isBulkLoadResult=false, seqid=20312224, majorCompaction=false
01:55:15,293 DEBUG org.apache.hadoop.hbase.regionserver.Store: loaded hdfs://192.168.1.103:9000/hbase/Htable_UFDR_031/52ff3c7c6df6e0337876bbca29cee84a/value/1634833527842694834,
isReference=false, isBulkLoadResult=false, seqid=20312224, majorCompaction=false

[20312223]
00:30:59,614 INFO org.apache.hadoop.hbase.regionserver.Store: Added hdfs://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/value/973789709483406123,
entries=7537, sequenceid=20312223, memsize=4.2m, filesize=2.9m
00:30:59,787 INFO org.apache.hadoop.hbase.regionserver.HRegion: Finished memstore flush of
~133.5m for region Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8. in
21816ms, sequenceid=20312223, compaction requested=true
00:31:42,532 DEBUG org.apache.hadoop.hbase.regionserver.Store: loaded hdfs://192.168.1.103:9000/hbase/Htable_UFDR_031/259d0c620c9105928e52713f4a5a252e/value/973789709483406123.2acaf8e3acfd2e8a5825a1f6f0aca4a8,
isReference=true, isBulkLoadResult=false, seqid=20312223, majorCompaction=false

[20312225]	
00:31:42,531 INFO org.apache.hadoop.hbase.regionserver.HRegion: Onlined Htable_UFDR_031,003732800093168-03594291912,1325809872607.52ff3c7c6df6e0337876bbca29cee84a.;
next sequenceid=20312225	
	

-----邮件原件-----
发件人: Ted Yu [mailto:yuzhihong@gmail.com] 
发送时间: 2012年1月25日 14:33
收件人: dev@hbase.apache.org
抄送: user@hbase.apache.org; Chenjian
主题: Re: Closed parent region present in Hlog.lastSeqWritten

Jieshan:
Can you search the region server log for seq id: 20312224 ?

Thanks

2012/1/24 Ramkrishna.S.Vasudevan <ramkrishna.vasudevan@huawei.com>

> HBASE-5225 was merged for different purpose.  But this issue does not seem
> to be because of HBASE-5225.
> Because in HBASE-5225 we tried to fix the case where the seq id is missed.
> Here it is not missed it is still there in lastSeqWritten without getting
> flushed.
>
> Regards
> Ram
>
> -----Original Message-----
> From: bijieshan [mailto:bijieshan@huawei.com]
> Sent: Wednesday, January 25, 2012 10:59 AM
> To: user@hbase.apache.org; dev@hbase.apache.org
> Cc: Chenjian
> Subject: Re: Closed parent region present in Hlog.lastSeqWritten
>
> Not including HBASE-5225(before 0.90.6.rc0). I think no relationship with
> this.
> Thanks.
> Jieshan
>
> -----邮件原件-----
> 发件人: Ted Yu [mailto:yuzhihong@gmail.com]
> 发送时间: 2012年1月25日 13:18
> 收件人: dev@hbase.apache.org
> 抄送: user@hbase.apache.org; Chenjian
> 主题: Re: Closed parent region present in Hlog.lastSeqWritten
>
> Can you provide more detail on this 0.90.5 +, please ?
>
> Did it include HBASE-5225 ?
>
> Thanks
>
> On Tue, Jan 24, 2012 at 8:55 PM, bijieshan <bijieshan@huawei.com> wrote:
>
> > Hi all,
> > We found so many hlogs in our cluster, after some analysis, we also found
> > one splitted region occurred in HLog.lastSeqWritten. For this region had
> > been closed, it can't be flushed again. So blocking all the other logs
> > removing to ".oldlogs" directory.
> >
> > 05:06:44,422 INFO org.apache.hadoop.hbase.regionserver.wal.HLog: Too many
> > hlogs: logs=122, maxlogs=32; forcing flush of 1 regions(s):
> > 2acaf8e3acfd2e8a5825a1f6f0aca4a8
> > 05:06:44,422 WARN org.apache.hadoop.hbase.regionserver.LogRoller: Failed
> > to schedule flush of 2acaf8e3acfd2e8a5825a1f6f0aca4a8r=null,
> requester=null
> > 05:10:48,666 INFO org.apache.hadoop.hbase.regionserver.wal.HLog: Too many
> > hlogs: logs=123, maxlogs=32; forcing flush of 1 regions(s):
> > 2acaf8e3acfd2e8a5825a1f6f0aca4a8
> > 05:10:48,666 WARN org.apache.hadoop.hbase.regionserver.LogRoller: Failed
> > to schedule flush of 2acaf8e3acfd2e8a5825a1f6f0aca4a8r=null,
> requester=null
> > 05:14:46,075 INFO org.apache.hadoop.hbase.regionserver.wal.HLog: Too many
> > hlogs: logs=124, maxlogs=32; forcing flush of 1 regions(s):
> > 2acaf8e3acfd2e8a5825a1f6f0aca4a8
> > 05:14:46,075 WARN org.apache.hadoop.hbase.regionserver.LogRoller: Failed
> > to schedule flush of 2acaf8e3acfd2e8a5825a1f6f0aca4a8r=null,
> requester=null
> > 05:15:41,584 INFO org.apache.hadoop.hbase.regionserver.wal.HLog: Too many
> > hlogs: logs=125, maxlogs=32; forcing flush of 1 regions(s):
> > 2acaf8e3acfd2e8a5825a1f6f0aca4a8
> > 05:15:41,584 WARN org.apache.hadoop.hbase.regionserver.LogRoller: Failed
> > to schedule flush of 2acaf8e3acfd2e8a5825a1f6f0aca4a8r=null,
> requester=null
> >
> > Let's see what happened to the region of
> > "2acaf8e3acfd2e8a5825a1f6f0aca4a8r":
> >
> > 00:30:49,242 INFO org.apache.hadoop.hbase.regionserver.HRegion: Finished
> > memstore flush of ~129.5m for region
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8. in
> > 13299ms, sequenceid=20311822, compaction requested=true
> > 00:30:49,242 DEBUG
> > org.apache.hadoop.hbase.regionserver.CompactSplitThread: Compaction
> > requested for
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8.
> > because User-triggered split; priority=1, compaction queue size=5840
> > 00:30:55,214 INFO org.apache.hadoop.hbase.regionserver.Store: Renaming
> > flushed file at hdfs://
> >
>
> 192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/.t
> mp/1755862026714756815to<http://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/.t%0Amp/1755862026714756815to>hdfs://
> >
>
> 192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/va
> lue/973789709483406123<http://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/va%0Alue/973789709483406123>
> >
> 00:30:55,214<
> http://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8
> a5825a1f6f0aca4a8/value/973789709483406123%0A00:30:55,214<http://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8%0Aa5825a1f6f0aca4a8/value/973789709483406123%0A00:30:55,214>
> >INFO
> org.apache.hadoop.hbase.regionserver.Store: Renaming flushed file at
> > hdfs://
> >
>
> 192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/.t
> mp/1755862026714756815to<http://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/.t%0Amp/1755862026714756815to>hdfs://
> >
>
> 192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/va
> lue/973789709483406123<http://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/va%0Alue/973789709483406123>
> >
> 00:30:59,614<
> http://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8
> a5825a1f6f0aca4a8/value/973789709483406123%0A00:30:59,614<http://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8%0Aa5825a1f6f0aca4a8/value/973789709483406123%0A00:30:59,614>
> >INFO
> org.apache.hadoop.hbase.regionserver.Store: Added hdfs://
> >
>
> 192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/va
> lue/973789709483406123<http://192.168.1.103:9000/hbase/Htable_UFDR_031/2acaf8e3acfd2e8a5825a1f6f0aca4a8/va%0Alue/973789709483406123>
> ,
> > entries=7537, sequenceid=20312223, memsize=4.2m, filesize=2.9m
> > 00:30:59,787 INFO org.apache.hadoop.hbase.regionserver.HRegion: Finished
> > memstore flush of ~133.5m for region
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8. in
> > 21816ms, sequenceid=20312223, compaction requested=true
> > 00:30:59,787 DEBUG
> > org.apache.hadoop.hbase.regionserver.CompactSplitThread: Compaction
> > requested for
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8.
> > because regionserver20020.cacheFlusher; priority=0, compaction queue
> > size=5840
> > 00:31:12,605 INFO org.apache.hadoop.hbase.regionserver.HRegion: Starting
> > compaction on region
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8.
> > 00:31:12,607 INFO org.apache.hadoop.hbase.regionserver.HRegion: completed
> > compaction on region
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8.
> after
> > 0sec
> > 00:31:12,607 INFO org.apache.hadoop.hbase.regionserver.SplitTransaction:
> > Starting split of region
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8.
> > 00:31:13,694 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: Closing
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8.:
> > disabling compactions & flushes
> > 00:31:13,694 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: Updates
> > disabled for region
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8.
> > 00:31:13,718 INFO org.apache.hadoop.hbase.regionserver.HRegion: Closed
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8.
> > 00:31:39,552 INFO org.apache.hadoop.hbase.catalog.MetaEditor: Offlined
> > parent region
> > Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8. in
> > META
> > ------
> > 00:31:42,529 DEBUG org.apache.hadoop.hbase.regionserver.Store: loaded
> > hdfs://
> >
>
> 192.168.1.103:9000/hbase/Htable_UFDR_031/52ff3c7c6df6e0337876bbca29cee84a/va
> lue/973789709483406123.2acaf8e3acfd2e8a5825a1f6f0aca4a8<http://192.168.1.103:9000/hbase/Htable_UFDR_031/52ff3c7c6df6e0337876bbca29cee84a/va%0Alue/973789709483406123.2acaf8e3acfd2e8a5825a1f6f0aca4a8>
> ,
> > isReference=true, isBulkLoadResult=false, seqid=20312224,
> > majorCompaction=false
> > 00:31:42,532 DEBUG org.apache.hadoop.hbase.regionserver.Store: loaded
> > hdfs://
> >
>
> 192.168.1.103:9000/hbase/Htable_UFDR_031/259d0c620c9105928e52713f4a5a252e/va
> lue/973789709483406123.2acaf8e3acfd2e8a5825a1f6f0aca4a8<http://192.168.1.103:9000/hbase/Htable_UFDR_031/259d0c620c9105928e52713f4a5a252e/va%0Alue/973789709483406123.2acaf8e3acfd2e8a5825a1f6f0aca4a8>
> ,
> > isReference=true, isBulkLoadResult=false, seqid=20312223,
> > majorCompaction=false
> > 00:31:42,575 INFO
> org.apache.hadoop.hbase.regionserver.CompactSplitThread:
> > Region split, META updated, and report to master.
> >
>
> Parent=Htable_UFDR_031,00332,1325808823997.2acaf8e3acfd2e8a5825a1f6f0aca4a8.
> ,
> > new regions:
> > Htable_UFDR_031,00332,1325809872607.259d0c620c9105928e52713f4a5a252e.,
> >
>
> Htable_UFDR_031,003732800093168-03594291912,1325809872607.52ff3c7c6df6e03378
> 76bbca29cee84a..
> > Split took 29sec
> > // At this time, found this region again in HLog#lastSeqWritten.
> > 00:34:48,061 DEBUG org.apache.hadoop.hbase.regionserver.wal.HLog: Found 1
> > hlogs to remove out of total 4; oldest outstanding sequenceid is 20312224
> > from region 2acaf8e3acfd2e8a5825a1f6f0aca4a8
> >
> > Our HBase version is "0.90.5+" when we found this issue.
> >
> > So after splitting and closing, how this region come into
> > HLog.lastSeqWritten?
> >
> > We analyze all the related code, it seems impossible. Because after a
> > region be closed, it has no chance to add this region into
> > HLog.lastSeqWritten again.
> >
> > So I want to get some insight here from someone who is intimately
> familiar
> > with this or has encountered the similar problem.
> >
> > Thanks and Regards,
> > Jieshan
> >
>
>
Mime
View raw message