hadoop-common-dev mailing list archives

From "Billy Pearson (JIRA)" <j...@apache.org>
Subject [jira] Issue Comment Edited: (HADOOP-2636) [hbase] Make cache flush triggering less simplistic
Date Sun, 27 Jan 2008 04:49:34 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-2636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12562954#action_12562954 ]

viper799 edited comment on HADOOP-2636 at 1/26/08 8:49 PM:
----------------------------------------------------------------

I tried out your patch above, and it is only flushing one column for me. I have three columns
getting data, but I only see one flushing, and it's flushing back to back, over and over.

Example: this is a flush of the same column 4 times within one second:
{code}
2008-01-26 22:40:14,137 DEBUG org.apache.hadoop.hbase.HRegion: Started memcache flush for region webdata,,1201405676281 store 332212182/in_rank
2008-01-26 22:40:14,418 DEBUG org.apache.hadoop.hbase.HStore: Added 332212182/in_rank/1595847912559744983 with 70 entries, sequence id 1877650, and size 6.6k for 332212182/in_rank
2008-01-26 22:40:14,418 DEBUG org.apache.hadoop.hbase.HRegion: Finished memcache flush for store 332212182/in_rank in 281ms, sequenceid=1877650
2008-01-26 22:40:14,436 DEBUG org.apache.hadoop.hbase.HRegion: Started memcache flush for region webdata,,1201405676281 store 332212182/in_rank
2008-01-26 22:40:14,650 DEBUG org.apache.hadoop.hbase.HStore: Added 332212182/in_rank/3253290776281930479 with 6 entries, sequence id 1877667, and size 621.0 for 332212182/in_rank
2008-01-26 22:40:14,650 DEBUG org.apache.hadoop.hbase.HRegion: Finished memcache flush for store 332212182/in_rank in 214ms, sequenceid=1877667
2008-01-26 22:40:14,682 DEBUG org.apache.hadoop.hbase.HRegion: Started memcache flush for region webdata,,1201405676281 store 332212182/in_rank
2008-01-26 22:40:14,893 DEBUG org.apache.hadoop.hbase.HStore: Added 332212182/in_rank/6244850576092789885 with 5 entries, sequence id 1877683, and size 497.0 for 332212182/in_rank
2008-01-26 22:40:14,894 DEBUG org.apache.hadoop.hbase.HRegion: Finished memcache flush for store 332212182/in_rank in 212ms, sequenceid=1877683
2008-01-26 22:40:14,941 DEBUG org.apache.hadoop.hbase.HRegion: Started memcache flush for region webdata,,1201405676281 store 332212182/in_rank
{code}

> [hbase] Make cache flush triggering less simplistic
> ---------------------------------------------------
>
>                 Key: HADOOP-2636
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2636
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: contrib/hbase
>    Affects Versions: 0.16.0
>            Reporter: stack
>            Assignee: Jim Kellerman
>             Fix For: 0.17.0
>
>         Attachments: patch.txt
>
>
> When the flusher runs -- it's triggered when the sum of all Stores in a Region exceeds a configurable max size -- we flush all Stores, though a Store memcache might have but a few bytes.
> I would think Stores should only dump their memcache to disk if they have some substance.
> The problem becomes more acute the more families you have in a Region.
> Possible behaviors would be to dump the biggest Store only, or only those Stores > 50% of max memcache size.  Behavior would vary depending on the prompt that provoked the flush.  Would also log why the flush is running: optional or > max size.
> This issue comes out of HADOOP-2621.
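
The two selection behaviors proposed in the description above (flush only the biggest Store, or only Stores above 50% of the max memcache size) could be sketched roughly as follows. This is only an illustration, not the attached patch and not HBase code: the class `FlushPolicySketch`, its method names, the family names other than `in_rank`, and all sizes are assumptions made up for the example.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch (not HBase code): choose which Stores of a Region to flush. */
public class FlushPolicySketch {

    /** Returns the name of the Store with the largest memcache, or null if the map is empty. */
    static String biggestStore(Map<String, Long> memcacheBytes) {
        String biggest = null;
        long max = -1L;
        for (Map.Entry<String, Long> e : memcacheBytes.entrySet()) {
            if (e.getValue() > max) {
                max = e.getValue();
                biggest = e.getKey();
            }
        }
        return biggest;
    }

    /** Returns the Stores whose memcache is larger than fraction * maxMemcacheBytes. */
    static List<String> storesOverFraction(Map<String, Long> memcacheBytes,
                                           long maxMemcacheBytes,
                                           double fraction) {
        double threshold = maxMemcacheBytes * fraction;
        List<String> selected = new ArrayList<>();
        for (Map.Entry<String, Long> e : memcacheBytes.entrySet()) {
            if (e.getValue() > threshold) {
                selected.add(e.getKey());
            }
        }
        return selected;
    }

    public static void main(String[] args) {
        // Made-up per-Store memcache sizes in bytes; only "in_rank" appears in the logs above.
        Map<String, Long> sizes = new LinkedHashMap<>();
        sizes.put("in_rank", 6_000L);
        sizes.put("family_b", 40L * 1024 * 1024);
        sizes.put("family_c", 20L * 1024 * 1024);
        long maxMemcacheBytes = 64L * 1024 * 1024; // assumed configurable max size

        System.out.println("Biggest Store: " + biggestStore(sizes));
        System.out.println("Stores over 50% of max: "
                + storesOverFraction(sizes, maxMemcacheBytes, 0.5));
    }
}
```

Under either rule in this sketch, a Store holding only a few kilobytes (like the 6.6k, 621-byte and 497-byte flushes reported in the comment) would not be selected while a much larger Store exists in the same Region.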

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

