Return-Path: Delivered-To: apmail-hadoop-hbase-dev-archive@minotaur.apache.org Received: (qmail 25392 invoked from network); 16 Feb 2010 20:01:52 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 16 Feb 2010 20:01:52 -0000 Received: (qmail 51542 invoked by uid 500); 16 Feb 2010 20:01:52 -0000 Delivered-To: apmail-hadoop-hbase-dev-archive@hadoop.apache.org Received: (qmail 51492 invoked by uid 500); 16 Feb 2010 20:01:52 -0000 Mailing-List: contact hbase-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hbase-dev@hadoop.apache.org Delivered-To: mailing list hbase-dev@hadoop.apache.org Received: (qmail 51482 invoked by uid 99); 16 Feb 2010 20:01:51 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Feb 2010 20:01:51 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Feb 2010 20:01:49 +0000 Received: from brutus.apache.org (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id B0177234C4AD for ; Tue, 16 Feb 2010 12:01:28 -0800 (PST) Message-ID: <1062328381.313691266350488720.JavaMail.jira@brutus.apache.org> Date: Tue, 16 Feb 2010 20:01:28 +0000 (UTC) From: "stack (JIRA)" To: hbase-dev@hadoop.apache.org Subject: [jira] Commented: (HBASE-2070) Collect HLogs and delete them after a period of time In-Reply-To: <236629781.1261604549496.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HBASE-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12834419#action_12834419 ] stack commented on HBASE-2070: ------------------------------ .bq this seems like a good first cut, but we should probably be tracking logfiles in ZK HT does this. Makes some kinda sense in that who owns them is clear in case of HRS failure. Master could take ownership before starts splitting them. We should make a new issue for this (Hopefully we won't have as many wals going forward with working flush). Yeah too to the logs shouldn't be cleared if replication is down. Can we put up a gate in zk? On the patch: Can we change this? {code} static final String HREGION_OLDLOGFILE_NAME = "oldlogfile.log"; {code} It must be dumbest name ever given a file since the epoch began? (We should do that in another patch.....another issue). Want to make a regex to verify expected file name rather than: {code} + String[] parts = filePath.getName().split("\\."); {code} Do you have to put a timestamp on it? Doesn't HDFS tell you its last-modified time? (There may be caveats to this but IIRC, for something this basic should be fine). Otherwise patch looks good to me. > Collect HLogs and delete them after a period of time > ---------------------------------------------------- > > Key: HBASE-2070 > URL: https://issues.apache.org/jira/browse/HBASE-2070 > Project: Hadoop HBase > Issue Type: Sub-task > Reporter: Jean-Daniel Cryans > Assignee: Jean-Daniel Cryans > Fix For: 0.21.0 > > Attachments: HBASE-2070-v2.patch, HBASE-2070-v3.patch, HBASE-2070-v4.patch, HBASE-2070.patch > > > For replication we need to be able to service clusters that are a few hours behind in edits. For example, after distcp'ing a snapshot of the DB to another cluster, we need to make sure we get the edits that came in after the snapshot was taken. > I plan the following changes: > - Instead of deleting HLogs during a log roll or after a log split, move them to another folder where all logs should be aggregated. > - Add a new configuration for how old a log can be. For a normal cluster I think of a default of 2 hours. For replication you may want to set it much higher. > - Create a new thread in the master that checks for logs older than configured time and that deletes them. > I also fancy having the deletion time to be configurable while the cluster is running. I'm also thinking of adding a way to tell the cluster to replay edits on itself. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.