Return-Path: X-Original-To: apmail-hbase-issues-archive@www.apache.org Delivered-To: apmail-hbase-issues-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id ACD98D0AE for ; Thu, 16 Aug 2012 16:41:38 +0000 (UTC) Received: (qmail 13850 invoked by uid 500); 16 Aug 2012 16:41:38 -0000 Delivered-To: apmail-hbase-issues-archive@hbase.apache.org Received: (qmail 13767 invoked by uid 500); 16 Aug 2012 16:41:38 -0000 Mailing-List: contact issues-help@hbase.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Delivered-To: mailing list issues@hbase.apache.org Received: (qmail 13756 invoked by uid 99); 16 Aug 2012 16:41:38 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 16 Aug 2012 16:41:38 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 0553E2C5BE2 for ; Thu, 16 Aug 2012 16:41:38 +0000 (UTC) Date: Fri, 17 Aug 2012 03:41:38 +1100 (NCT) From: "Andrew Purtell (JIRA)" To: issues@hbase.apache.org Message-ID: <648597649.19490.1345135298022.JavaMail.jiratomcat@arcas> In-Reply-To: <1334641237.10070.1344981878613.JavaMail.jiratomcat@arcas> Subject: [jira] [Commented] (HBASE-6586) Quarantine Corrupted HFiles MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HBASE-6586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13436081#comment-13436081 ] Andrew Purtell commented on HBASE-6586: --------------------------------------- [~jmhsieh] Do you think it reasonable and worth it to add a HFile corruption detect-and-quarantine flag and mode to hbck? > Quarantine Corrupted HFiles > --------------------------- > > Key: HBASE-6586 > URL: https://issues.apache.org/jira/browse/HBASE-6586 > Project: HBase > Issue Type: Bug > Reporter: Jonathan Hsieh > Assignee: Jonathan Hsieh > > We've encountered a few upgrades from 0.90 hbases + 20.2/1.x hdfs to 0.92 hbases + hdfs 2.x that get stuck. I haven't been able to duplicate the problem in my dev environment but we suspect this may be related to HDFS-3731. On the HBase side, it seems reasonable to quarantine what are most likely truncated hfiles, so that can could later be recovered. > Here's an example of the exception we've encountered: > {code} > 2012-07-18 05:55:01,152 ERROR handler.OpenRegionHandler (OpenRegionHandler.java:openRegion(346)) - Failed open of region=user_mappings,080112102AA76EF98197605D341B9E6C5824D2BC|1001,1317824890618.eaed0e7abc6d27d28ff0e5a9b49c4c > 0d. > java.io.IOException: java.lang.IllegalArgumentException: Invalid HFile version: 842220600 (expected to be between 1 and 2) > at org.apache.hadoop.hbase.io.hfile.FixedFileTrailer.readFromStream(FixedFileTrailer.java:306) > at org.apache.hadoop.hbase.io.hfile.HFile.pickReaderVersion(HFile.java:371) > at org.apache.hadoop.hbase.io.hfile.HFile.createReader(HFile.java:387) > at org.apache.hadoop.hbase.regionserver.StoreFile$Reader.(StoreFile.java:1026) > at org.apache.hadoop.hbase.regionserver.StoreFile.open(StoreFile.java:485) > at org.apache.hadoop.hbase.regionserver.StoreFile.createReader(StoreFile.java:566) > at org.apache.hadoop.hbase.regionserver.Store.loadStoreFiles(Store.java:286) > at org.apache.hadoop.hbase.regionserver.Store.(Store.java:223) > at org.apache.hadoop.hbase.regionserver.HRegion.instantiateHStore(HRegion.java:2534) > at org.apache.hadoop.hbase.regionserver.HRegion.initialize(HRegion.java:454) > at org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:3282) > at org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:3230) > at org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler.openRegion(OpenRegionHandler.java:331) > at org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler.process(OpenRegionHandler.java:107) > at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:169) > at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) > at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) > at java.lang.Thread.run(Thread.java:619) > Caused by: java.lang.IllegalArgumentException: Invalid HFile version: 842220600 (expected to be between 1 and 2) > at org.apache.hadoop.hbase.io.hfile.HFile.checkFormatVersion(HFile.java:515) > at org.apache.hadoop.hbase.io.hfile.FixedFileTrailer.readFromStream(FixedFileTrailer.java:303) > ... 17 more > {code} > Specifically -- the FixedFileTrailer are incorrect, and seemingly missing. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira