hbase-issues mailing list archives

From "Hudson (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HBASE-13985) Add configuration to skip validating HFile format when bulk loading
Date Thu, 13 Aug 2015 06:58:46 GMT

    [ https://issues.apache.org/jira/browse/HBASE-13985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14694814#comment-14694814 ]

Hudson commented on HBASE-13985:
--------------------------------

SUCCESS: Integrated in HBase-1.3-IT #87 (See [https://builds.apache.org/job/HBase-1.3-IT/87/])
HBASE-13985 Add configuration to skip validating HFile format when bulk loading (Victor Xu)
(apurtell: rev ca19f961a25dce5359bfb9b35c0bbbd64ec0fb0b)
* hbase-server/src/main/java/org/apache/hadoop/hbase/mapreduce/LoadIncrementalHFiles.java


> Add configuration to skip validating HFile format when bulk loading
> -------------------------------------------------------------------
>
>                 Key: HBASE-13985
>                 URL: https://issues.apache.org/jira/browse/HBASE-13985
>             Project: HBase
>          Issue Type: Improvement
>    Affects Versions: 0.98.13
>            Reporter: Victor Xu
>            Assignee: Victor Xu
>            Priority: Minor
>              Labels: regionserver
>             Fix For: 2.0.0, 0.98.14, 1.2.0, 1.3.0
>
>         Attachments: HBASE-13985-v2.patch, HBASE-13985-v3.patch, HBASE-13985.patch
>
>
> When bulk loading millions of HFiles into one HTable, checking the HFile format is the most
> time-consuming phase. A parallel mechanism might speed this up, but with millions of
> HFiles it may still take dozens of minutes. So I think it's necessary to add an option
> that lets advanced users bulk load without checking the HFile format at all.
> Of course, the default value of this option should be true, so that validation stays
> enabled unless it is explicitly turned off.
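
For readers skimming the thread, the idea in the description can be sketched roughly as follows. This is only an illustration in plain Java, not the code from the committed patch: the class name, the configuration key `example.bulkload.validate.hfile`, and the stand-in format check are all hypothetical, and dropping files that fail the check is an assumption of the sketch. It shows a boolean flag that defaults to true and, when set to false, skips the per-file format check entirely.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;
import java.util.function.Predicate;

public class BulkLoadValidationSketch {
    // Hypothetical key, for illustration only; not taken from the patch.
    static final String VALIDATE_KEY = "example.bulkload.validate.hfile";

    // Reads the flag; validation stays on unless it is explicitly disabled.
    static boolean shouldValidate(Properties conf) {
        return Boolean.parseBoolean(conf.getProperty(VALIDATE_KEY, "true"));
    }

    // Returns the files to load. With validation on, each file is run through
    // the (potentially expensive) format check and failures are dropped.
    // With validation off, the check is never invoked.
    static List<String> selectFiles(List<String> candidates, Properties conf,
                                    Predicate<String> formatCheck) {
        boolean validate = shouldValidate(conf);
        List<String> accepted = new ArrayList<>();
        for (String file : candidates) {
            if (!validate || formatCheck.test(file)) {
                accepted.add(file);
            }
        }
        return accepted;
    }

    public static void main(String[] args) {
        List<String> files = List.of("a.hfile", "b.tmp", "c.hfile");
        // Stand-in for a real format check, which would open and inspect the file.
        Predicate<String> check = name -> name.endsWith(".hfile");

        Properties defaults = new Properties();
        System.out.println(selectFiles(files, defaults, check));

        Properties skip = new Properties();
        skip.setProperty(VALIDATE_KEY, "false");
        System.out.println(selectFiles(files, skip, check));
    }
}
```

With the flag left at its default, only the files that pass the check are kept; with it set to false, every candidate is accepted without being inspected, which is where the time saving for millions of files would come from.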



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
