hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ratheesh Kamoor (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-14925) MSCK repair table hang while running with multi threading enabled
Date Wed, 12 Oct 2016 20:10:20 GMT

    [ https://issues.apache.org/jira/browse/HIVE-14925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15569746#comment-15569746
] 

Ratheesh Kamoor commented on HIVE-14925:
----------------------------------------

[~pxiong] I moved the logic in inline callable to an external class so that code can be reused
in with multi-threaded and non-multi threaded scenario. Also, it will fix the issues of thread
lock. Could you please review. Tested with very large partitions (5K+) we have and worked
fine. 

> MSCK repair table hang while running with multi threading enabled
> -----------------------------------------------------------------
>
>                 Key: HIVE-14925
>                 URL: https://issues.apache.org/jira/browse/HIVE-14925
>             Project: Hive
>          Issue Type: Bug
>          Components: CLI
>    Affects Versions: 2.2.0
>            Reporter: Ratheesh Kamoor
>            Assignee: Pengcheng Xiong
>            Priority: Critical
>             Fix For: 2.2.0
>
>         Attachments: HIVE-14925.patch
>
>
> MSCK REPAIR TABLE hanging while running with multi-threading enabled (default). I think
it is because of a major design flaw in how thread pool implemented in HiveMetaSoreChecker
class / checkPartitionDirs method. This method has a thread pool which register Callable but
callable makes a recursive call to checkPartitionDirs method again. This code will hang when
number of directories is more than thread pool size. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message