drill-issues mailing list archives

From "ASF GitHub Bot (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (DRILL-4836) ZK Issue during Drillbit startup, possibly due to race condition
Date Tue, 09 Aug 2016 18:17:20 GMT

    [ https://issues.apache.org/jira/browse/DRILL-4836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15413944#comment-15413944 ]

ASF GitHub Bot commented on DRILL-4836:
---------------------------------------

GitHub user paul-rogers opened a pull request:

    https://github.com/apache/drill/pull/564

    DRILL-4836: ZK Issue during Drillbit startup, possibly due to race condition

    ZK Issue during Drillbit startup, possibly due to race condition.
    A change made in February created a race condition if two Drillbits
    attempt to create the same storage plugin node at the same time.
    Revised the code to eliminate the race condition by relying on an
    exception to detect that the node already exists.
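
The approach described above (attempt the create and treat the "node exists" exception as the signal that another Drillbit got there first) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in the pull request: `FakeNodeStore`, `NodeExistsException`, and `PutIfAbsentSketch` are hypothetical in-memory stand-ins, and no real ZooKeeper or Drill API is called.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Hypothetical stand-in for ZooKeeper's KeeperException.NodeExistsException.
class NodeExistsException extends Exception {
    NodeExistsException(String path) {
        super("NodeExists for " + path);
    }
}

// Hypothetical in-memory store that mimics an atomic "create node" operation.
// It is NOT Drill's ZookeeperClient; it only models the relevant behavior.
class FakeNodeStore {
    private final ConcurrentMap<String, byte[]> nodes = new ConcurrentHashMap<>();

    boolean exists(String path) {
        return nodes.containsKey(path);
    }

    // Atomic create: throws if the node is already present.
    void create(String path, byte[] data) throws NodeExistsException {
        if (nodes.putIfAbsent(path, data) != null) {
            throw new NodeExistsException(path);
        }
    }

    byte[] get(String path) {
        return nodes.get(path);
    }
}

class PutIfAbsentSketch {
    // Racy variant (check, then create): a second writer can create the node
    // between exists() and create(), so the exception escapes to the caller.
    static boolean racyPutIfAbsent(FakeNodeStore store, String path, byte[] data)
            throws NodeExistsException {
        if (store.exists(path)) {
            return false;
        }
        store.create(path, data);
        return true;
    }

    // Race-free variant: always attempt the atomic create and interpret the
    // exception as "someone else already created it".
    static boolean putIfAbsent(FakeNodeStore store, String path, byte[] data) {
        try {
            store.create(path, data);
            return true;   // this caller created the node
        } catch (NodeExistsException e) {
            return false;  // node already existed; not a startup failure
        }
    }
}
```

In the racy variant, two Drillbits starting in parallel can both see the node as absent, and the slower one then fails with a NodeExists error like the one in the stack trace below. In the second variant the slower one simply gets `false` back and continues.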

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/paul-rogers/drill DRILL-4836

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/drill/pull/564.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #564
    
----
commit 14d8d0e179419e86d86250eae0b7ae0254162ea7
Author: Paul Rogers <progers@maprtech.com>
Date:   2016-08-09T03:16:59Z

    DRILL-4836
    
    ZK Issue during Drillbit startup, possibly due to race condition.
    A change made in February created a race condition if two Drillbits
    attempt to create the same storage plugin node at the same time.
    Revised the code to eliminate the race condition by relying on an
    exception to detect that the node already exists.

----


> ZK Issue during Drillbit startup, possibly due to race condition
> ----------------------------------------------------------------
>
>                 Key: DRILL-4836
>                 URL: https://issues.apache.org/jira/browse/DRILL-4836
>             Project: Apache Drill
>          Issue Type: Bug
>          Components:  Server
>            Reporter: Abhishek Girish
>            Assignee: Paul Rogers
>             Fix For: 1.8.0
>
>
> During a parallel launch of Drillbits on a 4 node cluster, I hit this issue during startup:
> {code}
> Exception in thread "main" org.apache.drill.exec.exception.DrillbitStartupException: Failure during initial startup of Drillbit.
>         at org.apache.drill.exec.server.Drillbit.start(Drillbit.java:284)
>         at org.apache.drill.exec.server.Drillbit.start(Drillbit.java:261)
>         at org.apache.drill.exec.server.Drillbit.main(Drillbit.java:257)
> Caused by: org.apache.drill.common.exceptions.DrillRuntimeException: unable to put
>         at org.apache.drill.exec.coord.zk.ZookeeperClient.put(ZookeeperClient.java:196)
>         at org.apache.drill.exec.store.sys.store.ZookeeperPersistentStore.putIfAbsent(ZookeeperPersistentStore.java:94)
>         ...
>         at org.apache.drill.exec.server.Drillbit.run(Drillbit.java:113)
>         at org.apache.drill.exec.server.Drillbit.start(Drillbit.java:281)
>         ... 2 more
> Caused by: org.apache.zookeeper.KeeperException$NodeExistsException: KeeperErrorCode = NodeExists for /drill/sys.storage_plugins/dfs
>         at org.apache.drill.exec.coord.zk.ZookeeperClient.put(ZookeeperClient.java:191)
>         ... 7 more
> {code}
> And similarly,
> {code}
> Caused by: org.apache.zookeeper.KeeperException$NodeExistsException: KeeperErrorCode = NodeExists for /drill/sys.storage_plugins/kudu
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
