hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Prasanth Jayachandran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-16927) LLAP: Slider takes down all daemons when some daemons fail repeatedly
Date Fri, 29 Sep 2017 17:39:00 GMT

    [ https://issues.apache.org/jira/browse/HIVE-16927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16186128#comment-16186128
] 

Prasanth Jayachandran commented on HIVE-16927:
----------------------------------------------

The defaults would generate the following resources.json file
{code:title=defaults}
{
  "schema" : "http://example.org/specification/v2.0.0",
  "metadata" : {
  },
  "global" : {
    "yarn.log.include.patterns": ".*\\.done"
  },
  "components": {
    "slider-appmaster": {
      "yarn.memory": "1024",
      "yarn.component.instances": "1"
    },
    "LLAP": {
      "yarn.role.priority": "1",
      "yarn.component.instances": "3",
      "yarn.resource.normalization.enabled": "false",
      "yarn.memory": "174080",
      "yarn.component.placement.policy" : "4",
      "yarn.container.health.threshold.percent": "80",
      "yarn.container.health.threshold.window.secs": "300",
      "yarn.container.health.threshold.init.delay.secs": "400"
    }
  }
}
{code}

{code:title=hive --service llap --health-percent 75 --health-init-delay-secs 600 --health-time-window-secs
1200}
{
  "schema" : "http://example.org/specification/v2.0.0",
  "metadata" : {
  },
  "global" : {
    "yarn.log.include.patterns": ".*\\.done"
  },
  "components": {
    "slider-appmaster": {
      "yarn.memory": "1024",
      "yarn.component.instances": "1"
    },
    "LLAP": {
      "yarn.role.priority": "1",
      "yarn.component.instances": "3",
      "yarn.resource.normalization.enabled": "false",
      "yarn.memory": "174080",
      "yarn.component.placement.policy" : "4",
      "yarn.container.health.threshold.percent": "75",
      "yarn.container.health.threshold.window.secs": "1200",
      "yarn.container.health.threshold.init.delay.secs": "600"
    }
  }
}
{code}

> LLAP: Slider takes down all daemons when some daemons fail repeatedly
> ---------------------------------------------------------------------
>
>                 Key: HIVE-16927
>                 URL: https://issues.apache.org/jira/browse/HIVE-16927
>             Project: Hive
>          Issue Type: Bug
>          Components: llap
>    Affects Versions: 3.0.0
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>         Attachments: HIVE-16927.1.patch
>
>
> When some containers fail repeatedly, slider thinks application is in unstable state
which brings down all llap daemons. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Mime
View raw message