hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "zhao hongjun (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-16478) The result of select sum sql is not correct when some datanodes of hadoop cluster are sometime bad and sometime good every 1-2mins
Date Wed, 19 Apr 2017 09:49:41 GMT

     [ https://issues.apache.org/jira/browse/HIVE-16478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

zhao hongjun updated HIVE-16478:
--------------------------------
    Description: 
1. submiting a select sum sql to hadoop cluster with hive

2. the mapreduce job are running on the datanodes:  DN_A, DN_B, DN_C

3. When map job is completed, the datanodes DN_A, DN_B, DN_C  are bad health(disk utilization
percent reach the value 'yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage'
),  and reported to resource manager. At the same time, application master killed the map
job.

4. the result of select sum sql is not correct.
    

  was:

1. submiting a select sum sql to hadoop cluster with hive
2. the mapreduce job are running on the datanodes:  DN_A, DN_B, DN_C
3. When map job is completed, the datanodes DN_A, DN_B, DN_C  are bad health(disk utilization
percent reach the value 'yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage'
),  and reported to resource manager. At the same time, application master killed the map
job.
4. the result of select sum sql is not correct.
    


> The result of select sum sql is not correct when some datanodes of hadoop cluster are
sometime bad and sometime good every 1-2mins
> ----------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-16478
>                 URL: https://issues.apache.org/jira/browse/HIVE-16478
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 1.1.0
>         Environment: Linux 2.6.32-431.el6.x86_64, CDH Cluster
>            Reporter: zhao hongjun
>
> 1. submiting a select sum sql to hadoop cluster with hive
> 2. the mapreduce job are running on the datanodes:  DN_A, DN_B, DN_C
> 3. When map job is completed, the datanodes DN_A, DN_B, DN_C  are bad health(disk utilization
percent reach the value 'yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage'
),  and reported to resource manager. At the same time, application master killed the map
job.
> 4. the result of select sum sql is not correct.
>     



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message