hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "jiaxin zou (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-21803) The result of "insert overwrite table" is inconsistent with the original table
Date Wed, 29 May 2019 12:44:00 GMT

     [ https://issues.apache.org/jira/browse/HIVE-21803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

jiaxin zou updated HIVE-21803:
------------------------------
    Description: 
Hi all

I have a tableA partitioned by day/hour( insert overwrite table A partiton(day,hour) select
...from table B where day =.. and hour =...) but  count ( * ) of table A and B is not equal occasionally
(for example, hour =12). when i rerun the job (  insert overwrite ..hour =12), the count
( * )  is consistent. That means the bug cannot repeat.

I find the map output records is not equal to the reducer input records

  !企业微信截图_15591333565716.png!

 

  was:
Hi all

I have a tableA partitioned by day/hour( insert overwrite table A partiton(day,hour) select
...from table B where day =.. and hour =...) but  count(*) of table A and B is not equal occasionally
(for example, hour =12). when i rerun the job (  insert overwrite ..hour =12), the count(*)
is consistent. That means the bug cannot repeat.

I find the map output records is not equal to the reducer input records

  !企业微信截图_15591333565716.png!

 


> The result of "insert overwrite table" is inconsistent with the original table
> ------------------------------------------------------------------------------
>
>                 Key: HIVE-21803
>                 URL: https://issues.apache.org/jira/browse/HIVE-21803
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 2.3.0
>            Reporter: jiaxin zou
>            Priority: Major
>         Attachments: 企业微信截图_15591333565716.png
>
>
> Hi all
> I have a tableA partitioned by day/hour( insert overwrite table A partiton(day,hour)
select ...from table B where day =.. and hour =...) but  count ( * ) of table A and B is
not equal occasionally (for example, hour =12). when i rerun the job (  insert overwrite
..hour =12), the count ( * )  is consistent. That means the bug cannot repeat.
> I find the map output records is not equal to the reducer input records
>   !企业微信截图_15591333565716.png!
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Mime
View raw message