hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ning Zhang (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HIVE-2037) Merge result file size should honor hive.merge.size.per.task
Date Wed, 09 Mar 2011 07:13:59 GMT

     [ https://issues.apache.org/jira/browse/HIVE-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Ning Zhang updated HIVE-2037:
-----------------------------

    Status: Patch Available  (was: Open)

> Merge result file size should honor hive.merge.size.per.task
> ------------------------------------------------------------
>
>                 Key: HIVE-2037
>                 URL: https://issues.apache.org/jira/browse/HIVE-2037
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Ning Zhang
>            Assignee: Ning Zhang
>         Attachments: HIVE-2037.patch
>
>
> The merge job set mapred.min.split.size to the value of hive.merge.size.per.task, which
roughly equals to the output file size. However the input split size is also determined by
mapred.min.split.size.per.node, mapred.min.split.size.per.rack, and mapred.max.split.size.
They should be set the same as hive.merge.size.per.task as well.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message