hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "shanyu zhao (JIRA)" <j...@apache.org>
Subject [jira] [Created] (HIVE-7155) WebHCat controller job exceeds container memory limit
Date Sat, 31 May 2014 01:29:03 GMT
shanyu zhao created HIVE-7155:
---------------------------------

             Summary: WebHCat controller job exceeds container memory limit
                 Key: HIVE-7155
                 URL: https://issues.apache.org/jira/browse/HIVE-7155
             Project: Hive
          Issue Type: Bug
          Components: WebHCat
    Affects Versions: 0.13.0
            Reporter: shanyu zhao
            Assignee: shanyu zhao


Submit a Hive query on a large table via WebHCat results in failure because the WebHCat controller
job is killed by Yarn since it exceeds the memory limit (set by mapreduce.map.memory.mb, defaults
to 1GB):
{code}
 INSERT OVERWRITE TABLE Temp_InjusticeEvents_2014_03_01_00_00 SELECT * from Stage_InjusticeEvents
where LogTimestamp > '2014-03-01 00:00:00' and LogTimestamp <= '2014-03-01 01:00:00';
{code}

We could increase mapreduce.map.memory.mb to solve this problem, but this way we are changing
this setting system wise.

We need to provide a WebHCat configuration to overwrite mapreduce.map.memory.mb when submitting
the controller job.





--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message