hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lefty Leverenz (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-4248) Implement a memory manager for ORC
Date Sat, 09 Aug 2014 19:33:12 GMT

    [ https://issues.apache.org/jira/browse/HIVE-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14091872#comment-14091872
] 

Lefty Leverenz commented on HIVE-4248:
--------------------------------------

This added configuration parameter *hive.exec.orc.memory.pool* to HiveConf.java in 0.11.0.
 It's documented in the wiki here:

* [Configuration Properties -- hive.exec.orc.memory.pool | https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-hive.exec.orc.memory.pool]


> Implement a memory manager for ORC
> ----------------------------------
>
>                 Key: HIVE-4248
>                 URL: https://issues.apache.org/jira/browse/HIVE-4248
>             Project: Hive
>          Issue Type: New Feature
>          Components: Serializers/Deserializers
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>             Fix For: 0.11.0
>
>         Attachments: HIVE-4248.D9993.1.patch, HIVE-4248.D9993.2.patch, HIVE-4248.D9993.4.patch
>
>
> With the large default stripe size (256MB) and dynamic partitions, it is quite easy for
users to run out of memory when writing ORC files. We probably need a solution that keeps
track of the total number of concurrent ORC writers and divides the available heap space between
them. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Mime
View raw message