hive-dev mailing list archives

From "Owen O'Malley (JIRA)" <>
Subject [jira] [Commented] (HIVE-3153) Release codecs and output streams between flushes of RCFile
Date Thu, 26 Jul 2012 21:01:35 GMT


Owen O'Malley commented on HIVE-3153:

I also wrote a test program that just writes to a large number of RCFile.Writers. With the
patch applied, the process could keep many more Writers open before it ran out of memory.
> Release codecs and output streams between flushes of RCFile
> -----------------------------------------------------------
>                 Key: HIVE-3153
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: Compression
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>         Attachments: hive-3153.patch
> Currently, the RCFile writer holds one compression codec per file and one compression
> output stream per column. For queries that use dynamic partitions in particular, this
> quickly consumes a lot of memory.
> I'd like flushRecords to get a codec from the pool and create the compression output
> stream itself, so that both are released between flushes.
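
The idea in the description can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in hive-3153.patch: it uses the JDK's java.util.zip.Deflater as a stand-in for a Hadoop compression codec, and the class PooledFlushSketch, its pool, and its flushRecords method are hypothetical names rather than Hive's RCFile.Writer API. The point it shows is that the compressor is borrowed at the start of a flush and returned at the end, and that each column's compression stream exists only inside the flush.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.ArrayDeque;
import java.util.zip.Deflater;
import java.util.zip.DeflaterOutputStream;

/** Hypothetical stand-in for a columnar writer; NOT Hive's RCFile.Writer. */
public class PooledFlushSketch {
    /** Tiny pool of reusable compressors, playing the role of a codec pool (assumption). */
    static final ArrayDeque<Deflater> POOL = new ArrayDeque<>();
    /** Counts how many compressors were ever allocated, to make reuse visible. */
    static int created = 0;

    static synchronized Deflater borrow() {
        Deflater d = POOL.poll();
        if (d == null) {
            d = new Deflater();
            created++;
        }
        return d;
    }

    static synchronized void giveBack(Deflater d) {
        d.reset();   // clear state so the next borrower starts fresh
        POOL.push(d);
    }

    /** Compresses each buffered column; compressor and streams live only for this flush. */
    static byte[][] flushRecords(byte[][] columns) throws IOException {
        byte[][] out = new byte[columns.length][];
        Deflater deflater = borrow();
        try {
            for (int i = 0; i < columns.length; i++) {
                ByteArrayOutputStream sink = new ByteArrayOutputStream();
                // The stream is created here, per column, and closed right away.
                // Closing finishes the compressed data but does not end a
                // caller-supplied Deflater, so it can be reset and reused.
                try (DeflaterOutputStream dos = new DeflaterOutputStream(sink, deflater)) {
                    dos.write(columns[i]);
                }
                out[i] = sink.toByteArray();
                deflater.reset();
            }
        } finally {
            giveBack(deflater);   // nothing is held between flushes
        }
        return out;
    }
}
```

With this shape, an idle writer holds no compressor and no compression stream, so the number of live compressors tracks the number of flushes in progress rather than the number of open writers.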

