hive-issues mailing list archives

From "Mohit Sabharwal (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HIVE-13057) Remove duplicate copies of TableDesc property values in PartitionDesc
Date Sat, 13 Feb 2016 01:52:18 GMT

     [ https://issues.apache.org/jira/browse/HIVE-13057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mohit Sabharwal updated HIVE-13057:
-----------------------------------
    Attachment: HIVE-13057.patch

> Remove duplicate copies of TableDesc property values in PartitionDesc
> ---------------------------------------------------------------------
>
>                 Key: HIVE-13057
>                 URL: https://issues.apache.org/jira/browse/HIVE-13057
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Mohit Sabharwal
>            Assignee: Mohit Sabharwal
>         Attachments: HIVE-13057.patch
>
>
> For a partitioned table, each PartitionDesc has a copy of corresponding TableDesc.
> While TableDesc is mutable and hence cannot be interned, its property values can be.
> For a simple select on a table with 100K partitions, this cut the total number of String instances by ~65%.
> Most replicated strings were location, serde, input/output format, column, types, table name, etc.
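
The idea of interning property values can be sketched as follows. This is a minimal illustration only, not the code in HIVE-13057.patch: the class name PropertyInterning and the method internValues are hypothetical, and it assumes the table properties are held in a java.util.Properties object.

```java
import java.util.Properties;

// Hypothetical helper for illustration; not taken from HIVE-13057.patch.
final class PropertyInterning {
    private PropertyInterning() {}

    /**
     * Replaces each String value in props with its canonical instance from
     * the JVM string pool, so that equal values held by many Properties
     * objects end up sharing a single String.
     */
    static Properties internValues(Properties props) {
        // stringPropertyNames() returns a snapshot of the keys, so updating
        // props inside the loop is safe.
        for (String key : props.stringPropertyNames()) {
            props.setProperty(key, props.getProperty(key).intern());
        }
        return props;
    }
}
```

Applied to the properties of every per-partition copy, equal values such as the serde class name or the input format would then refer to one shared String rather than one instance per partition. The Properties objects themselves stay separate and mutable; only the immutable values are shared.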



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
