hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Mark Wagner (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-4732) Speed up AvroSerde by checking hashcodes instead of equality
Date Mon, 01 Jul 2013 18:40:22 GMT

    [ https://issues.apache.org/jira/browse/HIVE-4732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13697052#comment-13697052
] 

Mark Wagner commented on HIVE-4732:
-----------------------------------

I double checked and I am using the latest release (1.7.4) for my benchmarks. Although the
equals() method uses hashcode to quickly discard unequal schemas, it still does a full recursive
walk of the schema to make sure all the nodes are equal, so I think this patch is still a
necessary change.


                
> Speed up AvroSerde by checking hashcodes instead of equality
> ------------------------------------------------------------
>
>                 Key: HIVE-4732
>                 URL: https://issues.apache.org/jira/browse/HIVE-4732
>             Project: Hive
>          Issue Type: Improvement
>          Components: Serializers/Deserializers
>            Reporter: Mark Wagner
>            Assignee: Mark Wagner
>         Attachments: HIVE-4732.1.patch
>
>
> The AvroSerde spends a significant amount of time checking schema equality. Changing
to compare hashcodes (which can be computed once then reused) will improve performance.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message