hive-dev mailing list archives

From "Edward Capriolo (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-4732) Speed up AvroSerde by checking hashcodes instead of equality
Date Mon, 01 Jul 2013 21:57:20 GMT

    [ https://issues.apache.org/jira/browse/HIVE-4732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13697238#comment-13697238 ]

Edward Capriolo commented on HIVE-4732:
---------------------------------------



{quote}
This change definitely assumes that we won't be unlucky and have a hashcode collision.
{quote}

^ That is a dangerous assumption, and not one I would want to +1.

{quote}
&& wouldn't allow short circuiting if hashCodes were equal, only if they're unequal.
{quote}

Ah, good point. I just meant to say 'use short circuiting and a compound if clause' (it could
use !, |, ||, &&, &, or whatever you need). The point was that you can optimize the code
without making it logically incorrect.
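
A minimal sketch of that idea, not taken from the HIVE-4732 patch: compare the cached hash codes first and call equals() only when they match, so a collision costs one extra comparison instead of a wrong answer. The names HashThenEquals, Cached, and sameValue are hypothetical, and String stands in for an Avro schema purely for illustration.

```java
import java.util.Objects;

// Hypothetical helper, for illustration only; not part of Hive or Avro.
final class HashThenEquals {

    // Pairs a value with its hash code, computed once and then reused.
    static final class Cached<T> {
        final T value;
        final int hash;

        Cached(T value) {
            this.value = Objects.requireNonNull(value);
            this.hash = value.hashCode();
        }
    }

    // Unequal hashes prove the values differ, so && short-circuits and
    // equals() is skipped. Equal hashes prove nothing, so equals() decides.
    static <T> boolean sameValue(Cached<T> a, Cached<T> b) {
        return a.value == b.value
                || (a.hash == b.hash && a.value.equals(b.value));
    }

    public static void main(String[] args) {
        // "Aa" and "BB" are distinct strings with the same String.hashCode().
        Cached<String> a = new Cached<>("Aa");
        Cached<String> b = new Cached<>("BB");
        System.out.println(a.hash == b.hash);
        System.out.println(sameValue(a, b));
    }
}
```

A hash-only check would treat "Aa" and "BB" as equal; the compound condition keeps the fast rejection path while remaining logically correct.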
                
> Speed up AvroSerde by checking hashcodes instead of equality
> ------------------------------------------------------------
>
>                 Key: HIVE-4732
>                 URL: https://issues.apache.org/jira/browse/HIVE-4732
>             Project: Hive
>          Issue Type: Improvement
>          Components: Serializers/Deserializers
>            Reporter: Mark Wagner
>            Assignee: Mark Wagner
>         Attachments: HIVE-4732.1.patch
>
>
> The AvroSerde spends a significant amount of time checking schema equality. Changing
> to compare hashcodes (which can be computed once then reused) will improve performance.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
