hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ning Zhang (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-1638) convert commonly used udfs to generic udfs
Date Wed, 29 Sep 2010 19:43:34 GMT

    [ https://issues.apache.org/jira/browse/HIVE-1638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12916256#action_12916256
] 

Ning Zhang commented on HIVE-1638:
----------------------------------

Siying, great work!

Also can you do an optimization for the case when the parameters are constants (e.g., the
2nd parameter of f_c='5015'). The objectInspector doesn't have the information of whether
the input parameter is constant or not, but I think if you check in evaluate() whether the
parameter is the same *object* between the 1st and 2nd row, you can conclude the parameter
is a constant. This can save a lot in object constructions. 

> convert commonly used udfs to generic udfs
> ------------------------------------------
>
>                 Key: HIVE-1638
>                 URL: https://issues.apache.org/jira/browse/HIVE-1638
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Siying Dong
>         Attachments: HIVE-1638.1.patch
>
>
> Copying a mail from Joy:
> i did a little bit of profiling of a simple hive group by query today. i was surprised
to see that one of the most expensive functions were in converting the equals udf (i had some
simple string filters) to generic udfs. (primitiveobjectinspectorconverter.textconverter)
> am i correct in thinking that the fix is to simply port some of the most popular udfs
(string equality/comparison etc.) to generic udsf?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message