hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Kevin Wilfong (JIRA)" <>
Subject [jira] [Updated] (HIVE-3706) getBoolVar in FileSinkOperator can be optimized
Date Tue, 13 Nov 2012 19:12:14 GMT


Kevin Wilfong updated HIVE-3706:

    Attachment: HIVE-3706.1.patch.txt
> getBoolVar in FileSinkOperator can be optimized
> -----------------------------------------------
>                 Key: HIVE-3706
>                 URL:
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.10.0
>            Reporter: Kevin Wilfong
>            Assignee: Kevin Wilfong
>         Attachments: HIVE-3706.1.patch.txt
> There's a call to HiveConf.getBoolVar in FileSinkOperator's processOp method.  In benchmarks
we found this call to be using ~2% of the CPU time on simple queries, e.g. INSERT OVERWRITE
> This boolean value, a flag to collect the RawDataSize stat, won't change during the processing
of a query, so we can determine it at initialization and store that value, saving that CPU.

This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see:

View raw message