hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Raghotham Murthy (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-518) test mode in hive
Date Wed, 27 May 2009 04:02:45 GMT

    [ https://issues.apache.org/jira/browse/HIVE-518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12713396#action_12713396

Raghotham Murthy commented on HIVE-518:

what happens if the production query has one sampled table joined against an unsampled table?
A common example is facts table sampled by user, joined with a dimension table on a dimension
attribute like gender/country etc. by adding an arbitrary sample clause on the dimension table,
the join result may be empty.

> test mode in hive
> -----------------
>                 Key: HIVE-518
>                 URL: https://issues.apache.org/jira/browse/HIVE-518
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.1
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.4.0
>         Attachments: hive.518.1.patch, hive.518.2.patch
> It would be good to have a test mode in hive - this will help in checking the validity
of a hive drop on a production cluster.
> The following would be good to have:
> Testmode --> In testmode, all input tables are sampled (if not already sampled) and
all output tables are prefixed by a user supplied name.
> This way, multiple hive drops can be compared quickly for correctness

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message