hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "David Phillips (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-29) Add functionality similar to SCOPE's "virtual" views to Hive
Date Mon, 01 Dec 2008 18:01:44 GMT

    [ https://issues.apache.org/jira/browse/HIVE-29?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12652078#action_12652078

David Phillips commented on HIVE-29:

The example script from section 3.5:

E = EXTRACT query
    FROM @@logfile@@
    USING LogExtractor ;

R = SELECT query, COUNT() AS count
    FROM E
    GROUP BY query
    HAVING count > @@mincount@@;

Using the above script:

Q1 = IMPORT "MyView.script"
     PARAMS logfile="Queries_Jan.log",
            limit=1000 ;

Q2 = IMPORT "MyView.script"
     PARAMS logfile="Queries_Feb.log",
            limit=1000 ;

JQ = SELECT Q1.query, Q2.count-Q1.count AS diff,
            Q1.count AS jan_cnt,
            Q2.count AS feb_count,
          ON Q1.query == Q2.query
     ORDER BY diff DESC;

> Add functionality similar to SCOPE's "virtual" views to Hive
> ------------------------------------------------------------
>                 Key: HIVE-29
>                 URL: https://issues.apache.org/jira/browse/HIVE-29
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Jeff Hammerbacher
> SCOPE has many nice features, and the ability to IMPORT/EXPORT parameterized scripts
and store partial queries in named variables is one of them. Section 3.5 of the SCOPE paper
has the details, and there are several examples throughout the paper. Perhaps we can choose
an alternative delimiter for PARAMTER imports, however (SCOPE uses "@@...@@").

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message