hive-dev mailing list archives

From "Edward Capriolo (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-3227) Implement data loading from user provided string directly for test
Date Fri, 06 Jul 2012 15:49:34 GMT

    [ https://issues.apache.org/jira/browse/HIVE-3227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13408086#comment-13408086 ]

Edward Capriolo commented on HIVE-3227:
---------------------------------------

@Navis This is a good idea, but we have to be very careful about the features we add to the
language. We also have to cover cases such as overwritten files.

Your idea though prompted me to write:
https://issues.apache.org/jira/browse/HIVE-3238

I think user space is a better answer for this problem. We can still consider adding this
feature, but I think 3238 is a little safer.

You should hang out on the Hive IRC channel so we can discuss this more. 3238 is a bit more
verbose and will not speed up unit testing as you mentioned, but I like the approach better.
                
> Implement data loading from user provided string directly for test
> ------------------------------------------------------------------
>
>                 Key: HIVE-3227
>                 URL: https://issues.apache.org/jira/browse/HIVE-3227
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor, Testing Infrastructure
>    Affects Versions: 0.10.0
>            Reporter: Navis
>            Assignee: Navis
>            Priority: Trivial
>
> {code}
> load data instream 'key value\nkey2 value2' into table test;
> {code}
> This will make tests easier to write and can also reduce test time. For example,
> {code}
> -- ppr_pushdown.q
> create table ppr_test (key string) partitioned by (ds string);
> alter table ppr_test add partition (ds = '1234');
> insert overwrite table ppr_test partition(ds = '1234') select * from (select '1234' from src limit 1 union all select 'abcd' from src limit 1) s;
> {code}
> The last query runs as 4 MR jobs, but it can be replaced by
> {code}
> create table ppr_test (key string) partitioned by (ds string) ROW FORMAT delimited fields terminated by ' ';
> alter table ppr_test add partition (ds = '1234');
> load data local instream '1234\nabcd' overwrite into table ppr_test partition(ds = '1234');
> {code}
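
For comparison, much the same effect can be approximated today without new syntax by writing the string to a local file and using the existing LOAD DATA LOCAL INPATH statement. The following is only a minimal sketch in Python of how a test harness might do that; the helper name load_statement_from_string and its parameters are hypothetical and are not part of Hive or of either issue.

```python
import os
import tempfile


def load_statement_from_string(data, table, partition=None, overwrite=False):
    """Write `data` to a temp file and return (path, HiveQL LOAD statement).

    Hypothetical helper for illustration; it only builds the statement text
    and does not talk to Hive.
    """
    # Persist the inline test data so Hive can read it as a local file.
    fd, path = tempfile.mkstemp(suffix=".txt")
    with os.fdopen(fd, "w") as handle:
        handle.write(data)

    # Assemble the existing LOAD DATA LOCAL INPATH syntax piece by piece.
    parts = ["load data local inpath '%s'" % path]
    if overwrite:
        parts.append("overwrite")
    parts.append("into table %s" % table)
    if partition:
        parts.append("partition(%s)" % partition)
    return path, " ".join(parts) + ";"


# Same rows and target partition as the ppr_pushdown.q example above.
path, statement = load_statement_from_string(
    "1234\nabcd", "ppr_test", partition="ds = '1234'", overwrite=True)
print(statement)
```

The caller is responsible for deleting the temporary file once the statement has been executed.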
