pig-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Alex Bain" <ambclo...@gmail.com>
Subject Re: Review Request 14897: PIG-3538 Implement LIMIT in Tez
Date Thu, 24 Oct 2013 01:42:05 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/14897/
-----------------------------------------------------------

(Updated Oct. 23, 2013, 6:42 p.m.)


Review request for pig, Cheolsoo Park, Daniel Dai, Mark Wagner, and Rohini Palaniswamy.


Summary (updated)
-----------------

PIG-3538 Implement LIMIT in Tez


Bugs: PIG-3538
    https://issues.apache.org/jira/browse/PIG-3538


Repository: pig-git


Description
-------

Implement LIMIT in Tez by providing an implementation of visitLimit in TezCompiler.java.


Diffs
-----

  src/org/apache/pig/backend/hadoop/executionengine/tez/TezCompiler.java 0c20214 

Diff: https://reviews.apache.org/r/14897/diff/


Testing (updated)
-------

[abain@abain-ld pig]$ cat data/1.dat
1,orange
2,apple
3,strawberry

[abain@abain-ld pig]$ cat test3.pig
a = load './1.dat' using PigStorage(',') as (id:int, fruit:chararray);
b = LIMIT a 2;
STORE b INTO 'foo';

I ran with with "pig -x tez -f test3.pig" and got the following (correct results):

[abain@abain-ld pig]$ hadoop fs -ls /user/abain/foo
Found 2 items
-rw-r--r--   1 abain supergroup          0 2013-10-23 18:38 /user/abain/foo/_SUCCESS
-rw-r--r--   1 abain supergroup         17 2013-10-23 18:38 /user/abain/foo/part-r-00000

[abain@abain-ld pig]$ hadoop fs -cat /user/abain/foo/part-r-00000
1	orange
2	apple


Thanks,

Alex Bain


Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message