falcon-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Balu Vellanki (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (FALCON-20) Remove dependency on custom InMobi DistCp
Date Wed, 08 Oct 2014 23:20:36 GMT

    [ https://issues.apache.org/jira/browse/FALCON-20?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14164128#comment-14164128
] 

Balu Vellanki edited comment on FALCON-20 at 10/8/14 11:20 PM:
---------------------------------------------------------------

The latest patch FALCON-20-C.patch tested end2end. Patch applies cleanly, all tests pass.
 +1 for this patch.

I tested for the following.

hadoop-distcp.jar exists under <falcon-home>/hadooplibs/ dir and is not packaged as
part of falcon.war. This is expected behavior. InMobi DistCp jar is not packaged. 
I submitted a feed with replication enabled on a single node falcon setup. The replication
job (which uses distcp jar) succeeded.  

Source 
[hdfs@falcon-balu-6-3 ~]$ hadoop fs -ls -R /user/hrt_qa/falcon/demo/primary/processed/enron/2014-10-03-00
drwxr-xr-x   - hrt_qa hrt_qa          0 2014-10-07 14:43 /user/hrt_qa/falcon/demo/primary/processed/enron/2014-10-03-00
-rw-r--r--   3 hrt_qa hrt_qa          0 2014-10-07 14:43 /user/hrt_qa/falcon/demo/primary/processed/enron/2014-10-03-00/_SUCCESS
-rw-r--r--   3 hrt_qa hrt_qa    9485831 2014-10-07 14:43 /user/hrt_qa/falcon/demo/primary/processed/enron/2014-10-03-00/part-m-00000


Target. 
[hdfs@falcon-balu-6-3 ~]$ hadoop fs -ls -R /falcon/demo/bcp/processed/enron/2014-10-03-00
drwxrwxrwx   - hrt_qa hdfs          0 2014-10-08 20:37 /falcon/demo/bcp/processed/enron/2014-10-03-00
-rw-r--r--   3 hrt_qa hdfs          0 2014-10-08 20:37 /falcon/demo/bcp/processed/enron/2014-10-03-00/_SUCCESS
-rwxrwxrwx   3 hrt_qa hdfs    9485831 2014-10-08 20:37 /falcon/demo/bcp/processed/enron/2014-10-03-00/part-m-00000




was (Author: bvellanki):
The latest patch FALCON-20-C.patch tested end2end. Patch applies cleanly, all tests pass.
 +1 for this patch.

I tested for the following.

hadoop-distcp.jar exists under <falcon-home>/hadooplibs/ dir and is not packaged as
part of falcon.war. This is expected behavior. InMobi DistCp jar is not packaged. 
I submitted a feed with replication enabled on a single node falcon setup. The replication
job (which uses distcp jar) succeeded. Attaching the feed xml for reference. 

Source 
[hdfs@falcon-balu-6-3 ~]$ hadoop fs -ls -R /user/hrt_qa/falcon/demo/primary/processed/enron/2014-10-03-00
drwxr-xr-x   - hrt_qa hrt_qa          0 2014-10-07 14:43 /user/hrt_qa/falcon/demo/primary/processed/enron/2014-10-03-00
-rw-r--r--   3 hrt_qa hrt_qa          0 2014-10-07 14:43 /user/hrt_qa/falcon/demo/primary/processed/enron/2014-10-03-00/_SUCCESS
-rw-r--r--   3 hrt_qa hrt_qa    9485831 2014-10-07 14:43 /user/hrt_qa/falcon/demo/primary/processed/enron/2014-10-03-00/part-m-00000


Target. 
[hdfs@falcon-balu-6-3 ~]$ hadoop fs -ls -R /falcon/demo/bcp/processed/enron/2014-10-03-00
drwxrwxrwx   - hrt_qa hdfs          0 2014-10-08 20:37 /falcon/demo/bcp/processed/enron/2014-10-03-00
-rw-r--r--   3 hrt_qa hdfs          0 2014-10-08 20:37 /falcon/demo/bcp/processed/enron/2014-10-03-00/_SUCCESS
-rwxrwxrwx   3 hrt_qa hdfs    9485831 2014-10-08 20:37 /falcon/demo/bcp/processed/enron/2014-10-03-00/part-m-00000



> Remove dependency on custom InMobi DistCp
> -----------------------------------------
>
>                 Key: FALCON-20
>                 URL: https://issues.apache.org/jira/browse/FALCON-20
>             Project: Falcon
>          Issue Type: Sub-task
>          Components: replication
>    Affects Versions: 0.3
>            Reporter: Venkatesh Seetharam
>            Assignee: Sowmya Ramesh
>             Fix For: 0.6
>
>         Attachments: FALCON-20-C.patch, FALCON-20.patch, FALCON-20.r1.patch, FALCON-20.v2.patch,
cluster01.xml, cluster02.xml, feed.xml
>
>
> Falcon depends on custom DistCp 0.3 since DistCp v2 was part of Hadoop 2.x line. 
> https://issues.apache.org/jira/browse/MAPREDUCE-5081 has back ported DistCp v2 to Hadoop
1.x line and Falcon should instead depend on that.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message