incubator-s4-dev mailing list archives

From Daniel Gómez Ferro (JIRA) <j...@apache.org>
Subject [jira] [Commented] (S4-25) Write S4 Application Master to deploy S4 in Yarn
Date Wed, 21 Nov 2012 10:55:59 GMT

    [ https://issues.apache.org/jira/browse/S4-25?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13501852#comment-13501852 ]

Daniel Gómez Ferro commented on S4-25:
--------------------------------------

The patch looks good and it works perfectly. Great feature!

I just have a couple of minor comments:

There is a Thread.sleep(10000) in S4ApplicationMaster.main() which I don't think is needed.

For the memory, I'd let the user set it. If they don't, I'd just scale the memory reserved
for the container by a fixed factor, maybe 0.8. So, for the -Xmx value I'd use a separate
parameter (-jvmMemory, -nodeMemory?) and, if it's not set, use containerMemory * 0.8.
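As a rough sketch of that fallback, assuming memory sizes are given in megabytes: the class and method names below (HeapSizing, jvmHeapMb, xmxFlag) are hypothetical and not taken from the patch, and the 0.8 factor is only the value floated above, expressed as an integer percentage to avoid floating-point rounding.

```java
import java.util.OptionalInt;

// Hypothetical helper, not part of the S4 patch: picks the JVM heap size
// for an S4 node running inside a YARN container.
final class HeapSizing {

    // Assumed default: 80% of the container memory (the "maybe 0.8" factor).
    static final int DEFAULT_HEAP_PERCENT = 80;

    // Returns the heap size in MB: the user-supplied value if present,
    // otherwise the container memory scaled by the default factor (rounded down).
    static int jvmHeapMb(OptionalInt userHeapMb, int containerMemoryMb) {
        if (containerMemoryMb <= 0) {
            throw new IllegalArgumentException("containerMemoryMb must be positive");
        }
        if (userHeapMb.isPresent()) {
            return userHeapMb.getAsInt();
        }
        return (int) ((long) containerMemoryMb * DEFAULT_HEAP_PERCENT / 100);
    }

    // Builds the -Xmx argument that would be passed to the node's JVM.
    static String xmxFlag(OptionalInt userHeapMb, int containerMemoryMb) {
        return "-Xmx" + jvmHeapMb(userHeapMb, containerMemoryMb) + "m";
    }
}
```

With a 1024 MB container and no explicit setting, this yields -Xmx819m; an explicit value such as 512 is passed through unchanged.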

I think that YARN stopping only the application master is a YARN bug. I'd open a ticket; if
it turns out there's a better way to stop it, they'll tell us.

                
> Write S4 Application Master to deploy S4 in Yarn
> ------------------------------------------------
>
>                 Key: S4-25
>                 URL: https://issues.apache.org/jira/browse/S4-25
>             Project: Apache S4
>          Issue Type: New Feature
>            Reporter: J Mohamed Zahoor
>             Fix For: 0.6
>
>         Attachments: S4-ApplicationMaster.diff, S4-Client.diff, S4-Constants.diff, S4-YARN-1.patch
>
>
> On the lines of s4PigWrapper, write a s4 application master to host s4 piper inside Hadoop
> Yarn. This could be useful not only for reading data stored in hadoop (to build or train
> a model)... But we could make use of the resource manager to deploy s4 instances in remote
> machine and monitor them. In short, we could make use of most of the resource management,
> scheduling and other good stuff in Yarn.
> - Yarn is useful to deploy and launch s4 instances.
> - It still requires deploying node managers on each box which means it will
> be useful if one is running more than one s4 process on a node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
