Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 58488 invoked from network); 12 Aug 2009 00:30:27 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 12 Aug 2009 00:30:27 -0000 Received: (qmail 80073 invoked by uid 500); 12 Aug 2009 00:05:59 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 79663 invoked by uid 500); 12 Aug 2009 00:05:58 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 78731 invoked by uid 99); 12 Aug 2009 00:02:21 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 Aug 2009 00:02:21 +0000 X-ASF-Spam-Status: No, hits=1.2 required=10.0 tests=SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (athena.apache.org: local policy) Received: from [69.147.107.21] (HELO mrout2-b.corp.re1.yahoo.com) (69.147.107.21) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 12 Aug 2009 00:02:10 +0000 Received: from [172.21.149.106] (wlanvpn-mc2e-247-106.corp.yahoo.com [172.21.149.106]) by mrout2-b.corp.re1.yahoo.com (8.13.8/8.13.8/y.out) with ESMTP id n7BNxONK088198 for ; Tue, 11 Aug 2009 16:59:25 -0700 (PDT) DomainKey-Signature: a=rsa-sha1; s=serpent; d=yahoo-inc.com; c=nofws; q=dns; h=message-id:date:from:user-agent:mime-version:to:subject: references:in-reply-to:content-type:content-transfer-encoding; b=mHJCW9iugGD4+1vebvYDse01tZPAhrU/oiVhLejlZKhwPp0Mh0DLhjtZNpIVyvZL Message-ID: <4A8205DC.3040907@yahoo-inc.com> Date: Tue, 11 Aug 2009 16:59:24 -0700 From: Jakob Homan User-Agent: Thunderbird 2.0.0.12 (Macintosh/20080213) MIME-Version: 1.0 To: common-user@hadoop.apache.org Subject: Re: Creating a job References: <77f4f8890908111459w53ab1a1bq455f1e937409c12a@mail.gmail.com> In-Reply-To: <77f4f8890908111459w53ab1a1bq455f1e937409c12a@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org Hey Mithila- I would point you to the WordCount example (http://hadoop.apache.org/common/docs/current/mapred_tutorial.html) for a basic example of how jobs are created by supplying a JobConf to the JobClient. This will submit your conf to the cluster which will create and run the job. The JobControl class is to manager a series of jobs that are dependent on each other. Is this a situation you're facing? If not, the job submission strategy in the WordCount example should be sufficient. Regarding threading: Writing multi-thread apps is generally not needed, as Hadoop provides parallelization via MapReduce. However, there is a MultithreadedMapper for situations where you may not be maxing out the CPU in a specific Mapper. It sounds like it may be helpful to check out the job submission documentation: http://hadoop.apache.org/common/docs/current/mapred_tutorial.html#Job+Submission+and+Monitoring Let us know if anything is unclear after that. Thanks, Jakob Homan Yahoo! Mithila Nagendra wrote: > Hello All > > How do I create a Job in Hadoop using Class Job? And how do I run it? > Generally JobClient.runJob(conf) is used, but the parameter in not of the > type Job. > > Also How do I use the class JobControl? Can I create Threads in a Hadoop > (similar to multithreading in JAVA), where different Threads call diffrent > hadoop jobs? I guess JobControl is connected to all this in some way. > > Thanks for you help > Mithila Nagendra >