Return-Path: Delivered-To: apmail-hadoop-core-user-archive@www.apache.org Received: (qmail 42571 invoked from network); 12 Aug 2008 22:15:47 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 12 Aug 2008 22:15:47 -0000 Received: (qmail 81446 invoked by uid 500); 12 Aug 2008 22:15:41 -0000 Delivered-To: apmail-hadoop-core-user-archive@hadoop.apache.org Received: (qmail 81407 invoked by uid 500); 12 Aug 2008 22:15:41 -0000 Mailing-List: contact core-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-user@hadoop.apache.org Delivered-To: mailing list core-user@hadoop.apache.org Received: (qmail 81396 invoked by uid 99); 12 Aug 2008 22:15:41 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Aug 2008 15:15:41 -0700 X-ASF-Spam-Status: No, hits=2.0 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of arv.andrew@gmail.com designates 66.249.90.177 as permitted sender) Received: from [66.249.90.177] (HELO ik-out-1112.google.com) (66.249.90.177) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Aug 2008 22:14:45 +0000 Received: by ik-out-1112.google.com with SMTP id c28so3067786ika.5 for ; Tue, 12 Aug 2008 15:15:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:sender :to:subject:in-reply-to:mime-version:content-type:references :x-google-sender-auth; bh=fD/IiE4yaC0eOX+f0yzwr3pV3W2CVYnrTVJoAsLA8Sc=; b=uWiPWvtDDlygylce4TOgpQEpDEMsb78K8YBFytrcOqyWHPBG0D0bdqgnM84tde4iFb OlkOZt1OrSMn+O1YFoyiP6XXm6fQFJKRgWgkpt5iDlDG9500iQoJ2emhdMzeSwfFhjdY TTR2ZztituuEB7mTwnpiclei+9sR/T4Beh46o= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:sender:to:subject:in-reply-to:mime-version :content-type:references:x-google-sender-auth; b=Gt4SzIQGWW/B8Bpg8CFXNFZehdMnXw6dRMBwjpa3h30L+81uEF64T5OC8oFdsJ5Wq5 XbXGo2N7nQ5F0z0kI8i8UwY8dF/SS2TBu6v9b7LcthJywRtc8SjQDh76J6j+W8g7999L XWmQWAWfS48sZ7S7grhGDH0zKoltR4BvNrNJw= Received: by 10.210.25.20 with SMTP id 20mr11000593eby.46.1218579312051; Tue, 12 Aug 2008 15:15:12 -0700 (PDT) Received: by 10.210.59.18 with HTTP; Tue, 12 Aug 2008 15:15:12 -0700 (PDT) Message-ID: Date: Tue, 12 Aug 2008 15:15:12 -0700 From: "Ashish Venugopal" Sender: arv.andrew@gmail.com To: core-user@hadoop.apache.org Subject: Re: Difference between Hadoop Streaming and "Normal" mode In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_58023_13368204.1218579312040" References: X-Google-Sender-Auth: d3f3f01dcb266303 X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_58023_13368204.1218579312040 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline There is definitely functionality in "normal" mode that is not available in streaming, like the ability to write counters to instruments jobs. I personally just use streaming, so I am interested to see if there are further key differences... Ashish On Tue, Aug 12, 2008 at 3:09 PM, Gaurav Veda > wrote: > Hi All, > > This might seem too silly, but I couldn't find a satisfactory answer > to this yet. What are the advantages / disadvantages of using Hadoop > Streaming over the normal mode (wherein you write your own mapper and > reducer in Java)? From what I gather, the real advantage of Hadoop > Streaming is that you can use any executable (in c / perl / python > etc) as a mapper / reducer. > A slight disadvantage is that the default is to read (write) from the > standard input (output) ... though one can specify their own Input and > Output format (and package it with the default hadoop streaming jar > file). > > My point is, why should I ever use the normal mode? Streaming seems > just as good. Is there a performance problem or do I have only limited > control over my job if I use the streaming mode or some other issue? > > Thanks! > Gaurav > -- > Share what you know, learn what you don't ! > ------=_Part_58023_13368204.1218579312040--