Subject: Re: Streaming in mapreduce
From: Pedro Costa
Date: Sat, 16 Jun 2012 10:23:20 +0100
To: mapreduce-user@hadoop.apache.org

I still don't get why Hadoop streaming is useful. If I have map and reduce functions defined in shell scripts, like the ones below, why should I use Hadoop?

cat someInputFile | shellMapper.sh | shellReducer.sh > someOutputFile

On 16/06/2012, at 01:21, Ruslan Al-Fakikh wrote:

> Hi Pedro,
>
> You can find it here
> http://wiki.apache.org/hadoop/HadoopStreaming
>
> Thanks
>
> On Sat, Jun 16, 2012 at 2:46 AM, Pedro Costa wrote:
>> Hi,
>>
>> Hadoop mapreduce can be used for streaming. But what is streaming from the point of view of mapreduce? To me, streaming means video and audio data.
>>
>> Why does mapreduce support streaming?
>>
>> Can anyone give me an example of why to use streaming in mapreduce?
>>
>> Thanks,
>> Pedro
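For comparison with the local pipeline in the message, here is a minimal sketch of the contract a streaming mapper and reducer follow: the mapper turns input lines into tab-separated key/value records, the records are sorted by key, and the reducer consumes the sorted records. The word-count logic and the names `mapper`, `reducer`, and `run_local` are hypothetical, chosen only for illustration; the explicit sort step is an assumption added here, since the pipeline in the message has none. On a cluster, Hadoop Streaming would run the mapper and reducer as separate processes on many nodes and do the sorting between them, which a single local pipe cannot do.

```python
from itertools import groupby


def mapper(lines):
    """Emit one "word<TAB>1" record per word, as a streaming mapper would write to stdout."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reducer(sorted_records):
    """Sum the counts per key; this relies on the records arriving sorted by key."""
    pairs = (record.split("\t", 1) for record in sorted_records)
    for key, group in groupby(pairs, key=lambda pair: pair[0]):
        yield f"{key}\t{sum(int(count) for _, count in group)}"


def run_local(lines):
    """Local stand-in for: cat input | mapper | sort | reducer (hypothetical helper)."""
    return list(reducer(sorted(mapper(lines))))


result = run_local(["to be or not", "to be"])
for record in result:
    print(record)
```

The reducer only works because equal keys are adjacent after sorting; that grouping is the part the framework provides when the same two programs run across many machines.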