Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 65459CCC5 for ; Tue, 16 Jul 2013 18:15:07 +0000 (UTC) Received: (qmail 1111 invoked by uid 500); 16 Jul 2013 18:15:02 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 1016 invoked by uid 500); 16 Jul 2013 18:15:02 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Delivered-To: moderator for user@hadoop.apache.org Received: (qmail 45515 invoked by uid 99); 16 Jul 2013 09:10:20 -0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of austincv@gmail.com designates 74.125.82.44 as permitted sender) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=gpuYJgal5N3/5vvDau7JTPHXoL6GnFMXC9MMaS0iObQ=; b=E2QZlIH+DpJXpZssEI4sZCkAZaKr7Gt+tdxgt/fHCez4aKA/R5pP+VfT3I7rI508qc aw7nQVyhwPR81w2jnLleyeE4bM36+f+VheaMgEQEZNVkZgr1T9P6sIIb11Q9/ARLdrzx ovtZbbJsol6/JT3UUl0b9FoQfCrjAWCl8ZxDkWUbd5O2/brjtngSSRYyQksHxBE8PcVi RhLGdAa6A21DEmEUWqgHiIoZ4kbKgokdHIEXc1lut0REG/PRDeHHLqvvLLJx533uu6C0 DWf6zNldbEDsVduaT6iChQ2kaXA5j4BzB9R0wVTlTXc4BOrOWBOaWOcjzy5Jjx2FoYfK A8tg== MIME-Version: 1.0 X-Received: by 10.180.107.71 with SMTP id ha7mr394930wib.28.1373965793829; Tue, 16 Jul 2013 02:09:53 -0700 (PDT) Date: Tue, 16 Jul 2013 14:39:53 +0530 Message-ID: Subject: spawn maps without any input data - hadoop streaming From: Austin Chungath To: user@hadoop.apache.org Content-Type: multipart/alternative; boundary=e89a8f2353231334c304e19d575b X-Virus-Checked: Checked by ClamAV on apache.org --e89a8f2353231334c304e19d575b Content-Type: text/plain; charset=ISO-8859-1 Hi, I am trying to generate random data using hadoop streaming & python. It's a map only job and I need to run a number of maps. There is no input to the map as it's just going to generate random data. How do I specify the number of maps to run? ( I am confused here because, if I am not wrong, the number of maps spawned is related to the input data size ) Any ideas as to how this can be done? Warm regards, Austin --e89a8f2353231334c304e19d575b Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hi,

I am trying to generate rando= m data using hadoop streaming & python. It's a map only job and I n= eed to run a number of maps. There is no input to the map as it's just = going to generate random data.

How do I specify the number of maps to run? ( I am confused = here because, if I am not wrong, the number of maps spawned is related to t= he input data size )
Any ideas as to how this can be done?

Warm regards,
Austin
--e89a8f2353231334c304e19d575b--