Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 38741 invoked from network); 4 Jan 2011 08:29:32 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 4 Jan 2011 08:29:32 -0000 Received: (qmail 57906 invoked by uid 500); 4 Jan 2011 08:29:30 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 57491 invoked by uid 500); 4 Jan 2011 08:29:29 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 57482 invoked by uid 99); 4 Jan 2011 08:29:29 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Jan 2011 08:29:29 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of qwertymaniac@gmail.com designates 209.85.161.48 as permitted sender) Received: from [209.85.161.48] (HELO mail-fx0-f48.google.com) (209.85.161.48) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 04 Jan 2011 08:29:24 +0000 Received: by fxm2 with SMTP id 2so13376408fxm.35 for ; Tue, 04 Jan 2011 00:29:03 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:mime-version:received:in-reply-to :references:from:date:message-id:subject:to:content-type; bh=10MicfBjhDdcPkmK9ozJQPsg+VVgGf0hajoEAwaY0MM=; b=LPAYjgT9gKJtVc9QoibP3rhpNIXKMDb2X3sxzFUVjiMrnQS+cvCgCIKXKus9Kj1TgA yWrO9Qy0f2rys8KeP3hK7BE20HudRhHpP5VP0Lj+BP7qDU6rkLw3uBPTSaiP1E0UjF3M XoIdqCdLzGX0r63cWjEdDWNrarE+AkWAW9UIs= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; b=q8rukBgSGkDgbHNuPU5oTbIBRu8oc4MXnLr1F2yEZHdlCj4dZfg6qIKKeojOGmpwBQ oPPsuULRjca0u33HWvjC7Vs9yHf191bTFqvG56k3eb/qH6XdGmjrJv+Ln4qQuenPVoz+ nHHrUmwBuK9bEx4w55gK20rbx8dR73MuEpLTA= Received: by 10.223.100.8 with SMTP id w8mr1380821fan.55.1294129742554; Tue, 04 Jan 2011 00:29:02 -0800 (PST) MIME-Version: 1.0 Received: by 10.223.120.14 with HTTP; Tue, 4 Jan 2011 00:28:42 -0800 (PST) In-Reply-To: <4D22CCBB.9010409@orkash.com> References: <4D22CCBB.9010409@orkash.com> From: Harsh J Date: Tue, 4 Jan 2011 13:58:42 +0530 Message-ID: Subject: Re: Data for Testing in Hadoop To: common-user@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 You can use MR to generate the data itself. Checkout GridMix in Hadoop, or PigMix from Pig for examples on general load tests. On Tue, Jan 4, 2011 at 1:01 PM, Adarsh Sharma wrote: > Dear all, > > Designing the architecture is very important for the Hadoop in Production > Clusters. > > We are researching to run Hadoop in Cloud in Individual Nodes and in Cloud > Environment ( VM's ). > > For this, I require some data for testing. Would anyone send me some links > for data of different sizes ( 10Gb, 20GB, 30 Gb , 50GB ) . > I shall be grateful for this kindness. > > > Thanks & Regards > > Adarsh Sharma > > -- Harsh J www.harshj.com