Return-Path: Delivered-To: apmail-hadoop-common-user-archive@www.apache.org Received: (qmail 47321 invoked from network); 5 Jan 2011 03:29:27 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 5 Jan 2011 03:29:27 -0000 Received: (qmail 22070 invoked by uid 500); 5 Jan 2011 03:29:24 -0000 Delivered-To: apmail-hadoop-common-user-archive@hadoop.apache.org Received: (qmail 21950 invoked by uid 500); 5 Jan 2011 03:29:24 -0000 Mailing-List: contact common-user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-user@hadoop.apache.org Delivered-To: mailing list common-user@hadoop.apache.org Received: (qmail 21942 invoked by uid 99); 5 Jan 2011 03:29:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Jan 2011 03:29:23 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=10.0 tests=FREEMAIL_FROM,RCVD_IN_DNSWL_LOW,RFC_ABUSE_POST,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of goksron@gmail.com designates 209.85.161.48 as permitted sender) Received: from [209.85.161.48] (HELO mail-fx0-f48.google.com) (209.85.161.48) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 05 Jan 2011 03:29:17 +0000 Received: by fxm2 with SMTP id 2so14358352fxm.35 for ; Tue, 04 Jan 2011 19:28:57 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=kPURiyOdIn6tM5VLhzSnml3hJw7m64j9sPQODplvU5I=; b=l0jAddcDPpECXTR3L+shq0mjH/R18yt7sK+VqOfSQS5INSsNzf/a3roa2KDotImhvd D9Y5WKbloFXzUJA40Y9OC6gPMP42fxeAsnt90MuspJXujBfvD+zAQJX/yhnwenWhdBBF D7cuBnMi0RFcTIsZ6jUKtWUkbbgUId/LzbZeI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=RPOGaLyjPPFmYphpwCIp3xQVwWM4DjQkMVOIj0Z8LRBOckpp78qb5AYz5ucQCuochK Dxkg1WxGfOkeA4VkmL66prxT7AoO0ln4CTIt3Kl6O7USJHvQmuj/L7kBcqXKJwbmVZGT JN2CFRNw2u5YKgb7YZHhmyjoyMSYaXa5ar00U= MIME-Version: 1.0 Received: by 10.223.86.13 with SMTP id q13mr470060fal.53.1294198137778; Tue, 04 Jan 2011 19:28:57 -0800 (PST) Received: by 10.223.83.204 with HTTP; Tue, 4 Jan 2011 19:28:57 -0800 (PST) In-Reply-To: References: <4D22CCBB.9010409@orkash.com> Date: Tue, 4 Jan 2011 19:28:57 -0800 Message-ID: Subject: Re: Data for Testing in Hadoop From: Lance Norskog To: common-user@hadoop.apache.org Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org https://cwiki.apache.org/confluence/display/MAHOUT/Collections All the collections you can imagine. On Tue, Jan 4, 2011 at 12:28 AM, Harsh J wrote: > You can use MR to generate the data itself. Checkout GridMix in > Hadoop, or PigMix from Pig for examples on general load tests. > > On Tue, Jan 4, 2011 at 1:01 PM, Adarsh Sharma wrote: >> Dear all, >> >> Designing the architecture is very important for the Hadoop in Production >> Clusters. >> >> We are researching to run Hadoop in Cloud in Individual Nodes and in Cloud >> Environment ( VM's ). >> >> For this, I require some data for testing. Would anyone send me some links >> for data of different sizes ( 10Gb, 20GB, 30 Gb , 50GB ) . >> I shall be grateful for this kindness. >> >> >> Thanks & Regards >> >> Adarsh Sharma >> >> > > > > -- > Harsh J > www.harshj.com > -- Lance Norskog goksron@gmail.com