Return-Path: Delivered-To: apmail-hadoop-general-archive@minotaur.apache.org Received: (qmail 49009 invoked from network); 15 Aug 2010 03:50:02 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 15 Aug 2010 03:50:02 -0000 Received: (qmail 30714 invoked by uid 500); 15 Aug 2010 03:50:01 -0000 Delivered-To: apmail-hadoop-general-archive@hadoop.apache.org Received: (qmail 30561 invoked by uid 500); 15 Aug 2010 03:49:59 -0000 Mailing-List: contact general-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: general@hadoop.apache.org Delivered-To: mailing list general@hadoop.apache.org Received: (qmail 30553 invoked by uid 99); 15 Aug 2010 03:49:58 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 15 Aug 2010 03:49:58 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,SPF_PASS,T_TO_NO_BRKTS_FREEMAIL X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of thinke365@gmail.com designates 74.125.82.48 as permitted sender) Received: from [74.125.82.48] (HELO mail-ww0-f48.google.com) (74.125.82.48) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 15 Aug 2010 03:49:50 +0000 Received: by wwb22 with SMTP id 22so5741951wwb.29 for ; Sat, 14 Aug 2010 20:49:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=QkRD85raJYRUQ/rkLIRKESjB5usvrm6gqYr+F/aePgY=; b=uBywzI2DulDRFw1ldQyp/ymTzOe59M0Vedm1JPbL0di2SxkidM8V6b4f2N/MJAbZ35 uakgBN9VqbaYJSolkqCIzxWj4UaWupL9YjTtRo8jIIOtLovXb8jfY0f79NY87hvgSi/+ 25IFfjkNCFKNLPeraNlyP8UQxV0kuhohrYr0s= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=Cp7ISrzehndkSRLxTPuCNiQ56M2WDiF9fUraFzj31aUvX/3pAfXPE/AUGKMKIv0Qkx QFavPNyWsZfLskYrvMw4u+YoWEtBSqUfa30nbRV2llgvA5AnZudGJDzY89TJdQhIKXV+ dkRPVXVoqvcsVXI5yvlq3Pc71DetQYGXygNvk= MIME-Version: 1.0 Received: by 10.216.54.73 with SMTP id h51mr1156998wec.100.1281844170272; Sat, 14 Aug 2010 20:49:30 -0700 (PDT) Received: by 10.216.234.7 with HTTP; Sat, 14 Aug 2010 20:49:30 -0700 (PDT) In-Reply-To: References: Date: Sun, 15 Aug 2010 11:49:30 +0800 Message-ID: Subject: Re: Hadoop basics From: smith jack To: general@hadoop.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Virus-Checked: Checked by ClamAV on apache.org There is still a long way to go before you start debuging hadoop, the first point is that, you should run hadoop first! how to make this done. simple run bin/start-all, then hadoop can response to your request. you can use FsShell when you begin. hope you will enjoy it 2010/8/15 Rita Liu : > Hi! > > I am a total beginner, but I am very interested in hadoop. I've already > downloaded hadoop 0.19.2 and run on Ubuntu in single-node mode. Now I want > to do two things: > > 1. Explore how hadoop works internally with one of the example applications > hadoop provides > 2. Write an application on my own > > Those two things bring me following questions: > > a. debugger? > I am stuck since I don't know how to "explore" hadoop. I used to trace > through the code using a debugger, but in this case, I don't know if there > is a good debugger to use; or -- maybe a debugger is not necessary for > hadoop? If not, then how do you trace through the code to either debug or > just gain an understanding about the system? May I know what you, > experienced experts, do? :) > > b. Where to run hadoop? > Also -- may I know where you run your hadoop? Do you run on linux, or on VM > -- in particular, Cloudera? I heard that Cloudera is good for writing > mapreduce applications with hadoop itself as a blackbox; is it true? If my > ultimate goal is to understand how hadoop works internally, would it be > better if I directly run it on linux? > > c. Single-node or multi-node? > In the beginning (just like my case :p) would it be better to use > single-node or multi-node? If the latter is true, should I obtain more > machines, or should I use more virtual machines to create more nodes? > > As a newbie, I am sorry for all those basic (and silly, I know :$) > questions. If possible, please help me out? Any suggestion or advice will be > greatly appreciated. Thank you very much! > > Best, > Rita :) > > P.S. If my questions are not suitable for this mailing-list, please let me > apologize, and then, could you please direct me to other mailing-lists? > Sorry, and thanks a lot! :) >