From: Apache Wiki
To: hadoop-commits@lucene.apache.org
Reply-To: hadoop-dev@lucene.apache.org
Date: Tue, 08 Aug 2006 21:29:31 -0000
Message-ID: <20060808212931.18121.95600@ajax.apache.org>
Subject: [Lucene-hadoop Wiki] Update of "FrontPage" by OwenOMalley

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by OwenOMalley:
http://wiki.apache.org/lucene-hadoop/FrontPage

------------------------------------------------------------------------------
  [http://lucene.apache.org/hadoop/ Hadoop] is a framework for running applications on large clusters built of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion. Hadoop implements a computational paradigm named [:HadoopMapReduce: Map/Reduce], where the application is divided into many small fragments of work, each of which may be executed or reexecuted on any node in the cluster. In addition, it provides a distributed file system that stores data on the compute nodes and thereby provides very high aggregate bandwidth across the cluster. Both Map/Reduce and the distributed file system are designed so that node failures are automatically handled by the framework.

  The intent is to scale Hadoop up to handling thousands of computers. The current high water marks that have been reported are:
-  * !DataNodes: 620
-  * !TaskTrackers: 500
+  * Nodes in a single file system cluster (!DataNodes): 620
+  * Nodes in a single map/reduce cluster (!TaskTrackers): 500

  Hadoop was originally built as infrastructure for the [http://lucene.apache.org/nutch/ Nutch] project, which crawls the web and builds a search engine index for the crawled pages. Both Hadoop and Nutch are part of the [http://lucene.apache.org/java/docs/index.html Lucene] [http://www.apache.org/ Apache] project.
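
The Map/Reduce description above (work divided into small fragments, any of which may be re-executed after a node failure) can be illustrated with a toy, single-process word count. This is only a sketch of the idea and does not use Hadoop's API: `NodeFailure`, `run_with_retry`, `map_reduce`, and the `failing` parameter are hypothetical names invented for this example, and the "failure" is simulated rather than detected on a real cluster.

```python
from collections import defaultdict


class NodeFailure(Exception):
    """Simulated loss of the node that was running a fragment (hypothetical)."""


def map_fragment(lines):
    # Map step: emit a (word, 1) pair for every word in this fragment.
    return [(word.lower(), 1) for line in lines for word in line.split()]


def run_with_retry(fragment_id, lines, failing, max_attempts=3):
    # Re-execute a fragment until it succeeds. A real scheduler would
    # typically place each new attempt on a different node.
    for attempt in range(1, max_attempts + 1):
        try:
            if (fragment_id, attempt) in failing:
                raise NodeFailure(f"fragment {fragment_id}, attempt {attempt}")
            return map_fragment(lines)
        except NodeFailure:
            continue
    raise RuntimeError(f"fragment {fragment_id} failed {max_attempts} times")


def map_reduce(lines, fragment_size=2, failing=frozenset()):
    # Divide the input into small fragments of work.
    fragments = [lines[i:i + fragment_size]
                 for i in range(0, len(lines), fragment_size)]
    # Shuffle: group every emitted value by its key.
    grouped = defaultdict(list)
    for fragment_id, fragment in enumerate(fragments):
        for key, value in run_with_retry(fragment_id, fragment, failing):
            grouped[key].append(value)
    # Reduce step: sum the values collected for each key.
    return {key: sum(values) for key, values in grouped.items()}


lines = ["hadoop runs on commodity hardware",
         "hadoop stores data on compute nodes",
         "nutch builds a search index"]

# Fragment 0 fails on its first attempt and is re-executed; the caller
# sees the same result as a run with no failures.
counts = map_reduce(lines, failing={(0, 1)})
```

Because each fragment is a pure function of its input, re-running it after a failure yields the same pairs, which is why the result matches a failure-free run.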