hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ankur C. Goel" <gan...@yahoo-inc.com>
Subject Re: Data-Intensive Text Processing with MapReduce
Date Mon, 22 Feb 2010 06:47:44 GMT

Hi Jimmy,
            Congratulations on the good work. In chapter 6 it would be good to supplement
EM examples with more sudo code as the chapter is quite mathematical in nature.


On 2/19/10 9:53 PM, "Jimmy Lin" <jimmylin@umd.edu> wrote:

Hi everyone,

I'm pleased to present the first complete draft of a forthcoming book:

Data-Intensive Text Processing with MapReduce
by Jimmy Lin and Chris Dyer

The complete text is available at:

It's slated for publication by Morgan & Claypool in mid-2010.

This text is currently being used in the MapReduce course at the
University of Maryland.  The focus of the book is on algorithm design
and "thinking at scale".  Quite explicitly, the book is *not* about
Hadoop programming.  Tom White's book already does that quite well... :)

Table of Contents

    1. Introduction
    2. MapReduce Basics
    3. MapReduce algorithm design
    4. Inverted Indexing for Text Retrieval
    5. Graph Algorithms
    6. EM Algorithms for Text Processing
    7. Closing Remarks

We hope you find this resource helpful... Comments and feedback are welcome!


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message