hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mark Kerzner <markkerz...@gmail.com>
Subject Re: Data-Intensive Text Processing with MapReduce
Date Sun, 09 May 2010 18:06:47 GMT
Dear Jimmy and Chris:

I am reading your book (thank you for providing the pre-release version) and
I find it great in contents and in style. Thank you!


On Sat, May 8, 2010 at 1:25 PM, Jimmy Lin <jimmylin@umd.edu> wrote:

> Hi everyone,
> I'm pleased to announce the publication a new book on MapReduce algorithm
> design:
> Data-Intensive Text Processing with MapReduce
> by Jimmy Lin and Chris Dyer
> Morgan & Claypool Publishers, 2010
> http://mapreduce.me/
> Abstract
> Our world is being revolutionized by data-driven methods: access to large
> amounts of data has generated new insights and opened exciting new
> opportunities in commerce, science, and computing applications. Processing
> the enormous quantities of data necessary for these advances requires large
> clusters, making distributed computing paradigms more crucial than ever.
> MapReduce is a programming model for expressing distributed computations on
> massive datasets and an execution framework for large-scale data processing
> on clusters of commodity servers. The programming model provides an
> easy-to-understand abstraction for designing scalable algorithms, while the
> execution framework transparently handles many system-level details, ranging
> from scheduling to synchronization to fault tolerance. This book focuses on
> MapReduce algorithm design, with an emphasis on text processing algorithms
> common in natural language processing, information retrieval, and machine
> learning. We introduce the notion of MapReduce design patterns, which
> represent general reusable solutions to commonly occurring problems across a
> variety of problem domains. This book not only intends to help the reader
> "think in MapReduce", but also discusses limitations of the programming
> model as well.
> Table of Contents
>   1. Introduction
>   2. MapReduce Basics
>   3. MapReduce algorithm design
>   4. Inverted Indexing for Text Retrieval
>   5. Graph Algorithms
>   6. EM Algorithms for Text Processing
>   7. Closing Remarks
> Enjoy!
> -Jimmy

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message