hama-dev mailing list archives

From Praveen Sripati <praveensrip...@gmail.com>
Subject Hama and Data Locality
Date Mon, 02 Apr 2012 16:17:51 GMT
> https://issues.apache.org/jira/browse/HAMA-543

> While working on it, I realized that this won't necessarily improve
> performance, because the resource requirements for Hama are different from
> those of Hadoop. This change would move the mapper tasks closer to the
> input, as in Hadoop. But in the case of Hama, tasks continue running on the
> same machine throughout their lifetime. If, in search of data locality, the
> tasks get scheduled such that communication between the nodes is costlier
> than normal (e.g. tasks resident in separate racks), then this change would
> degrade performance.
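
The trade-off described in the quoted paragraph can be sketched with a toy cost model: input is read once, while inter-task communication is paid on every superstep. This is only an illustration. The function names and all cost figures below are made-up assumptions, not Hama APIs or measurements.

```python
# Toy cost model; every number here is an illustrative assumption.

def total_cost(input_read_cost, per_superstep_comm_cost, supersteps):
    """One-time input read plus communication paid on every superstep."""
    return input_read_cost + per_superstep_comm_cost * supersteps


def data_local_cost(supersteps):
    # Hypothetical placement A: tasks sit next to their input splits
    # (cheap read) but end up in separate racks (expensive messaging).
    return total_cost(input_read_cost=10, per_superstep_comm_cost=5,
                      supersteps=supersteps)


def rack_local_cost(supersteps):
    # Hypothetical placement B: tasks read input remotely (expensive read)
    # but share a rack (cheap messaging).
    return total_cost(input_read_cost=40, per_superstep_comm_cost=2,
                      supersteps=supersteps)


if __name__ == "__main__":
    for s in (1, 10, 100):
        print(s, data_local_cost(s), rack_local_cost(s))
```

With these assumed numbers the two placements break even at 10 supersteps; beyond that, the data-local placement costs more, which is the degradation the quoted message warns about.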

Doesn't data locality improve the performance of Hama?

Praveen
