Intermediate data is written to local disk, not to HDFS.

Ian.

On May 21, 2013, at 1:57 PM, John Lilley <john.lilley@redpoint.net> wrote:

When MapReduce enters “shuffle” to partition the tuples, I am assuming that it writes intermediate data to HDFS.  What replication factor is used for those temporary files?
john
 


---

Ian Wrigley
Sr. Curriculum Manager
Cloudera, Inc
Cell: (323) 819 4075