hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From 臧冬松 <donal0...@gmail.com>
Subject structured data split
Date Fri, 11 Nov 2011 07:43:47 GMT
Usually large file in HDFS is split into bulks and store in different
DataNodes.
A map task is assigned to deal with that bulk, I wonder what if the
Structured data(i.e a word) was split into two bulks?
How MapReduce and HDFS deal with this?

Thanks!
Donal

Mime
View raw message