hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Zhong Wang <wangzhong....@gmail.com>
Subject Re: Large size Text file split
Date Thu, 11 Jun 2009 14:47:48 GMT
> Mapper 2 starts reading at byte 10000. It finds the first newline at byte
> 10020, so the first "real" record it processes starts at byte 10021.
>

There's one problem: how does Mapper2 know the "real" record start at
10021 before Mapper1 reach the end of Split1 (9999)? Mappers starts at
the same time.

-- 
Zhong Wang

Mime
View raw message