hadoop-user mailing list archives

From Sai Sai <saigr...@yahoo.in>
Subject Re: Block vs FileSplit vs record vs line
Date Thu, 14 Mar 2013 08:45:53 GMT
Just wondering if this is the right way to understand this:
A large file is split into multiple blocks, each block is split into multiple file splits,
each file split has multiple records, and each record has multiple lines. Each line is
processed by one instance of the mapper.
Any help is appreciated.