hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Teodor Macicas <teodor.maci...@epfl.ch>
Subject Control the file splits size
Date Mon, 23 Aug 2010 10:38:16 GMT
Hi all,

Can anyone please tell me how to control the splits size ? I have one 
big file which will be splitted by the number of maps. The input file is 
binary and contains some objects. I do not want to split an object into 
2 separate files, for sure.
I overwrite the computeSplitSize() file and I forced the size to be a 
multiple of my objects size. It worked, but it seems that on certain 
points of the output file objects are missing. And now I am thinking 
that this could be my problem.

Have anyone faced this problem before ?

Thank you.

View raw message