hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Grandl Robert <rgra...@yahoo.com>
Subject HDFS splits based on content semantics
Date Wed, 01 Aug 2012 13:44:51 GMT

Probably this question is answered many times but I could not clarify yet after searching
on google. 

Does HDFS split the input solely based on fixed block size or take in consideration the semantics
of it ?
For example, if I have a binary file, or I want the block to not cut some lines of text, etc.
will I be able to instruct HDFS where to stop with each block ?


View raw message