hadoop-hdfs-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From maninder batth <batth.manin...@gmail.com>
Subject Clarification on T file
Date Fri, 20 Apr 2012 14:49:15 GMT
My requirements are to save variable sized binary records and ability to
query them later on. So i was looking at Tfile and had some doubts.

1. Is the datablock in the tfile a fixed size or variable size? If it is
fixed, what happens when a record cannot fit in the datablock? Would you
normally fill the empty space with zeros or spread the record over 2
datablocks?

2. Is there any downside of having a variable sized datablocks?

3. Are the records synced with file at the boundary of a datablock or they
just written to file system. The question is like write() call in linux vs
fsync()?

Thank you,
-- Maninder Batth
No trees were killed in the creation of this message. However, many
electrons were terrible inconvenienced.

Mime
View raw message