hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mapred Learn <mapred.le...@gmail.com>
Subject Sequence File usage queries
Date Thu, 17 Feb 2011 21:16:23 GMT
I have a use case to upload some tera-bytes of text files as sequences files
on HDFS.

These text files have several layouts ranging from 32 to 62 columns

What would be a good way to upload these files along with their metadata:

i) creating a key, value class per text file layout and use it to create and
upload as sequence files ?

ii) create SequenceFile.Metadata header in each file being uploaded as
sequence file individually ?

Any inputs are appreciated !


View raw message