hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Johan Oskarsson <jo...@oskarsson.nu>
Subject Merge sequence files
Date Tue, 15 May 2007 18:10:15 GMT

I'm considering using the sequence file output of hadoop jobs to serve 
data from as it would mean I could skip the conversion from sequence 
file -> other file format step.

To do this efficiently I would need the data to be in one file.
I know that the sequence file class has code internally to merge files 
into one. Would it be possible to make these methods and internal 
classes available outside of SequenceFile?


View raw message