hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Johan Oskarsson <jo...@oskarsson.nu>
Subject Merge sequence files
Date Tue, 15 May 2007 18:10:15 GMT
Hey.

I'm considering using the sequence file output of hadoop jobs to serve 
data from as it would mean I could skip the conversion from sequence 
file -> other file format step.

To do this efficiently I would need the data to be in one file.
I know that the sequence file class has code internally to merge files 
into one. Would it be possible to make these methods and internal 
classes available outside of SequenceFile?

/Johan

Mime
View raw message