hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Doug Cutting <cutt...@apache.org>
Subject Re: Merge sequence files
Date Tue, 15 May 2007 18:50:07 GMT
Johan Oskarsson wrote:
> I'm considering using the sequence file output of hadoop jobs to serve 
> data from as it would mean I could skip the conversion from sequence 
> file -> other file format step.
> To do this efficiently I would need the data to be in one file.

I think it should be more efficient to keep things in separate files. 
If you use MapFileOutputFormat, there are methods to randomly access 
entries from job output:


SequenceFileOutputFormat will also let you open all readers, but there's 
no random access, since a SequenceFile has no index.


Will these suffice?


View raw message