hadoop-common-user mailing list archives

From "Bwolen Yang" <wbwo...@gmail.com>
Subject map interface to outputs of map/reduce
Date Mon, 04 Jun 2007 22:00:17 GMT

Given that map/reduce produces a partitioned set of sorted output
files, I was wondering whether a map implementation exists for doing
lookups on, or iterating through subranges of, these files.

This would be similar to the Java SortedMap, except that
  - the map is read-only,
  - it works with data on disk (instead of in memory),
  - for lookups, it should know how to trade off seeks (with binary
    search) against disk reads.
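
To make the request concrete, here is a minimal sketch of the kind of
reader described above, for a single sorted output file. It is not an
existing Hadoop class: the name SortedRunReader, its methods, and the
"interval" parameter are all hypothetical, and the file is simulated
with in-memory arrays. The idea is to keep a sparse index (every
interval-th key) in memory, binary-search that index to choose one
seek position, and then read at most "interval" records sequentially.
A larger interval means a smaller index but a longer read per lookup.
With several partitions, a lookup would presumably first pick the
file using the same partitioner the job used; that step is not shown.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical read-only view over one sorted run (simulated in memory). */
class SortedRunReader {
    private final String[] keys;      // stands in for sorted records on disk
    private final String[] values;
    private final int interval;       // one index entry per `interval` records
    private final String[] indexKeys; // sparse index, assumed to fit in memory

    SortedRunReader(String[] keys, String[] values, int interval) {
        if (interval <= 0 || keys.length != values.length) {
            throw new IllegalArgumentException("bad interval or array lengths");
        }
        this.keys = keys;
        this.values = values;
        this.interval = interval;
        int n = (keys.length + interval - 1) / interval;
        this.indexKeys = new String[n];
        for (int i = 0; i < n; i++) {
            indexKeys[i] = keys[i * interval];
        }
    }

    /** Largest index slot whose key is <= target, or -1 if there is none. */
    private int floorSlot(String target) {
        int lo = 0, hi = indexKeys.length - 1, ans = -1;
        while (lo <= hi) {
            int mid = (lo + hi) >>> 1;
            if (indexKeys[mid].compareTo(target) <= 0) {
                ans = mid;
                lo = mid + 1;
            } else {
                hi = mid - 1;
            }
        }
        return ans;
    }

    /** Point lookup: one "seek" via the index, then a bounded sequential scan. */
    String get(String key) {
        int slot = floorSlot(key);
        if (slot < 0) {
            return null; // key sorts before every record
        }
        int start = slot * interval;
        int end = Math.min(start + interval, keys.length);
        for (int i = start; i < end; i++) {
            int c = keys[i].compareTo(key);
            if (c == 0) return values[i];
            if (c > 0) break; // passed the spot where the key would be
        }
        return null;
    }

    /** Values with from <= key < to, in key order (like SortedMap.subMap). */
    List<String> subRange(String from, String to) {
        List<String> out = new ArrayList<>();
        int slot = floorSlot(from);
        int i = slot < 0 ? 0 : slot * interval;
        for (; i < keys.length && keys[i].compareTo(to) < 0; i++) {
            if (keys[i].compareTo(from) >= 0) {
                out.add(values[i]);
            }
        }
        return out;
    }
}
```

For example, with keys {"a", "c", "e", "g", "i"} and interval 2, the
index holds "a", "e", "i"; get("g") jumps to the block starting at
"e" and scans two records.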
