hadoop-common-dev mailing list archives

From "Paul Sutter" <sut...@gmail.com>
Subject Re: Redundant (?) lengths in SequenceFile
Date Tue, 27 Jun 2006 04:07:34 GMT
it's fine either way, i just wanted to know that it's deliberate. it certainly
doesn't cost much space.

an external sorter can work only on key types it understands, and our keys
are compound and may include strings, floats, and ints.

the length is no problem at all.

On 6/26/06, Doug Cutting <cutting@apache.org> wrote:
>
> Paul Sutter wrote:
> > However -> It still seems to me that the key length in the sequence file is
> > redundant.
>
> What if your keys are compound, containing, say, a combination of
> floats, ints and strings?  Then the key may not include a length of the
> entire key entry.  So you're seeking to optimize a special (if common)
> case.
>
> Doug
>
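
To illustrate the point about compound keys, here is a minimal sketch in Python. It is not Hadoop code. The record layout (record length, key length, key bytes, value bytes) is only loosely modeled on the thread's description, and the names encode_compound_key, write_record and read_raw_records are hypothetical. It shows a (string, float, int) key whose own serialization stores no total length, and a reader that can still split out the raw key bytes because the record carries a key length.

```python
import io
import struct


def encode_compound_key(name: str, score: float, count: int) -> bytes:
    """Serialize a hypothetical (string, float, int) key.

    Only the string part is length-prefixed; the total key length
    is not stored anywhere inside the key itself.
    """
    raw = name.encode("utf-8")
    # 2-byte string length, string bytes, 4-byte float, 4-byte int (big-endian)
    return struct.pack(">H", len(raw)) + raw + struct.pack(">fi", score, count)


def write_record(out, key: bytes, value: bytes) -> None:
    # Assumed layout: record length (key + value), key length, key, value.
    out.write(struct.pack(">ii", len(key) + len(value), len(key)))
    out.write(key)
    out.write(value)


def read_raw_records(data: bytes):
    """Yield (key_bytes, value_bytes) without knowing the key type."""
    stream = io.BytesIO(data)
    while True:
        header = stream.read(8)
        if len(header) < 8:
            return  # end of data
        record_len, key_len = struct.unpack(">ii", header)
        key = stream.read(key_len)
        value = stream.read(record_len - key_len)
        yield key, value


buf = io.BytesIO()
k1 = encode_compound_key("abc", 1.5, 7)
k2 = encode_compound_key("longer-name", 2.0, 9)
write_record(buf, k1, b"v1")
write_record(buf, k2, b"value2")
records = list(read_raw_records(buf.getvalue()))
```

The reader never parses the key fields, so it can copy or skip keys of any type. Sorting them in a meaningful order still needs a comparator that understands the key type, which is the limitation on external sorters mentioned above.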
