hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Paul Sutter" <sut...@gmail.com>
Subject Re: Redundant (?) lengths in SequenceFile
Date Mon, 19 Jun 2006 17:46:09 GMT
How can I define a comparator that can compare raw binary data? (our keys
are raw binary data)


On 6/19/06, Doug Cutting <cutting@apache.org> wrote:
>
> Paul Sutter wrote:
> > Is there a reason for the key and record length included in the
> > SequenceFile
> > format?
>
> This permits code to process entries without parsing them, an important
> optimization.  For example, when seeking a MapFile, keys are parsed but
> not values.  When sorting, if a comparator is defined that can compare
> raw binary data, then even keys are not parsed.
>
> Doug
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message