hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Tom White <...@cloudera.com>
Subject Re: A record version mismatch occured. Expecting v6, found v32
Date Mon, 02 Feb 2009 15:58:01 GMT
The SequenceFile format is described here:
http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/io/SequenceFile.html.
The format of the keys and values depends on the serialization classes
used. For example, BytesWritable writes out the length of its byte
array followed by the actual bytes in the array (see the write()
method in BytesWritable).

Hope this helps.
Tom

On Mon, Feb 2, 2009 at 3:21 PM, Rasit OZDAS <rasitozdas@gmail.com> wrote:
> I tried to use SequenceFile.Writer to convert my binaries into Sequence
> Files,
> I read the binary data with FileInputStream, getting all bytes with
> reader.read(byte[])  , wrote it to a file with SequenceFile.Writer, with
> parameters NullWritable as key, BytesWritable as value. But the content
> changes,
> (I can see that by converting to Base64)
>
> Binary File:
> 73 65 65 65 81 65 65 65 65 65 81 81 65 119 84 81 65 111 67 81 65 52 57 81 65
> 103 54 81 65 65 97 81 65 65 65 81 ...
>
> Sequence File:
> 73 65 65 65 65 69 65 65 65 65 65 65 65 69 66 65 65 77 66 77 81 103 67 103 67
> 69 77 65 52 80 86 67 65 73 68 114 ...
>
> Thanks for any points..
> Rasit
>
> 2009/2/2 Rasit OZDAS <rasitozdas@gmail.com>
>
>> Hi,
>> I tried to use SequenceFileInputFormat, for this I appended "SEQ" as first
>> bytes of my "binary" files (with hex editor).
>> but I get this exception:
>>
>> A record version mismatch occured. Expecting v6, found v32
>>     at
>> org.apache.hadoop.io.SequenceFile$Reader.init(SequenceFile.java:1460)
>>     at
>> org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1428)
>>     at
>> org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1417)
>>     at
>> org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1412)
>>     at
>> org.apache.hadoop.mapred.SequenceFileRecordReader.<init>(SequenceFileRecordReader.java:43)
>>     at
>> org.apache.hadoop.mapred.SequenceFileInputFormat.getRecordReader(SequenceFileInputFormat.java:58)
>>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:321)
>>     at org.apache.hadoop.mapred.Child.main(Child.java:155)
>>
>> What could it be? Is it not enough just to add "SEQ" to binary files?
>> I use Hadoop v.0.19.0 .
>>
>> Thanks in advance..
>> Rasit
>>
>>
>> different *version* of *Hadoop* between your server and your client.
>>
>> --
>> M. Raşit ÖZDAŞ
>>
>
>
>
> --
> M. Raşit ÖZDAŞ
>

Mime
View raw message