hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Greg Roelofs <roel...@yahoo-inc.com>
Subject Re: Losing Records with Block Compressed Sequence File
Date Mon, 24 Jan 2011 21:34:59 GMT
> A few days ago I tried my Unit test against bzip2 and found a similar
> effect: records go missing at the seems between the splits.

> Perhaps my unit test is buggy, perhaps you and I have independently
> found something that should be reported as a bug.

Probably.  I found a different bug (apparently?) in the bzip2 decoder
a while back:  HADOOP-6852.  If you're not concatenating bzip2 streams,
you're seeing something else.

Greg

Mime
View raw message