Hadoop's support for zlib library lacks support to perform flushes (Z_SYNC_FLUSH and Z_FULL_FLUSH)
--------------------------------------------------------------------------------------------------
Key: HADOOP-6297
URL: https://issues.apache.org/jira/browse/HADOOP-6297
Project: Hadoop Common
Issue Type: Improvement
Components: io
Affects Versions: 0.21.0
Reporter: Kevin J. Price
Priority: Minor
The zlib library supports the ability to perform two types of flushes when deflating data.
It can perform both a Z_SYNC_FLUSH, which forces all input to be written as output and byte-aligned
and resets the Huffman coding, and it also supports a Z_FULL_FLUSH, which does the same thing
but additionally resets the compression dictionary. The Hadoop wrapper for the zlib library
does not support either of these two methods.
Adding support should be fairly trivial. An additional deflate method that takes a fourth
"flush" parameter, and a modification to the native c code to accept this fourth parameter
and pass it along to the zlib library. I can submit a patch for this if desired.
It should be noted that the native SUN Java API is likewise missing this functionality, as
has been noted for over a decade here: http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4206909
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
|