hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ashutosh Chauhan (Commented) (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-2711) Make the header of RCFile unique
Date Sat, 24 Mar 2012 00:59:25 GMT

    [ https://issues.apache.org/jira/browse/HIVE-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13237371#comment-13237371
] 

Ashutosh Chauhan commented on HIVE-2711:
----------------------------------------

@Owen,
 I think original design of RCFile was done with compatibility with Sequence File in mind.
This patch will break that. Whats the advantage of this change?
                
> Make the header of RCFile unique
> --------------------------------
>
>                 Key: HIVE-2711
>                 URL: https://issues.apache.org/jira/browse/HIVE-2711
>             Project: Hive
>          Issue Type: Bug
>          Components: Serializers/Deserializers
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>         Attachments: HIVE-2711.D2115.1.patch
>
>
> The RCFile implementation was copied from Hadoop's SequenceFile and copied the 'magic'
string in the header. This means that you can't use the header to distinguish between RCFiles
and SequenceFiles.
> I'd propose that we create a new header for RCFiles (RCF?) to replace the current SEQ.
To maintain compatibility, we'll need to continue to accept the current 'SEQ\06' and just
make new files contain the new header.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Mime
View raw message