hadoop-common-dev mailing list archives

From "Ashish Thusoo (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-4169) 'compressed' keyword in DDL syntax misleading and does not compress
Date Thu, 18 Sep 2008 22:44:44 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-4169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12632425#action_12632425 ]

Ashish Thusoo commented on HADOOP-4169:
---------------------------------------

Sorry, my mistake. The patch is OK.

> 'compressed' keyword in DDL syntax misleading and does not compress
> -------------------------------------------------------------------
>
>                 Key: HADOOP-4169
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4169
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: contrib/hive
>            Reporter: Joydeep Sen Sarma
>            Assignee: Joydeep Sen Sarma
>             Fix For: 0.19.0
>
>         Attachments: 4169-1.txt
>
>
> Hive produces two types of data files: flat files and sequencefiles. The syntax should reflect
> this. Currently the 'compressed' keyword is used to choose the sequencefile format, but it does
> not actually compress the files. This is misleading. In addition, flat files can also be
> compressed.
> The proposal is to replace 'compressed' with 'sequencefile'. Compression options should
> be applied in the standard Hadoop way of specifying whether output should be compressed ('mapred.output.compress'),
> i.e. as session options (session options will also define the codec etc.). The default file format
> and compression options can be specified in the conf file.
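
The separation proposed above (file format chosen in the DDL, compression chosen through session or conf options) can be sketched roughly as follows. This is only an illustrative model, not Hive's implementation: the function `resolve_output`, the format names `"flat"` and `"sequencefile"` as string values, and the lookup order (session options over conf-file defaults over built-in defaults) are assumptions made for the example. `mapred.output.compress` comes from the issue text; `mapred.output.compression.codec` is the companion Hadoop property and is not mentioned in the issue.

```python
# Hypothetical model of the proposal: file format and compression are
# independent choices. Nothing here is Hive's actual code.

BUILTIN_DEFAULTS = {
    "mapred.output.compress": "false",
    # Companion Hadoop property; not named in the issue itself.
    "mapred.output.compression.codec": "org.apache.hadoop.io.compress.DefaultCodec",
}

FILE_FORMATS = ("flat", "sequencefile")


def resolve_output(file_format, session_options=None, conf_defaults=None):
    """Return (file_format, compressed, codec) for a table's output files.

    Assumed precedence: session options override conf-file defaults,
    which override the built-in defaults.
    """
    if file_format not in FILE_FORMATS:
        raise ValueError("unknown file format: %r" % (file_format,))

    options = dict(BUILTIN_DEFAULTS)
    options.update(conf_defaults or {})
    options.update(session_options or {})

    compressed = options["mapred.output.compress"].lower() == "true"
    codec = options["mapred.output.compression.codec"] if compressed else None
    # The format never implies compression, and compression never
    # changes the format.
    return file_format, compressed, codec
```

Under this model, choosing 'sequencefile' alone leaves the output uncompressed, and a flat file becomes compressed only when the session (or conf file) sets `mapred.output.compress` to true.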

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

