hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Harsh J (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-2250) "DESCRIBE EXTENDED table_name" shows inconsistent compression information.
Date Fri, 30 Nov 2012 11:21:58 GMT

    [ https://issues.apache.org/jira/browse/HIVE-2250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13507271#comment-13507271
] 

Harsh J commented on HIVE-2250:
-------------------------------

If we don't really make use of the IS_COMPRESSED attribute of a table, should we just get
rid of it (or at least not print it in the {{describe extended/formatted}} output, which causes
great confusion as it is always certainly {{No}})?
                
> "DESCRIBE EXTENDED table_name" shows inconsistent compression information.
> --------------------------------------------------------------------------
>
>                 Key: HIVE-2250
>                 URL: https://issues.apache.org/jira/browse/HIVE-2250
>             Project: Hive
>          Issue Type: Bug
>          Components: CLI, Diagnosability
>    Affects Versions: 0.7.0
>         Environment: RHEL, Full Cloudera stack
>            Reporter: Travis Powell
>            Assignee: subramanian raghunathan
>            Priority: Critical
>         Attachments: HIVE-2250.patch
>
>
> Commands executed in this order:
> user@node # hive
> hive> SET hive.exec.compress.output=true; 
> hive> SET io.seqfile.compression.type=BLOCK;
> hive> CREATE TABLE table_name ( [...] ) ROW FORMAT DELIMITED FIELDS TERMINATED BY
'\t' STORED AS SEQUENCEFILE;
> hive> CREATE TABLE staging_table ( [...] ) ROW FORMAT DELIMITED FIELDS TERMINATED
BY '\t';
> hive> LOAD DATA LOCAL INPATH 'file:///root/input/' OVERWRITE INTO TABLE staging_table;
> hive> INSERT OVERWRITE TABLE table_name SELECT * FROM staging_table;
> (Map reduce job to change to sequence file...)
> hive> DESCRIBE EXTENDED table_name;
> Detailed Table Information      Table(tableName:table_name, dbName:benchmarking, owner:root,
createTime:1309480053, lastAccessTime:0, retention:0, sd:StorageDescriptor(cols:[FieldSchema(name:session_key,
type:string, comment:null), FieldSchema(name:remote_address, type:string, comment:null), FieldSchema(name:canister_lssn,
type:string, comment:null), FieldSchema(name:canister_session_id, type:bigint, comment:null),
FieldSchema(name:tltsid, type:string, comment:null), FieldSchema(name:tltuid, type:string,
comment:null), FieldSchema(name:tltvid, type:string, comment:null), FieldSchema(name:canister_server,
type:string, comment:null), FieldSchema(name:session_timestamp, type:string, comment:null),
FieldSchema(name:session_duration, type:string, comment:null), FieldSchema(name:hit_count,
type:bigint, comment:null), FieldSchema(name:http_user_agent, type:string, comment:null),
FieldSchema(name:extractid, type:bigint, comment:null), FieldSchema(name:site_link, type:string,
comment:null), FieldSchema(name:dt, type:string, comment:null), FieldSchema(name:hour, type:int,
comment:null)], location:hdfs://hadoop2/user/hive/warehouse/benchmarking.db/table_name, inputFormat:org.apache.hadoop.mapred.SequenceFileInputFormat,
outputFormat:org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat, compressed:false,
numBuckets:-1, serdeInfo:SerDeInfo(name:null, serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe,
parameters:{serialization.format=   , field.delim=
> *** SEE ABOVE: Compression is set to FALSE, even though contents of table is compressed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Mime
View raw message