hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matt Pestritto (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HIVE-820) Describe Extended Line Breaks When Delimiter is \n
Date Wed, 06 Jan 2010 15:59:54 GMT

    [ https://issues.apache.org/jira/browse/HIVE-820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797150#action_12797150
] 

Matt Pestritto commented on HIVE-820:
-------------------------------------

All -

Do we have a decision on what you want the output to show ?  A few different ideas were being
thrown around.

I would rather replace only characters that would break the output ( tab, \n ) with something
meaningful vs, as Edward stated, always showing the octal representation which would require
an ascii table to figure out what the delimiter is.  If something is | ( pipe ) delimited,
I always need to look it up when that is a printable character.

I'll wait for feedback from the FB team and make the changes.

Thanks.

> Describe Extended Line Breaks When Delimiter is \n
> --------------------------------------------------
>
>                 Key: HIVE-820
>                 URL: https://issues.apache.org/jira/browse/HIVE-820
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.2.0, 0.3.0, 0.3.1, 0.3.2, 0.4.0, 0.5.0
>            Reporter: Matt Pestritto
>            Assignee: Matt Pestritto
>            Priority: Minor
>             Fix For: 0.5.0
>
>         Attachments: hive_820.patch
>
>
> Tables defined delimited with \t and breaks using \n has output of describe extended
that is not contiguous.
> Line.delim outputs an actual \n which breaks the display output so using the hiveservice
you have to do another FetchOne to get the rest of the line.
> For example.
> Original Output:
> Detailed Table Information    Table(tableName:cobra_merchandise, dbName:default, owner:hive,
createTime:1248726291, lastAccessTime:0, retention:0, sd:StorageDescriptor(cols:[FieldSchema(name:merchandise_tid,
type:string, comment:null), FieldSchema(name:client_merch_type_tid, type:string, comment:null),
FieldSchema(name:description, type:string, comment:null), FieldSchema(name:client_description,
type:string, comment:null), FieldSchema(name:price, type:string, comment:null), FieldSchema(name:cost,
type:string, comment:null), FieldSchema(name:start_date, type:string, comment:null), FieldSchema(name:end_date,
type:string, comment:null)], location:hdfs://mustique:9000/user/hive/warehouse/m, inputFormat:org.apache.hadoop.mapred.TextInputFormat,
outputFormat:org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat, compressed:false,
numBuckets:-1, serdeInfo:SerDeInfo(name:null, serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe,
parameters:{serialization.format=9,line.delim=
> ,field.delim=    }), bucketCols:[], sortCols:[], parameters:{}), partitionKeys:[FieldSchema(name:client_tid,
type:int, comment:null)], parameters:{})   
> Proposed Output:
> Detailed Table Information    Table(tableName:cobra_merchandise, dbName:default, owner:hive,
createTime:1248726291, lastAccessTime:0, retention:0, sd:StorageDescriptor(cols:[FieldSchema(name:merchandise_tid,
type:string, comment:null), FieldSchema(name:client_merch_type_tid, type:string, comment:null),
FieldSchema(name:description, type:string, comment:null), FieldSchema(name:client_description,
type:string, comment:null), FieldSchema(name:price, type:string, comment:null), FieldSchema(name:cost,
type:string, comment:null), FieldSchema(name:start_date, type:string, comment:null), FieldSchema(name:end_date,
type:string, comment:null)], location:hdfs://mustique:9000/user/hive/warehouse/m, inputFormat:org.apache.hadoop.mapred.TextInputFormat,
outputFormat:org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat, compressed:false,
numBuckets:-1, serdeInfo:SerDeInfo(name:null, serializationLib:org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe,
parameters:{serialization.format=9,line.delim=<LF>,field.delim=<TAB>}), bucketCols:[],
sortCols:[], parameters:{}), partitionKeys:[FieldSchema(name:client_tid, type:int, comment:null)],
parameters:{})   

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message