spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Reynold Xin (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-7993) Improve DataFrame.show() output
Date Mon, 01 Jun 2015 19:53:17 GMT

    [ https://issues.apache.org/jira/browse/SPARK-7993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14567887#comment-14567887
] 

Reynold Xin commented on SPARK-7993:
------------------------------------

Yes would be great to handle those as well. Maybe we can just hande Seq, which is a common
base data type.


> Improve DataFrame.show() output
> -------------------------------
>
>                 Key: SPARK-7993
>                 URL: https://issues.apache.org/jira/browse/SPARK-7993
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Reynold Xin
>            Priority: Blocker
>              Labels: starter
>
> 1. Each column should be at the minimum 3 characters wide. Right now if the widest value
is 1, it is just 1 char wide, which looks ugly. Example below:
> 2. If a DataFrame have more than N number of rows (N = 20 by default for show), at the
end we should display a message like "only showing the top 20 rows".
> {code}
> +--+--+-+
> | a| b|c|
> +--+--+-+
> | 1| 2|3|
> | 1| 2|1|
> | 1| 2|3|
> | 3| 6|3|
> | 1| 2|3|
> | 5|10|1|
> | 1| 2|3|
> | 7|14|3|
> | 1| 2|3|
> | 9|18|1|
> | 1| 2|3|
> |11|22|3|
> | 1| 2|3|
> |13|26|1|
> | 1| 2|3|
> |15|30|3|
> | 1| 2|3|
> |17|34|1|
> | 1| 2|3|
> |19|38|3|
> +--+--+-+
> only showing top 20 rows   <---- add this at the end
> {code}
> 3. For array values, instead of printing "ArrayBuffer", we should just print square brackets:
> {code}
> +------------------+------------------+-----------------+
> |       a_freqItems|       b_freqItems|      c_freqItems|
> +------------------+------------------+-----------------+
> |ArrayBuffer(11, 1)|ArrayBuffer(2, 22)|ArrayBuffer(1, 3)|
> +------------------+------------------+-----------------+
> {code}
> should be
> {code}
> +-----------+-----------+-----------+
> |a_freqItems|b_freqItems|c_freqItems|
> +-----------+-----------+-----------+
> |    [11, 1]|    [2, 22]|     [1, 3]|
> +-----------+-----------+-----------+
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message