spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Apache Spark (JIRA)" <>
Subject [jira] [Commented] (SPARK-24215) Implement __repr__ and _repr_html_ for dataframes in PySpark
Date Sat, 19 May 2018 12:05:00 GMT


Apache Spark commented on SPARK-24215:

User 'xuanyuanking' has created a pull request for this issue:

> Implement __repr__ and _repr_html_ for dataframes in PySpark
> ------------------------------------------------------------
>                 Key: SPARK-24215
>                 URL:
>             Project: Spark
>          Issue Type: Improvement
>          Components: PySpark, Spark Core, SQL
>    Affects Versions: 2.3.0
>            Reporter: Ryan Blue
>            Priority: Major
> To help people that are new to Spark get feedback more easily, we should implement the
repr methods for Jupyter python kernels. That way, when users run pyspark in jupyter console
or notebooks, they get good feedback about the queries they've defined.
> This should include an option for eager evaluation, (maybe spark.jupyter.eager-eval?).
When set, the formatting methods would run dataframes and produce output like {{show}}. This
is a good balance between not hiding Spark's action behavior and getting feedback to users
that don't know to call actions.
> Here's the dev list thread for context:

This message was sent by Atlassian JIRA

To unsubscribe, e-mail:
For additional commands, e-mail:

View raw message