spark-issues mailing list archives

From "Apache Spark (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-3103) Fix UTF8 encoding in PySpark saveAsTextFile().
Date Mon, 18 Aug 2014 18:02:30 GMT

    [ https://issues.apache.org/jira/browse/SPARK-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14100958#comment-14100958 ]

Apache Spark commented on SPARK-3103:
-------------------------------------

User 'davies' has created a pull request for this issue:
https://github.com/apache/spark/pull/2018

> Fix UTF8 encoding in PySpark saveAsTextFile().
> ----------------------------------------------
>
>                 Key: SPARK-3103
>                 URL: https://issues.apache.org/jira/browse/SPARK-3103
>             Project: Spark
>          Issue Type: Bug
>          Components: PySpark
>    Affects Versions: 1.0.2, 1.1.0
>            Reporter: Josh Rosen
>              Labels: starter
>
> This is a follow-up JIRA for https://github.com/apache/spark/pull/1914, where Ahir and Davies
> identified a bug in Python JsonRDD when trying to encode non-ASCII strings into unicode.
> The same underlying issue affects saveAsTextFile, so we should apply the same fix there, too,
> and search for any other code that needs to be updated (and maybe refactor this out into
> a utility function).
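
For illustration only, the kind of utility function the description suggests might look like the sketch below. It is not the code from the pull request: the name to_utf8_line is hypothetical, and the sketch is written for Python 3, whereas the affected versions ran on Python 2. The idea is to turn every record into UTF-8 bytes in one place before it is written out as a line of text.

```python
def to_utf8_line(record):
    """Hypothetical helper: return `record` as UTF-8 encoded bytes."""
    if isinstance(record, bytes):
        # Already encoded: pass through unchanged.
        return record
    if not isinstance(record, str):
        # Non-string records are converted to their text form first.
        record = str(record)
    # Encode explicitly as UTF-8 so non-ASCII characters survive.
    return record.encode("utf-8")


# Example: mixed records, including a non-ASCII string.
lines = [to_utf8_line(r) for r in ["caf\u00e9", 42, b"raw"]]
```

Keeping the conversion in a single helper means every writer that emits text lines can share the same encoding rule instead of repeating it.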



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org

