spark-issues mailing list archives

From "Sean Owen (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SPARK-8296) Not able to load Dataframe using Python throws py4j.protocol.Py4JJavaError
Date Thu, 11 Jun 2015 07:48:00 GMT

     [ https://issues.apache.org/jira/browse/SPARK-8296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated SPARK-8296:
-----------------------------
    Component/s: SQL
                 PySpark

> Not able to load Dataframe using Python throws py4j.protocol.Py4JJavaError
> --------------------------------------------------------------------------
>
>                 Key: SPARK-8296
>                 URL: https://issues.apache.org/jira/browse/SPARK-8296
>             Project: Spark
>          Issue Type: Bug
>          Components: PySpark, SQL
>    Affects Versions: 1.3.1
>            Reporter: ABHISHEK CHOUDHARY
>              Labels: test
>
> Loading a JSON file with SQLContext in the prebuilt spark-1.3.1-bin-hadoop2.4 distribution throws py4j.protocol.Py4JJavaError:
> from pyspark.sql import SQLContext
> from pyspark import SparkContext
> sc = SparkContext()
> sqlContext = SQLContext(sc)
> # Create the DataFrame
> df = sqlContext.jsonFile("changes.json")
> # Show the content of the DataFrame
> df.show()
> Error thrown -
>   File "/Users/abhishekchoudhary/Work/python/evolveML/kaggle/avirto/test.py", line 11, in <module>
>     df = sqlContext.jsonFile("changes.json")
>   File "/Users/abhishekchoudhary/bigdata/cdh5.2.0/spark-1.3.1/python/pyspark/sql/context.py", line 377, in jsonFile
>     df = self._ssql_ctx.jsonFile(path, samplingRatio)
>   File "/Users/abhishekchoudhary/bigdata/cdh5.2.0/spark-1.3.1/python/lib/py4j-0.8.2.1-src.zip/py4j/java_gateway.py", line 538, in __call__
>   File "/Users/abhishekchoudhary/bigdata/cdh5.2.0/spark-1.3.1/python/lib/py4j-0.8.2.1-src.zip/py4j/protocol.py", line 300, in get_return_value
> py4j.protocol.Py4JJavaError
> On checking the source code, I found that 'gateway_client' is not valid.
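
The traceback above stops at Py4JJavaError without the Java-side message, so the report does not establish the actual cause. One thing that may be worth ruling out is the input format: jsonFile in Spark 1.3 expects each line of the file to be a separate, self-contained JSON object. The following is a minimal sketch of such a check, using only the Python standard library. The helper name find_bad_json_lines is hypothetical and is not part of PySpark, and the check assumes the file is UTF-8 text.

```python
import json


def find_bad_json_lines(path):
    """Return (line_number, reason) pairs for lines that are not standalone JSON objects.

    Hypothetical helper for illustration; it is not part of PySpark.
    """
    problems = []
    with open(path, encoding="utf-8") as handle:
        for number, line in enumerate(handle, start=1):
            text = line.strip()
            if not text:
                continue  # this sketch ignores blank lines
            try:
                value = json.loads(text)
            except ValueError as exc:  # json.JSONDecodeError subclasses ValueError
                problems.append((number, str(exc)))
                continue
            if not isinstance(value, dict):
                problems.append((number, "not a JSON object"))
    return problems
```

Calling find_bad_json_lines("changes.json") returns an empty list when every non-blank line is a JSON object. A non-empty result would suggest, but not prove, that the file layout contributes to the error.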



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org

