spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Ankur (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (SPARK-21755) Spark 2.1.1 UI page not displaying any dynamic updates on job progress after showing progress for initial few minutes of job run.
Date Thu, 14 Sep 2017 03:42:00 GMT

    [ https://issues.apache.org/jira/browse/SPARK-21755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16165678#comment-16165678
] 

Ankur edited comment on SPARK-21755 at 9/14/17 3:41 AM:
--------------------------------------------------------

Issue can also be reproduced on Spark 2.2 version on an EMR cluster with configurations: 

* EMR Release label:      emr-5.8.0
* Hadoop distribution:      Amazon 2.7.3





was (Author: ankurkasliwal26):
Issue can also be reproduced on Spark 2.2 version on an EMR cluster with same configurations:


* EMR Release label:      emr-5.8.0
* Hadoop distribution:      Amazon 2.7.3




> Spark 2.1.1 UI page not displaying any dynamic updates on job progress after showing
progress for initial few minutes of job run.
> ---------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-21755
>                 URL: https://issues.apache.org/jira/browse/SPARK-21755
>             Project: Spark
>          Issue Type: Bug
>          Components: Web UI
>    Affects Versions: 2.1.1
>         Environment: Issue was produced on an EMR cluster with following configurations:
> ### EMR Release label:      emr-5.6.0
> ### Hadoop distribution:      Amazon 2.7.3
> ### Applications installed:   Hive 2.1.1, Spark 2.1.1
>            Reporter: Ankur
>
> When a Spark SQL job is ran, Spark Application’s Web Console ( UI ) is getting intermittently
updated for initial few minutes ( ~ 10-15 minutes ) and after that there are no updates on
job progress ( even after job execution completes).  As soon as "Spark SQL" session is terminated
I can see Spark UI got updated with the job summary.
> Issue was reproduced by using spark-sql on a data-set of around 1.2 TB size. Here are
the steps:
> Step 1> An EMR cluster is launched ( release emr-5.6.0 and applications as Hive 2.1.1,
Spark 2.1.1 )
> Step 2>> Following command is ran:
> spark-sql> CREATE TABLE total_flights USING com.databricks.spark.csv OPTIONS (path
"s3://bucket/test_web_UI/flight/", header "true", inferSchema "true");
> Data-set used : Flights history in CSV files provided by US Department of Transportation,
Bureau of Transportation Statistics - https://www.transtats.bts.gov/DL_SelectFields.asp?Table_ID=236&DB_Short_Name=On-Time
> Step 3> There were no updates on Web UI after initial ~10 minutes. Web UI did not
got updated even after few hours when job was completed successfully. 
> Step 4> Once the spark-sql session is ended, Spark UI got updated with the job summary
correctly as expected. 
> I have verified that "spark.history.fs.update.interval" is set to default value of 10
seconds as mentioned in this document "https://spark.apache.org/docs/latest/monitoring.html
".  



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message