spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Park Chan Min (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (SPARK-23872) You can not connect to another metastore uri using two Spark sessions
Date Thu, 05 Apr 2018 02:07:00 GMT

     [ https://issues.apache.org/jira/browse/SPARK-23872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Park Chan Min updated SPARK-23872:
----------------------------------
    Description: 
In Spark 2.1.0, two sessions worked normally In 2.3.0, the first session metasore information
is used when the second session is run

-------------------     Code    ----------------------
 val spark_1 = SparkSession.builder()
              .enableHiveSupport()
              .config("hive.metastore.uris", "thrift://HOST_A:9083")
              .getOrCreate()

spark_1.sql("SELECT A_FIELD FROM TABLE_A").show()

SparkSession.clearActiveSession()
SparkSession.clearDefaultSession()

val spark_2 = SparkSession.builder()
              .enableHiveSupport()
              .config("hive.metastore.uris", "thrift://HOST_B:9083")
              .getOrCreate()

spark_2.sql("SELECT B_FIELD FROM TABLE_B").show()

------------------- spark 2.1.0 result ------------------
INFO metastore: Trying to connect to metastore with URI thrift://HOST_A:9083
......
INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
+-------+
|A_FIELD|
+-------+
|     A   |
+-------+
......
INFO metastore: Trying to connect to metastore with URI thrift://HOST_B:9083
INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
+-------+
|B_FIELD|
+-------+
|   B   |
+-------+
......

------------------- spark 2.3.0 result ------------------
INFO metastore: Trying to connect to metastore with URI thrift://HOST_A:9083
......
INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
+-------+
|A_FIELD|
+-------+
|   A   |
+-------+
......
INFO metastore: Trying to connect to metastore with URI thrift://HOST_A:9083
INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
......
Exception in thread "main" org.apache.spark.sql.AnalysisException: Table or view not found:
`default`.`TABLE_B`; line 1 pos 19;
......

  was:
In Spark 2.1.0, two sessions worked normally In 2.3.0, the first session metasore information
is used when the second session is run

-------------------     Code    ----------------------
 val spark_1 = SparkSession.builder()
              .enableHiveSupport()
              .config("hive.metastore.uris", "thrift://HOST_A:9083")
              .getOrCreate()

spark_1.sql("SELECT A_FIELD FROM TABLE_A").show()

SparkSession.clearActiveSession()
SparkSession.clearDefaultSession()

val spark_2 = SparkSession.builder()
              .enableHiveSupport()
              .config("hive.metastore.uris", "thrift://HOST_B:9083")
              .getOrCreate()

spark_2.sql("SELECT B_FIELD FROM TABLE_B").show()

------------------- spark 2.1.0 result ------------------
INFO metastore: Trying to connect to metastore with URI thrift://HOST_A:9083
......
INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
+-------+
|A_FIELD|
+-------+
|   A   |
+-------+
......
INFO metastore: Trying to connect to metastore with URI thrift://HOST_B:9083
INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
+-------+
|B_FIELD|
+-------+
|   B   |
+-------+
......

------------------- spark 2.3.0 result ------------------
INFO metastore: Trying to connect to metastore with URI thrift://HOST_A:9083
......
INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
+-------+
|A_FIELD|
+-------+
|   A   |
+-------+
......
INFO metastore: Trying to connect to metastore with URI thrift://HOST_A:9083
INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
......
Exception in thread "main" org.apache.spark.sql.AnalysisException: Table or view not found:
`default`.`TABLE_B`; line 1 pos 19;
......


> You can not connect to another metastore uri using two Spark sessions
> ---------------------------------------------------------------------
>
>                 Key: SPARK-23872
>                 URL: https://issues.apache.org/jira/browse/SPARK-23872
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.3.0
>         Environment: OS  :CentOS release 6.8 (Final)
> JAVA : build 1.8.0_101-b13
> SPARK : 2.3.0
>  
>            Reporter: Park Chan Min
>            Priority: Major
>
> In Spark 2.1.0, two sessions worked normally In 2.3.0, the first session metasore information
is used when the second session is run
> -------------------     Code    ----------------------
>  val spark_1 = SparkSession.builder()
>               .enableHiveSupport()
>               .config("hive.metastore.uris", "thrift://HOST_A:9083")
>               .getOrCreate()
> spark_1.sql("SELECT A_FIELD FROM TABLE_A").show()
> SparkSession.clearActiveSession()
> SparkSession.clearDefaultSession()
> val spark_2 = SparkSession.builder()
>               .enableHiveSupport()
>               .config("hive.metastore.uris", "thrift://HOST_B:9083")
>               .getOrCreate()
> spark_2.sql("SELECT B_FIELD FROM TABLE_B").show()
> ------------------- spark 2.1.0 result ------------------
> INFO metastore: Trying to connect to metastore with URI thrift://HOST_A:9083
> ......
> INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
> +-------+
> |A_FIELD|
> +-------+
> |     A   |
> +-------+
> ......
> INFO metastore: Trying to connect to metastore with URI thrift://HOST_B:9083
> INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
> +-------+
> |B_FIELD|
> +-------+
> |   B   |
> +-------+
> ......
> ------------------- spark 2.3.0 result ------------------
> INFO metastore: Trying to connect to metastore with URI thrift://HOST_A:9083
> ......
> INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
> +-------+
> |A_FIELD|
> +-------+
> |   A   |
> +-------+
> ......
> INFO metastore: Trying to connect to metastore with URI thrift://HOST_A:9083
> INFO DAGScheduler: Job 3 finished: show at SparkHDFSTest.scala:20, took 0.807905 s
> ......
> Exception in thread "main" org.apache.spark.sql.AnalysisException: Table or view not
found: `default`.`TABLE_B`; line 1 pos 19;
> ......



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message