spark-commits mailing list archives

Subject: spark git commit: [SPARK-18941][SQL][DOC] Add a new behavior document on `CREATE/DROP TABLE` with `LOCATION`
Date: Sun, 08 Jan 2017 02:55:33 GMT
Repository: spark
Updated Branches:
  refs/heads/branch-2.1 c95b58557 -> ecc16220d

[SPARK-18941][SQL][DOC] Add a new behavior document on `CREATE/DROP TABLE` with `LOCATION`

## What changes were proposed in this pull request?

This PR documents the `CREATE TABLE ... LOCATION` behavior change under
`Upgrading From Spark SQL 1.6 to 2.0`. The change was introduced in Apache Spark
2.0.0 as SPARK-15276.

## How was this patch tested?

SKIP_API=1 jekyll build

Author: Dongjoon Hyun <>

Closes #16400 from dongjoon-hyun/SPARK-18941.

(cherry picked from commit 923e594844a7ad406195b91877f0fb374d5a454b)
Signed-off-by: gatorsmile <>


Branch: refs/heads/branch-2.1
Commit: ecc16220d2d9eace81de44c4b0aff1c364a35e3f
Parents: c95b585
Author: Dongjoon Hyun <>
Authored: Sat Jan 7 18:55:01 2017 -0800
Committer: gatorsmile <>
Committed: Sat Jan 7 18:55:26 2017 -0800

 docs/ | 8 ++++++++
 1 file changed, 8 insertions(+)
diff --git a/docs/ b/docs/
index 58de0e1..fb3c6a7 100644
--- a/docs/
+++ b/docs/
@@ -1358,6 +1358,14 @@ options.
  - Dataset and DataFrame API `explode` has been deprecated, alternatively, use `functions.explode()` with `select` or `flatMap`
  - Dataset and DataFrame API `registerTempTable` has been deprecated and replaced by `createOrReplaceTempView`
+ - Changes to `CREATE TABLE ... LOCATION` behavior for Hive tables.
+    - From Spark 2.0, `CREATE TABLE ... LOCATION` is equivalent to `CREATE EXTERNAL TABLE ... LOCATION`
+      in order to prevent accidental dropping the existing data in the user-provided locations.
+      That means, a Hive table created in Spark SQL with the user-specified location is always a Hive external table.
+      Dropping external tables will not remove the data. Users are not allowed to specify the location for Hive managed tables.
+      Note that this is different from the Hive behavior.
+    - As a result, `DROP TABLE` statements on those tables will not remove the data.
 ## Upgrading From Spark SQL 1.5 to 1.6
  - From Spark 1.6, by default the Thrift server runs in multi-session mode. Which means each
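
The `CREATE TABLE ... LOCATION` note added in the diff above can be illustrated with a toy model. The sketch below is not Spark or Hive code: `ToyCatalog`, `create_table`, `drop_table`, and `demo` are hypothetical names, and a temporary directory stands in for both the warehouse and the user-provided location. It only mirrors the rule as the note states it (a user-specified location always yields an external table, and dropping an external table leaves the data in place), plus the usual Hive convention that dropping a managed table deletes its data.

```python
import os
import shutil
import tempfile


class ToyCatalog:
    """Toy stand-in for a metastore. Hypothetical; not a Spark or Hive API."""

    def __init__(self, warehouse_dir):
        self.warehouse_dir = warehouse_dir
        self.tables = {}  # table name -> (table type, data directory)

    def create_table(self, name, location=None):
        # Rule from the note: a user-specified location always yields an
        # external table; without a location the table is managed.
        if location is None:
            path = os.path.join(self.warehouse_dir, name)
            os.makedirs(path, exist_ok=True)
            self.tables[name] = ("MANAGED", path)
        else:
            self.tables[name] = ("EXTERNAL", location)
        return self.tables[name][0]

    def drop_table(self, name):
        table_type, path = self.tables.pop(name)
        # Only managed tables own their data; external data is left alone.
        if table_type == "MANAGED":
            shutil.rmtree(path, ignore_errors=True)


def demo():
    root = tempfile.mkdtemp()
    try:
        warehouse = os.path.join(root, "warehouse")
        user_dir = os.path.join(root, "user_data")
        os.makedirs(warehouse)
        os.makedirs(user_dir)
        data_file = os.path.join(user_dir, "part-00000")
        with open(data_file, "w") as f:
            f.write("1\n")

        catalog = ToyCatalog(warehouse)
        # Models: CREATE TABLE t_loc (id INT) LOCATION '<user_dir>'
        loc_type = catalog.create_table("t_loc", location=user_dir)
        # Models: CREATE TABLE t_managed (id INT)
        managed_type = catalog.create_table("t_managed")
        managed_path = catalog.tables["t_managed"][1]

        catalog.drop_table("t_loc")      # Models: DROP TABLE t_loc
        catalog.drop_table("t_managed")  # Models: DROP TABLE t_managed

        return {
            "loc_type": loc_type,
            "managed_type": managed_type,
            "user_data_survives": os.path.exists(data_file),
            "managed_data_survives": os.path.exists(managed_path),
        }
    finally:
        shutil.rmtree(root, ignore_errors=True)


if __name__ == "__main__":
    print(demo())
```

In this model the table created with a location is reported as external and its data file is still present after the drop, while the managed table's directory is gone. On a real cluster, the table type would have to be checked against the Spark version in use rather than inferred from this sketch.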
