spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Xin Wu (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SPARK-18727) Support schema evolution as new files are inserted into table
Date Wed, 07 Dec 2016 19:36:58 GMT

    [ https://issues.apache.org/jira/browse/SPARK-18727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15729696#comment-15729696
] 

Xin Wu commented on SPARK-18727:
--------------------------------

I am currently working on ALTER TABLE ADD COLUMNS, to tables with provider = hive and will
submit a PR soon. Just wondering whether it will solve part of this JIRA. Please advise! Thanks!

> Support schema evolution as new files are inserted into table
> -------------------------------------------------------------
>
>                 Key: SPARK-18727
>                 URL: https://issues.apache.org/jira/browse/SPARK-18727
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.1.0
>            Reporter: Eric Liang
>            Priority: Critical
>
> Now that we have pushed partition management of all tables to the catalog, one issue
for scalable partition handling remains: handling schema updates.
> Currently, a schema update requires dropping and recreating the entire table, which does
not scale well with the size of the table.
> We should support updating the schema of the table, either via ALTER TABLE, or automatically
as new files with compatible schemas are appended into the table.
> cc [~rxin]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message