spark-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Zhenhua Wang (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SPARK-18911) Decouple Statistics and CatalogTable
Date Sat, 17 Dec 2016 05:26:58 GMT
Zhenhua Wang created SPARK-18911:
------------------------------------

             Summary: Decouple Statistics and CatalogTable
                 Key: SPARK-18911
                 URL: https://issues.apache.org/jira/browse/SPARK-18911
             Project: Spark
          Issue Type: Sub-task
          Components: SQL
            Reporter: Zhenhua Wang


Statistics in LogicalPlan should use attributes to refer to columns rather than column names,
because two columns from two relations can have the same column name. But CatalogTable doesn't
have the concepts of attribute or broadcast hint in Statistics. Therefore, putting Statistics
in CatalogTable is confusing. We need to define a different statistic structure in CatalogTable,
which is only responsible for interacting with metastore, and is converted to statistics in
LogicalPlan when it is used.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org


Mime
View raw message