spark-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From wenc...@apache.org
Subject [spark] branch branch-3.0 updated: [SPARK-32095][SQL] Update documentation to reflect usage of updated statistics
Date Wed, 01 Jul 2020 07:16:18 GMT
This is an automated email from the ASF dual-hosted git repository.

wenchen pushed a commit to branch branch-3.0
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/branch-3.0 by this push:
     new 49855e7  [SPARK-32095][SQL] Update documentation to reflect usage of updated statistics
49855e7 is described below

commit 49855e78a3ae430cd67c50d6dd81f6688fb3de15
Author: Micah Kornfield <micahk@google.com>
AuthorDate: Wed Jul 1 07:13:10 2020 +0000

    [SPARK-32095][SQL] Update documentation to reflect usage of updated statistics
    
    ### What changes were proposed in this pull request?
    Update documentation to reflect changes in https://github.com/apache/spark/commit/faf220aad9051c224a630e678c54098861f6b482
    
    I've changed the documentation to reflect updated statistics may be used to improve query
plan.
    
    ### Why are the changes needed?
    I believe the documentation is stale and misleading.
    
    ### Does this PR introduce _any_ user-facing change?
    Yes, this is a javadoc documentation fix.
    
    ### How was this patch tested?
    Doc fix.
    
    Closes #28925 from emkornfield/spark-32095.
    
    Authored-by: Micah Kornfield <micahk@google.com>
    Signed-off-by: Wenchen Fan <wenchen@databricks.com>
    (cherry picked from commit a4ba34434c55c7f0eafa32d8c7ea6e052813f293)
    Signed-off-by: Wenchen Fan <wenchen@databricks.com>
---
 .../apache/spark/sql/connector/read/SupportsReportStatistics.java   | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java
b/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java
index b839fd5..1e0c9ca 100644
--- a/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java
+++ b/sql/catalyst/src/main/java/org/apache/spark/sql/connector/read/SupportsReportStatistics.java
@@ -23,9 +23,9 @@ import org.apache.spark.annotation.Evolving;
  * A mix in interface for {@link Scan}. Data sources can implement this interface to
  * report statistics to Spark.
  *
- * As of Spark 2.4, statistics are reported to the optimizer before any operator is pushed
to the
- * data source. Implementations that return more accurate statistics based on pushed operators
will
- * not improve query performance until the planner can push operators before getting stats.
+ * As of Spark 3.0, statistics are reported to the optimizer after operators are pushed to
the
+ * data source. Implementations may return more accurate statistics based on pushed operators
+ * which may improve query performance by providing better information to the optimizer.
  *
  * @since 3.0.0
  */


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@spark.apache.org
For additional commands, e-mail: commits-help@spark.apache.org


Mime
View raw message