From reviews-return-613359-archive-asf-public=cust-asf.ponee.io@spark.apache.org  Wed Feb  7 05:39:04 2018
Return-Path: <reviews-return-613359-archive-asf-public=cust-asf.ponee.io@spark.apache.org>
X-Original-To: archive-asf-public@eu.ponee.io
Delivered-To: archive-asf-public@eu.ponee.io
Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183])
	by mx-eu-01.ponee.io (Postfix) with ESMTP id 4B46E180657
	for <archive-asf-public@eu.ponee.io>; Wed,  7 Feb 2018 05:39:04 +0100 (CET)
Received: by cust-asf.ponee.io (Postfix)
	id 3AB5F160C46; Wed,  7 Feb 2018 04:39:04 +0000 (UTC)
Delivered-To: archive-asf-public@cust-asf.ponee.io
Received: from mail.apache.org (hermes.apache.org [140.211.11.3])
	by cust-asf.ponee.io (Postfix) with SMTP id 82091160C45
	for <archive-asf-public@cust-asf.ponee.io>; Wed,  7 Feb 2018 05:39:03 +0100 (CET)
Received: (qmail 1477 invoked by uid 500); 7 Feb 2018 04:39:02 -0000
Mailing-List: contact reviews-help@spark.apache.org; run by ezmlm
Precedence: bulk
List-Help: <mailto:reviews-help@spark.apache.org>
List-Unsubscribe: <mailto:reviews-unsubscribe@spark.apache.org>
List-Post: <mailto:reviews@spark.apache.org>
List-Id: <reviews.spark.apache.org>
Delivered-To: mailing list reviews@spark.apache.org
Received: (qmail 1462 invoked by uid 99); 7 Feb 2018 04:39:01 -0000
Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23)
    by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Feb 2018 04:39:01 +0000
Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33)
	id 4E7B9DFBBB; Wed,  7 Feb 2018 04:38:59 +0000 (UTC)
From: cloud-fan <git@git.apache.org>
To: reviews@spark.apache.org
Reply-To: reviews@spark.apache.org
References: <git-pr-20521-spark@git.apache.org>
In-Reply-To: <git-pr-20521-spark@git.apache.org>
Subject: [GitHub] spark pull request #20521: [SPARK-22977][SQL] fix web UI SQL tab for CTAS
Content-Type: text/plain
Message-Id: <20180207043900.4E7B9DFBBB@git1-us-west.apache.org>
Date: Wed,  7 Feb 2018 04:38:59 +0000 (UTC)

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20521#discussion_r166517178
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala ---
    @@ -493,9 +510,23 @@ case class DataSource(
             dataSource.createRelation(
               sparkSession.sqlContext, mode, caseInsensitiveOptions, Dataset.ofRows(sparkSession, data))
           case format: FileFormat =>
    -        sparkSession.sessionState.executePlan(planForWritingFileFormat(format, mode, data)).toRdd
    +        val cmd = planForWritingFileFormat(format, mode, data)
    +        val resolvedPartCols = cmd.partitionColumns.map { col =>
    +          // The partition columns created in `planForWritingFileFormat` should always be
    +          // `UnresolvedAttribute` with a single name part.
    +          assert(col.isInstanceOf[UnresolvedAttribute])
    +          val unresolved = col.asInstanceOf[UnresolvedAttribute]
    +          assert(unresolved.nameParts.length == 1)
    +          val name = unresolved.nameParts.head
    +          outputColumns.find(a => equality(a.name, name)).getOrElse {
    +            throw new AnalysisException(
    +              s"Unable to resolve $name given [${data.output.map(_.name).mkString(", ")}]")
    +          }
    +        }
    +        val resolved = cmd.copy(partitionColumns = resolvedPartCols, outputColumns = outputColumns)
    --- End diff --
    
    Because the CTAS node itself doesn't have a `QueryExecution`. It only has the physical plan that was produced by the planner. `QueryExecution` only exists in the places that drive the  analze/.../exeucte, e.g. `Dataset`.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org