spark-reviews mailing list archives

From kiszk <...@git.apache.org>
Subject [GitHub] spark pull request #21618: [SPARK-20408][SQL] Get the glob path in parallel ...
Date Thu, 05 Jul 2018 19:21:08 GMT
Github user kiszk commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21618#discussion_r200462611
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala ---
    @@ -656,6 +656,25 @@ object SQLConf {
           .intConf
           .createWithDefault(10000)
     
    +  val PARALLEL_GET_GLOBBED_PATH_THRESHOLD =
    +    buildConf("spark.sql.sources.parallelGetGlobbedPath.threshold")
    +      .doc("The maximum number of subfiles or directories allowed after a globbed path " +
    +        "expansion. If the number of paths exceeds this value during expansion, it tries to " +
    +        "expand the globbed in parallel with multi-thread.")
    +      .intConf
    +      .checkValue(threshlod => threshlod >= 0, "The maximum number of subfiles or directories " +
    --- End diff --
    
    nit: threshlod -> threshold
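
    For illustration, the predicate with the parameter name spelled correctly might look like the stand-alone sketch below. The object `ThresholdCheck` and the value `isValidThreshold` are hypothetical names, not part of the pull request or of Spark's config builder, and the error message is left out because the diff truncates it.

```scala
// Hypothetical stand-alone sketch; it does not use Spark's ConfigBuilder API.
object ThresholdCheck {
  // Same predicate as in the diff, with the lambda parameter spelled "threshold".
  val isValidThreshold: Int => Boolean = threshold => threshold >= 0

  def main(args: Array[String]): Unit = {
    // Zero and positive values pass the check; negative values are rejected.
    assert(isValidThreshold(0))
    assert(isValidThreshold(10000))
    assert(!isValidThreshold(-1))
  }
}
```

    The rename only touches the lambda parameter, so the behavior of the check is unchanged.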

