spark-reviews mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From gatorsmile <...@git.apache.org>
Subject [GitHub] spark pull request #20484: [SPARK-23313][DOC] Add a migration guide for ORC
Date Fri, 02 Feb 2018 06:34:37 GMT
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20484#discussion_r165567650
  
    --- Diff: docs/sql-programming-guide.md ---
    @@ -1776,6 +1776,77 @@ working with timestamps in `pandas_udf`s to get the best performance,
see
     
     ## Upgrading From Spark SQL 2.2 to 2.3
     
    +  - Since Spark 2.3, Spark supports a vectorized ORC reader with a new ORC file format
for ORC files and Hive ORC tables. To do that, the following configurations are newly added
or change their default values.
    +
    +    <table class="table">
    +      <tr>
    +        <th>
    +          <b>Property Name</b>
    +        </th>
    +        <th>
    +          <b>Default</b>
    +        </th>
    +        <th>
    +          <b>Meaning</b>
    +        </th>
    +      </tr>
    +      <tr>
    +        <td>
    +          spark.sql.orc.impl
    +        </td>
    +        <td>
    +          native
    +        </td>
    +        <td>
    +          The name of ORC implementation: 'native' means the native version of ORC support
instead of the ORC library in Hive 1.2.1. It is 'hive' by default prior to Spark 2.3.
    --- End diff --
    
    ` the native version of ORC support` -> `the native ORC support that is built on Apache
ORC 1.4.1`


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


Mime
View raw message