Mailing-List: contact commits-help@impala.incubator.apache.org; run by ezmlm
Reply-To: dev@impala.incubator.apache.org
From: jrussell@apache.org
To: commits@impala.incubator.apache.org
Date: Wed, 09 Nov 2016 00:14:31 -0000
Message-Id: <59c43d690c644b078677208dc9e9d4c1@git.apache.org>
X-Mailer: ASF-Git Admin Mailer
Subject: [3/5] incubator-impala git commit: Bring in the latest updates to downstream docs.
archived-at: Wed, 09 Nov 2016 00:14:47 -0000

http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/shared/impala_common.xml ---------------------------------------------------------------------- diff --git a/docs/shared/impala_common.xml b/docs/shared/impala_common.xml index e8b367f..b0c9ee5 100644 --- a/docs/shared/impala_common.xml +++ b/docs/shared/impala_common.xml @@ -33,12 +33,12 @@
  • - Impala integrates with the existing CDH ecosystem, meaning data can be stored, shared, and accessed using - the various solutions included with CDH. This also avoids data silos and minimizes expensive data movement. + Impala integrates with the existing ecosystem, meaning data can be stored, shared, and accessed using + the various solutions included with . This also avoids data silos and minimizes expensive data movement.
  • - Impala provides access to data stored in CDH without requiring the Java skills required for MapReduce jobs. + Impala provides access to data stored in without requiring the Java skills required for MapReduce jobs. Impala can access data directly from the HDFS file system. Impala also provides a SQL front-end to access data in the HBase database system, or in the Amazon Simple Storage System (S3).
  • @@ -544,7 +544,7 @@

    - In CDH 5.8 / Impala 2.6 and higher, Impala recognizes the auth_to_local setting, + In and higher, Impala recognizes the auth_to_local setting, specified through the HDFS configuration setting hadoop.security.auth_to_local or the Cloudera Manager setting @@ -780,17 +780,6 @@ select concat('abc','mno','xyz'); -

    - - Kudu Snippets - -

    - If any advice, background info, or warnings are needed in multiple - places for interaction of Impala with Kudu, put them under here. -

    - -
    -
    SQL Language Reference Snippets @@ -873,7 +862,7 @@ select * from t2;

    The Avro specification allows string values up to 2**64 bytes in length. Impala queries for Avro tables use 32-bit integers to hold string lengths. - In CDH 5.7 / Impala 2.5 and higher, Impala truncates CHAR + In and higher, Impala truncates CHAR and VARCHAR values in Avro tables to (2**31)-1 bytes. If a query encounters a STRING value longer than (2**31)-1 bytes in an Avro table, the query fails. In earlier releases, @@ -932,7 +921,7 @@ alter table partitioned_data set tblproperties ('numRows'='1030000', 'STATS_GENE

    If you frequently run aggregate functions such as MIN(), MAX(), and COUNT(DISTINCT) on partition key columns, consider enabling the OPTIMIZE_PARTITION_KEY_SCANS - query option, which optimizes such queries. This feature is available in CDH 5.7 / Impala 2.5 and higher. + query option, which optimizes such queries. This feature is available in and higher. See for the kinds of queries that this option applies to, and slight differences in how partitions are evaluated when this query option is enabled. @@ -996,7 +985,7 @@ alter table partitioned_data set tblproperties ('numRows'='1030000', 'STATS_GENE

  • Likewise, the impala-shell command relies on - some information only available in Impala 2.3 / CDH 5.5 and higher + some information only available in and higher to prepare live progress reports and query summaries. The LIVE_PROGRESS and LIVE_SUMMARY query options have no effect when impala-shell connects @@ -1036,7 +1025,7 @@ drop database temp; use default; -- Before dropping a database, first drop all the tables inside it, --- or in CDH 5.5 and higher use the CASCADE clause. +-- or in and higher use the CASCADE clause. drop database temp; ERROR: ImpalaRuntimeException: Error making 'dropDatabase' RPC to Hive Metastore: CAUSED BY: InvalidOperationException: Database temp is not empty @@ -1047,7 +1036,7 @@ show tables in temp; | t3 | +------+ --- CDH 5.5 and higher: +-- and higher: drop database temp cascade; -- CDH 5.4 and lower: @@ -1115,7 +1104,7 @@ drop database temp;

    - In CDH 5.8 / Impala 2.6 and higher, Impala queries are optimized for files stored in Amazon S3. + In and higher, Impala queries are optimized for files stored in Amazon S3. For Impala tables that use the file formats Parquet, RCFile, SequenceFile, Avro, and uncompressed text, the setting fs.s3a.block.size in the core-site.xml configuration file determines @@ -1131,7 +1120,7 @@ drop database temp;

    - In CDH 5.8 / Impala 2.6 and higher, Impala supports both queries (SELECT) + In and higher, Impala supports both queries (SELECT) and DML (INSERT, LOAD DATA, CREATE TABLE AS SELECT) for data residing on Amazon S3. With the inclusion of write support, @@ -1148,7 +1137,7 @@ drop database temp;

    - In CDH 5.8 / Impala 2.6 and higher, Impala DDL statements such as + In and higher, Impala DDL statements such as CREATE DATABASE, CREATE TABLE, DROP DATABASE CASCADE, DROP TABLE, and ALTER TABLE [ADD|DROP] PARTITION can create or remove folders as needed in the Amazon S3 system. Prior to CDH 5.8 / Impala 2.6, you had to create folders yourself and point @@ -1157,7 +1146,7 @@ drop database temp;

    - In CDH 5.8 / Impala 2.6 and higher, the Impala DML statements (INSERT, LOAD DATA, + In and higher, the Impala DML statements (INSERT, LOAD DATA, and CREATE TABLE AS SELECT) can write data into a table or partition that resides in the Amazon Simple Storage Service (S3). The syntax of the DML statements is the same as for any other tables, because the S3 location for tables and @@ -1227,7 +1216,7 @@ drop database temp;

    - In CDH 5.7 / Impala 2.5 and higher, Impala UDFs and UDAs written in C++ are persisted in the metastore database. + In and higher, Impala UDFs and UDAs written in C++ are persisted in the metastore database. Java UDFs are also persisted, if they were created with the new CREATE FUNCTION syntax for Java UDFs, where the Java function argument and return types are omitted. Java-based UDFs created with the old CREATE FUNCTION syntax do not persist across restarts @@ -1235,7 +1224,7 @@ drop database temp; Until you re-create such Java UDFs using the new CREATE FUNCTION syntax, you must reload those Java-based UDFs by running the original CREATE FUNCTION statements again each time you restart the catalogd daemon. - Prior to CDH 5.7 / Impala 2.5, the requirement to reload functions after a restart applied to both C++ and Java functions. + Prior to the requirement to reload functions after a restart applied to both C++ and Java functions.

    @@ -1317,7 +1306,7 @@ select c_first_name, c_last_name from customer where lower(trim(c_last_name)) rl

    - In CDH 5.7 / Impala 2.5 and higher, you can simplify queries that + In and higher, you can simplify queries that use many UPPER() and LOWER() calls to do case-insensitive comparisons, by using the ILIKE or IREGEXP operators instead. See @@ -1857,11 +1846,11 @@ show functions in _impala_builtins like '*substring*'; Complex type considerations: Although you can create tables in this file format using the complex types (ARRAY, STRUCT, - and MAP) available in CDH 5.5 / Impala 2.3 and higher, + and MAP) available in and higher, currently, Impala can query these types only in Parquet tables. The one exception to the preceding rule is COUNT(*) queries on RCFile tables that include complex types. - Such queries are allowed in CDH 5.8 / Impala 2.6 and higher. + Such queries are allowed in and higher.

    @@ -1906,7 +1895,7 @@ show functions in _impala_builtins like '*substring*';

    The Impala complex types (STRUCT, ARRAY, or MAP) - are available in CDH 5.5 / Impala 2.3 and higher. + are available in and higher. To use these types with JDBC requires version 2.5.28 or higher of the Cloudera JDBC Connector for Impala. To use these types with ODBC requires version 2.5.30 or higher of the Cloudera ODBC Connector for Impala. Consider upgrading all JDBC and ODBC drivers at the same time you upgrade from CDH 5.5 or higher. @@ -2117,7 +2106,7 @@ order by r_name; The arguments to this command let you perform operations such as:

    • - cat: Print a file's contents to standard out. In CDH 5.5 and higher, you can use + cat: Print a file's contents to standard out. In CDH 5.5 and higher, you can use the -j option to output JSON.
    • @@ -2430,6 +2419,10 @@ flight_num: INT32 SNAPPY DO:83456393 FPO:83488603 SZ:10216514/11474301 HBase considerations:

      +

      + The LOAD DATA statement cannot be used with HBase tables. +

      +

      HBase considerations: This data type is fully compatible with HBase tables.

      @@ -2782,7 +2775,7 @@ select max(height), avg(height) from census_data where age > 20;

      - In Impala 2.2 / CDH 5.4 and higher, the optional WITH REPLICATION clause + In and higher, the optional WITH REPLICATION clause for CREATE TABLE and ALTER TABLE lets you specify a replication factor, the number of hosts on which to cache the same data blocks. When Impala processes a cached data block, where the cache replication factor is greater than 1, Impala randomly @@ -2961,7 +2954,7 @@ Query finished, fetching results ...

      - In CDH 5.8 / Impala 2.6 and higher, Impala can optionally + In and higher, Impala can optionally skip an arbitrary number of header lines from text input files on HDFS based on the skip.header.line.count value in the TBLPROPERTIES field of the table metadata. For example: @@ -3198,7 +3191,7 @@ sudo pip-python install ssl Prior to CDH 5.5 / Impala 2.3, the impala user was required to be a member of the hdfs group for the resource management feature to work (in combination with CDH 5 and the YARN and Llama components). - This requirement has been lifted in CDH 5.5 / Impala 2.3 and higher. The impala + This requirement has been lifted in and higher. The impala user remains in the hdfs group on upgraded systems if it was already there, but is no longer put into that group during new installs.

      @@ -3673,6 +3666,25 @@ sudo pip-python install ssl

  • +
    + + Kudu + +

    + Kudu-related content. This category gets its own special area because there could be considerations + around sharing content between the Impala documentation and the Kudu documentation. +

    + +

    + Kudu considerations: +

    + +

    + The LOAD DATA statement cannot be used with Kudu tables. +

    + +
    + http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_admission.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_admission.xml b/docs/topics/impala_admission.xml index 41f5852..ad53823 100644 --- a/docs/topics/impala_admission.xml +++ b/docs/topics/impala_admission.xml @@ -82,7 +82,7 @@

    - In CDH 5.7 / Impala 2.5 and higher, you can specify these limits and thresholds for each + In and higher, you can specify these limits and thresholds for each pool rather than globally. That way, you can balance the resource usage and throughput between steady well-defined workloads, rare resource-intensive queries, and ad hoc exploratory queries. @@ -388,9 +388,9 @@ Although the following options are still present in the Cloudera Manager interface under the Admission Control configuration settings dialog, - Cloudera recommends you not use them in CDH 5.7 / Impala 2.5 and higher. + Cloudera recommends you not use them in and higher. These settings only apply if you enable admission control but leave dynamic resource pools disabled. - In CDH 5.7 / Impala 2.5 and higher, prefer to set up dynamic resource pools and + In and higher, prefer to set up dynamic resource pools and customize the settings for each pool, as described in and . @@ -441,7 +441,7 @@

    Default: - -1, meaning unlimited (prior to CDH 5.7 / Impala 2.5, the default was 200) + -1, meaning unlimited (prior to the default was 200)

    @@ -673,7 +673,7 @@ impala.admission-control.pool-queue-timeout-ms.queue_name The impala.admission-control.* configuration settings are available in - CDH 5.7 / Impala 2.5 and higher. + and higher.

    @@ -684,7 +684,7 @@ impala.admission-control.pool-queue-timeout-ms.queue_name - + Examples of Admission Control Configurations http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_alter_table.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_alter_table.xml b/docs/topics/impala_alter_table.xml index 5e6e7a0..61199cf 100644 --- a/docs/topics/impala_alter_table.xml +++ b/docs/topics/impala_alter_table.xml @@ -76,7 +76,7 @@ statsKey ::= numDVs | numNulls | avgSize | maxSize

    - In CDH 5.5 / Impala 2.3 and higher, the ALTER TABLE statement can + In and higher, the ALTER TABLE statement can change the metadata for tables containing complex types (ARRAY, STRUCT, and MAP). For example, you can use an ADD COLUMNS, DROP COLUMN, or CHANGE @@ -110,7 +110,7 @@ statsKey ::= numDVs | numNulls | avgSize | maxSize

    You can specify an s3a:// prefix on the LOCATION attribute of a table or partition - to make Impala query data from the Amazon S3 filesystem. In CDH 5.8 / Impala 2.6 and higher, Impala automatically + to make Impala query data from the Amazon S3 filesystem. In and higher, Impala automatically handles creating or removing the associated folders when you issue ALTER TABLE statements with the ADD PARTITION or DROP PARTITION clauses.

    @@ -209,7 +209,7 @@ alter table p1 partition (month=1, day=1) set location '/usr/external_data/new_y

    - In CDH 5.5 / Impala 2.3 and higher, the RECOVER PARTITIONS clause scans + In and higher, the RECOVER PARTITIONS clause scans a partitioned table to detect if any new partition directories were added outside of Impala, such as by Hive ALTER TABLE statements or by hdfs dfs or hadoop fs commands. The RECOVER PARTITIONS clause @@ -378,7 +378,7 @@ yes,no

    - In CDH 5.8 / Impala 2.6 and higher, you can use the SET COLUMN STATS clause + In and higher, you can use the SET COLUMN STATS clause to set a specific stats value for a particular column.

    @@ -566,7 +566,7 @@ optional int32 x [i:1 d:1 r:0]

    - In CDH 5.8 / Impala 2.6 and higher, if an Avro table is created without column definitions in the + In and higher, if an Avro table is created without column definitions in the CREATE TABLE statement, and columns are later added through ALTER TABLE, the resulting table is now queryable. Missing values from the newly added @@ -705,7 +705,7 @@ show partitions partition_t;

    The optional PURGE keyword, available in - CDH 5.5 / Impala 2.3 and higher, is used with the DROP + and higher, is used with the DROP PARTITION clause to remove associated HDFS data files immediately rather than going through the HDFS trashcan mechanism. Use this keyword when dropping a partition if it is crucial to remove the data http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_analytic_functions.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_analytic_functions.xml b/docs/topics/impala_analytic_functions.xml index 7fc9b3c..0a31363 100644 --- a/docs/topics/impala_analytic_functions.xml +++ b/docs/topics/impala_analytic_functions.xml @@ -474,7 +474,7 @@ output as above. Because RANGE currently does not support numer - CUME_DIST Function (CDH 5.5 or higher only) + CUME_DIST Function (<keyword keyref="impala23"/> or higher only) @@ -1210,7 +1210,7 @@ from stock_ticker - NTILE Function (CDH 5.5 or higher only) + NTILE Function (<keyword keyref="impala23"/> or higher only) @@ -1345,7 +1345,7 @@ order by kilos desc; - PERCENT_RANK Function (CDH 5.5 or higher only) + PERCENT_RANK Function (<keyword keyref="impala23"/> or higher only) http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_appx_count_distinct.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_appx_count_distinct.xml b/docs/topics/impala_appx_count_distinct.xml index 854bbd0..04078db 100644 --- a/docs/topics/impala_appx_count_distinct.xml +++ b/docs/topics/impala_appx_count_distinct.xml @@ -2,7 +2,7 @@ - APPX_COUNT_DISTINCT Query Option (CDH 5.2 or higher only) + APPX_COUNT_DISTINCT Query Option (<keyword keyref="impala20"/> or higher only) APPX_COUNT_DISTINCT http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_array.xml 
---------------------------------------------------------------------- diff --git a/docs/topics/impala_array.xml b/docs/topics/impala_array.xml index 4cab7f6..f9519c1 100644 --- a/docs/topics/impala_array.xml +++ b/docs/topics/impala_array.xml @@ -2,7 +2,7 @@ - ARRAY Complex Type (CDH 5.5 or higher only) + ARRAY Complex Type (<keyword keyref="impala23"/> or higher only) http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_authorization.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_authorization.xml b/docs/topics/impala_authorization.xml index f12b6d1..5163b2f 100644 --- a/docs/topics/impala_authorization.xml +++ b/docs/topics/impala_authorization.xml @@ -56,7 +56,7 @@

    The object hierarchy for Impala covers Server, URI, Database, Table, and Column. (The Table privileges apply to views as well; anywhere you specify a table name, you can specify a view name instead.) - Column-level authorization is available in CDH 5.5 / Impala 2.3 and higher, as described in + Column-level authorization is available in and higher, as described in . Previously, you constructed views to query specific columns and assigned privileges based on the views rather than the base tables. @@ -253,7 +253,7 @@ report_generator = server=server1->db=reporting_db->table=*->action=SEL - Using Impala with the Sentry Service (CDH 5.1 or higher only) + Using Impala with the Sentry Service (<keyword keyref="impala14"/> or higher only) @@ -261,7 +261,7 @@ report_generator = server=server1->db=reporting_db->table=*->action=SEL When you use the Sentry service rather than the policy file, you set up privileges through GRANT and REVOKE statement in either Impala or Hive, then both components use those same privileges automatically. (Impala added the GRANT and - REVOKE statements in Impala 2.0.0 / CDH 5.2.0.) + REVOKE statements in .)

    @@ -298,7 +298,7 @@ report_generator = server=server1->db=reporting_db->table=*->action=SEL

    - In CDH 5 and higher, Cloudera recommends + In CDH 5 and higher, Cloudera recommends managing privileges through SQL statements, as described in . If you are still using policy files, plan to migrate to the new approach some time in the future. @@ -913,7 +913,7 @@ Database

    - In CDH 5.5 / Impala 2.3 and higher, you can specify privileges for individual columns, + In and higher, you can specify privileges for individual columns, as described in . Formerly, to specify @@ -1311,7 +1311,7 @@ it can be queried by a lower-privileged user who does not have full query privil for the underlying tables. See for details about column-level authorization. -Before column-level authorization was added in CDH 5.5 / Impala 2.3, the way to implement +Before column-level authorization was added in the way to implement column-level security was by creating views that referenced particular sets of columns. http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_avro.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_avro.xml b/docs/topics/impala_avro.xml index d89d782..04e84aa 100644 --- a/docs/topics/impala_avro.xml +++ b/docs/topics/impala_avro.xml @@ -75,7 +75,7 @@

    - In CDH 5.5 / Impala 2.3 and higher, the CREATE TABLE for Avro tables can include + In and higher, the CREATE TABLE for Avro tables can include SQL-style column definitions rather than specifying Avro notation through the TBLPROPERTIES clause. Impala issues warning messages if there are any mismatches between the types specified in the SQL column definitions and the underlying types; for example, any TINYINT or @@ -215,7 +215,7 @@ for this information.

    In rare instances, a mismatch could occur between the Avro schema and the column definitions in the - metastore database. In CDH 5.5 / Impala 2.3 and higher, Impala checks for such inconsistencies during + metastore database. In and higher, Impala checks for such inconsistencies during a CREATE TABLE statement and each time it loads the metadata for a table (for example, after INVALIDATE METADATA). Impala uses the following rules to determine how to treat mismatching columns, a process known as schema reconciliation: @@ -228,9 +228,9 @@ for this information. If there is a mismatch in column name or type, Impala uses the column definition from the Avro schema. Because a CHAR or VARCHAR column in Impala maps to an Avro STRING, this case is not considered a mismatch and the column is preserved as CHAR or VARCHAR - in the reconciled schema. Prior to CDH 5.9 / Impala 2.7, the column + in the reconciled schema. Prior to the column name and comment for such CHAR and VARCHAR columns was also taken from the SQL column definition. - In CDH 5.9 / Impala 2.7 and higher, the column name and comment from the Avro schema file take precedence for such columns, + In and higher, the column name and comment from the Avro schema file take precedence for such columns, and only the CHAR or VARCHAR type is preserved from the SQL column definition.

  • http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_batch_size.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_batch_size.xml b/docs/topics/impala_batch_size.xml index afd3c17..eb54b14 100644 --- a/docs/topics/impala_batch_size.xml +++ b/docs/topics/impala_batch_size.xml @@ -24,7 +24,7 @@

    - This option is primarily for Cloudera testing, or for use under the direction of Cloudera Support. + This option is primarily for testing during Impala development, or for use under the direction of .

    http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_breakpad.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_breakpad.xml b/docs/topics/impala_breakpad.xml index 026fa1d..f764c2f 100644 --- a/docs/topics/impala_breakpad.xml +++ b/docs/topics/impala_breakpad.xml @@ -2,7 +2,7 @@ - Breakpad Minidumps for Impala (CDH 5.8 or higher only) + Breakpad Minidumps for Impala (<keyword keyref="impala26"/> or higher only) Breakpad Minidumps @@ -18,7 +18,7 @@

    The breakpad project is an open-source framework for crash reporting. - In CDH 5.8 / Impala 2.6 and higher, Impala can use breakpad to record stack information and + In and higher, Impala can use breakpad to record stack information and register values when any of the Impala-related daemons crash due to an error such as SIGSEGV or unhandled exceptions. The dump files are much smaller than traditional core dump files. The dump mechanism itself uses very little @@ -50,7 +50,7 @@

    - In CDH 5.9 / Impala 2.7 and higher, + In and higher, you can send a SIGUSR1 signal to any Impala-related daemon to write a Breakpad minidump. For advanced troubleshooting, you can now produce a minidump without triggering a crash. @@ -134,7 +134,8 @@ Using the Minidump Files for Problem Resolution

    - Typically, you provide minidump files to Cloudera Support as part of problem resolution, in the same way that you might provide a core dump. The Send Diagnostic Data + Typically, you provide minidump files to as part of problem resolution, + in the same way that you might provide a core dump. The Send Diagnostic Data under the Support menu in Cloudera Manager guides you through the process of selecting a time period and volume of diagnostic data, then collects the data from all hosts and transmits the relevant information for you. @@ -144,10 +145,10 @@

    - Cloudera Support might provide additional instructions about collecting minidumps to better isolate a specific problem. + You might get additional instructions from about collecting minidumps to better isolate a specific problem. Because the information in the minidump files is limited to stack traces and register contents, the possibility of including sensitive information is much lower than with core dump files. - If any sensitive information is included in the minidump, Cloudera Support preserves the confidentiality of that information. + If any sensitive information is included in the minidump, preserves the confidentiality of that information.

    @@ -236,7 +237,7 @@ Wrote minidump to /var/log/impala-minidumps/impalad/0980da2d-a905-01e1-25ff883a-

    The resulting minidump file is much smaller than the corresponding core file, - making it much easier to supply diagnostic information to Cloudera Support. + making it much easier to supply diagnostic information to . The transmission process for the minidump files is automated through Cloudera Manager.

    - CHAR Data Type (CDH 5.2 or higher only) + CHAR Data Type (<keyword keyref="impala20"/> or higher only) CHAR http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_complex_types.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_complex_types.xml b/docs/topics/impala_complex_types.xml index 77e9707..105ecdd 100644 --- a/docs/topics/impala_complex_types.xml +++ b/docs/topics/impala_complex_types.xml @@ -2,7 +2,7 @@ - Complex Types (CDH 5.5 or higher only) + Complex Types (<keyword keyref="impala23"/> or higher only) @@ -351,7 +351,7 @@

    The one exception to the preceding rule is COUNT(*) queries on RCFile tables that include complex types. - Such queries are allowed in CDH 5.8 / Impala 2.6 and higher. + Such queries are allowed in and higher.

    @@ -2615,7 +2615,7 @@ STORED AS PARQUET;

    If you have a Hive-created Parquet table that includes ARRAY, STRUCT, or MAP - columns, Impala can query that same table in Impala 2.3 / CDH 5.5 and higher, subject to the usual restriction that all other + columns, Impala can query that same table in and higher, subject to the usual restriction that all other columns are of data types supported by Impala, and also that the file type of the table must be Parquet.

    http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_compression_codec.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_compression_codec.xml b/docs/topics/impala_compression_codec.xml index b00089f..739c651 100644 --- a/docs/topics/impala_compression_codec.xml +++ b/docs/topics/impala_compression_codec.xml @@ -2,7 +2,7 @@ - COMPRESSION_CODEC Query Option (CDH 5.2 or higher only) + COMPRESSION_CODEC Query Option (<keyword keyref="impala20"/> or higher only) COMPRESSION_CODEC http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_config_options.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_config_options.xml b/docs/topics/impala_config_options.xml index 686df87..a08aafa 100644 --- a/docs/topics/impala_config_options.xml +++ b/docs/topics/impala_config_options.xml @@ -301,7 +301,7 @@ Starting Impala Catalog Server: [ OK ]
  • - During troubleshooting, Cloudera Support might direct you to change other values, + During troubleshooting, might direct you to change other values, particularly for IMPALA_SERVER_ARGS, to work around issues or gather debugging information.

    @@ -469,7 +469,7 @@ Starting Impala Catalog Server: [ OK ] -->
  • - During troubleshooting, Cloudera Support might direct you to change other values, + During troubleshooting, might direct you to change other values, particularly for IMPALA_SERVER_ARGS, to work around issues or gather debugging information.
  • http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_create_database.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_create_database.xml b/docs/topics/impala_create_database.xml index cb9dd84..de556bc 100644 --- a/docs/topics/impala_create_database.xml +++ b/docs/topics/impala_create_database.xml @@ -95,7 +95,7 @@

    The SHOW DATABASES statement lists all databases, or the databases whose name - matches a wildcard pattern. In CDH 5.7 / Impala 2.5 and higher, the + matches a wildcard pattern. In and higher, the SHOW DATABASES output includes a second column that displays the associated comment, if any, for each database.

    @@ -105,7 +105,7 @@

    To specify that any tables created within a database reside on the Amazon S3 system, you can include an s3a:// prefix on the LOCATION - attribute. In CDH 5.8 / Impala 2.6 and higher, Impala automatically creates any + attribute. In and higher, Impala automatically creates any required folders as the databases, tables, and partitions are created, and removes them when they are dropped.

    http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_create_function.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_create_function.xml b/docs/topics/impala_create_function.xml index 77815f7..5b81f50 100644 --- a/docs/topics/impala_create_function.xml +++ b/docs/topics/impala_create_function.xml @@ -34,7 +34,7 @@

    - In CDH 5.7 / Impala 2.5 and higher, the syntax is also different for creating or dropping scalar Java-based UDFs. + In and higher, the syntax is also different for creating or dropping scalar Java-based UDFs. The statements for Java UDFs use a new syntax, without any argument types or return type specified. Java-based UDFs created using the new syntax persist across restarts of the Impala catalog server, and can be shared transparently between Impala and Hive. @@ -227,7 +227,7 @@ DROP FUNCTION IF EXISTS bar;

  • - CDH 5.7 / Impala 2.5 introduces UDF improvements to persistence for both C++ and Java UDFs, + introduces UDF improvements to persistence for both C++ and Java UDFs, and better compatibility between Impala and Hive for Java UDFs. See for details.
  • @@ -330,7 +330,7 @@ DROP FUNCTION IF EXISTS bar; into UDFs under a single (overloaded) function name in Impala. Each CREATE FUNCTION or DROP FUNCTION statement applies to all the overloaded Java functions with the same name. This example uses the signatureless syntax for CREATE FUNCTION and DROP FUNCTION, - which is available in CDH 5.7 / Impala 2.5 and higher. + which is available in and higher.

    At the start, the jar file is in the local filesystem. Then it is copied into HDFS, so that it is http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_create_role.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_create_role.xml b/docs/topics/impala_create_role.xml index 8258b1f..2308490 100644 --- a/docs/topics/impala_create_role.xml +++ b/docs/topics/impala_create_role.xml @@ -2,7 +2,7 @@ - CREATE ROLE Statement (CDH 5.2 or higher only) + CREATE ROLE Statement (<keyword keyref="impala20"/> or higher only) CREATE ROLE http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_create_table.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_create_table.xml b/docs/topics/impala_create_table.xml index 1b5552f..ba83fc5 100644 --- a/docs/topics/impala_create_table.xml +++ b/docs/topics/impala_create_table.xml @@ -186,7 +186,7 @@ file_format:

    The Impala complex types (STRUCT, ARRAY, or MAP) - are available in CDH 5.5 / Impala 2.3 and higher. + are available in and higher. Because you can nest these types (for example, to make an array of maps or a struct with an array field), these types are also sometimes referred to as nested types. See for usage details. @@ -252,9 +252,9 @@ file_format:

    - Prior to CDH 5.7 / Impala 2.5, you could use a partitioned table + Prior to you could use a partitioned table as the source and copy data from it, but could not specify any partitioning clauses for the new table. - In CDH 5.7 / Impala 2.5 and higher, you can now use the PARTITIONED BY clause with a + In and higher, you can now use the PARTITIONED BY clause with a CREATE TABLE AS SELECT statement. See the examples under the following discussion of the CREATE TABLE AS SELECT syntax variation.
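For illustration only (not part of the commit), a minimal sketch of a CREATE TABLE AS SELECT statement with a PARTITIONED BY clause. The table and column names are hypothetical.

```sql
-- Hypothetical unpartitioned source table.
CREATE TABLE events_flat (id BIGINT, msg STRING, year SMALLINT, month TINYINT);

-- PARTITIONED BY names the partition key columns without types.
-- The same columns appear last in the SELECT list, in the same order.
CREATE TABLE events_by_month PARTITIONED BY (year, month)
  AS SELECT id, msg, year, month FROM events_flat;
```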

    @@ -484,9 +484,9 @@ CREATE TABLE synthesized_columns AS SELECT upper(c1) AS all_caps, c2+c3 AS total

    - Prior to CDH 5.7 / Impala 2.5, you could use a partitioned table + Prior to you could use a partitioned table as the source and copy data from it, but could not specify any partitioning clauses for the new table. - In CDH 5.7 / Impala 2.5 and higher, you can now use the PARTITIONED BY clause with a + In and higher, you can now use the PARTITIONED BY clause with a CREATE TABLE AS SELECT statement. The following example demonstrates how you can copy data from an unpartitioned table in a CREATE TABLE AS SELECT operation, creating a new partitioned table in the process. The main syntax consideration is the column order in the PARTITIONED BY @@ -779,7 +779,7 @@ ERROR: AnalysisException: Partition column name mismatch: year != month

    - In CDH 5.8 / Impala 2.6 and higher, you can + In and higher, you can use this special LOCATION syntax as part of a CREATE TABLE AS SELECT statement.

    http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_date.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_date.xml b/docs/topics/impala_date.xml index 7a63aad..ae17b9a 100644 --- a/docs/topics/impala_date.xml +++ b/docs/topics/impala_date.xml @@ -2,7 +2,7 @@ - DATE Data Type (CDH 5.3 or higher only) + DATE Data Type (<keyword keyref="impala21"/> or higher only) http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_datetime_functions.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_datetime_functions.xml b/docs/topics/impala_datetime_functions.xml index be6afa9..dddfee6 100644 --- a/docs/topics/impala_datetime_functions.xml +++ b/docs/topics/impala_datetime_functions.xml @@ -2203,11 +2203,11 @@ select now() + interval 2 weeks as 2_weeks_from_now, positions of elements and use different separator characters.

    - In CDH 5.4.3 and higher, you can include a trailing uppercase Z qualifier + In CDH 5.4.3 and higher, you can include a trailing uppercase Z qualifier to indicate Zulu time, a synonym for UTC.

    - In CDH 5.5.0 and higher, you can include a timezone offset specified as minutes and hours, + In and higher, you can include a timezone offset specified as minutes and hours, provided you also specify the details in the format string argument. The offset is specified in the format string as a plus or minus sign followed by hh:mm, hhmm, or hh. The hh must be lowercase, to distinguish it from the HH represent http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_decimal.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_decimal.xml b/docs/topics/impala_decimal.xml index b566860..58958ab 100644 --- a/docs/topics/impala_decimal.xml +++ b/docs/topics/impala_decimal.xml @@ -2,7 +2,7 @@ - DECIMAL Data Type (CDH 5.1 or higher only) + DECIMAL Data Type (<keyword keyref="impala14"/> or higher only) DECIMAL @@ -163,7 +163,7 @@

    • - Using the DECIMAL type is only supported under CDH 5.1.0 and higher. + Using the DECIMAL type is only supported under and higher.
    • http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_delete.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_delete.xml b/docs/topics/impala_delete.xml index 997bd49..5715cc7 100644 --- a/docs/topics/impala_delete.xml +++ b/docs/topics/impala_delete.xml @@ -2,7 +2,7 @@ - DELETE Statement (CDH 5.10 or higher only) + DELETE Statement (<keyword keyref="impala28"/> or higher only) DELETE http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_describe.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_describe.xml b/docs/topics/impala_describe.xml index c7a2088..c4070c3 100644 --- a/docs/topics/impala_describe.xml +++ b/docs/topics/impala_describe.xml @@ -23,9 +23,9 @@ DESCRIBE statement The DESCRIBE statement displays metadata about a table, such as the column names and their data types. - In CDH 5.5 / Impala 2.3 and higher, you can specify the name of a complex type column, which takes + In and higher, you can specify the name of a complex type column, which takes the form of a dotted path. The path might include multiple components in the case of a nested type definition. - In CDH 5.7 / Impala 2.5 and higher, the DESCRIBE DATABASE form can display + In and higher, the DESCRIBE DATABASE form can display information about a database.

      @@ -75,7 +75,7 @@ object_name ::=

      The following examples show the variations in syntax and output for -describing databases. This feature is available in CDH 5.7 / Impala 2.5 +describing databases. This feature is available in and higher.

      @@ -178,7 +178,7 @@ describe formatted my_table;

      For the ARRAY, STRUCT, and MAP types available in - CDH 5.5 / Impala 2.3 and higher, the DESCRIBE output is formatted to avoid + and higher, the DESCRIBE output is formatted to avoid excessively long lines for multiple fields within a STRUCT, or a nested sequence of complex types.

      http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_disable_codegen.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_disable_codegen.xml b/docs/topics/impala_disable_codegen.xml index bcc5787..ab306e8 100644 --- a/docs/topics/impala_disable_codegen.xml +++ b/docs/topics/impala_disable_codegen.xml @@ -20,7 +20,7 @@ This is a debug option, intended for diagnosing and working around issues that cause crashes. If a query fails with an illegal instruction or other hardware-specific message, try setting DISABLE_CODEGEN=true and running the query again. If the query succeeds only when the - DISABLE_CODEGEN option is turned on, submit the problem to Cloudera support and include that + DISABLE_CODEGEN option is turned on, submit the problem to and include that detail in the problem report. Do not otherwise run with this setting turned on, because it results in lower overall performance.

      http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_disable_row_runtime_filtering.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_disable_row_runtime_filtering.xml b/docs/topics/impala_disable_row_runtime_filtering.xml index 4ba6e7a..a8c07cb 100644 --- a/docs/topics/impala_disable_row_runtime_filtering.xml +++ b/docs/topics/impala_disable_row_runtime_filtering.xml @@ -2,7 +2,7 @@ - DISABLE_ROW_RUNTIME_FILTERING Query Option (CDH 5.7 or higher only) + DISABLE_ROW_RUNTIME_FILTERING Query Option (<keyword keyref="impala25"/> or higher only) DISABLE_ROW_RUNTIME_FILTERING http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_disable_streaming_preaggregations.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_disable_streaming_preaggregations.xml b/docs/topics/impala_disable_streaming_preaggregations.xml index 683a16d..a3919af 100644 --- a/docs/topics/impala_disable_streaming_preaggregations.xml +++ b/docs/topics/impala_disable_streaming_preaggregations.xml @@ -2,7 +2,7 @@ - DISABLE_STREAMING_PREAGGREGATIONS Query Option (CDH 5.7 or higher only) + DISABLE_STREAMING_PREAGGREGATIONS Query Option (<keyword keyref="impala25"/> or higher only) DISABLE_STREAMING_PREAGGREGATIONS @@ -20,7 +20,7 @@

      DISABLE_STREAMING_PREAGGREGATIONS query option - Turns off the streaming preaggregation optimization that is available in CDH 5.7 / Impala 2.5 + Turns off the streaming preaggregation optimization that is available in and higher. This optimization reduces unnecessary work performed by queries that perform aggregation operations on columns with few or no duplicate values, for example DISTINCT id_column or GROUP BY unique_column. If the optimization causes regressions in http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_disable_unsafe_spills.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_disable_unsafe_spills.xml b/docs/topics/impala_disable_unsafe_spills.xml index 17ad2e1..14b687c 100644 --- a/docs/topics/impala_disable_unsafe_spills.xml +++ b/docs/topics/impala_disable_unsafe_spills.xml @@ -2,7 +2,7 @@ - DISABLE_UNSAFE_SPILLS Query Option (CDH 5.2 or higher only) + DISABLE_UNSAFE_SPILLS Query Option (<keyword keyref="impala20"/> or higher only) DISABLE_UNSAFE_SPILLS http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_dml.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_dml.xml b/docs/topics/impala_dml.xml index ecce473..b6c6399 100644 --- a/docs/topics/impala_dml.xml +++ b/docs/topics/impala_dml.xml @@ -25,20 +25,24 @@

        -
      • - ; works for Kudu tables only +
      • + . Works for Kudu tables only.
      • - + .
      • - + . Does not apply for HBase or Kudu tables.
      • -
      • - ; works for Kudu tables only +
      • + . Works for Kudu tables only. +
      • + +
      • + . Works for Kudu tables only.
      @@ -58,21 +62,23 @@

      - To simulate the effects of an UPDATE or DELETE statement in other database - systems, typically you use INSERT or CREATE TABLE AS SELECT to copy data + In and higher, Impala does support the UPDATE, DELETE, + and UPSERT statements for Kudu tables. + For HDFS or S3 tables, to simulate the effects of an UPDATE or DELETE statement + in other database systems, typically you use INSERT or CREATE TABLE AS SELECT to copy data from one table to another, filtering out or changing the appropriate rows during the copy operation.
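For illustration only (not part of the commit), a sketch of the copy-and-filter technique described above for HDFS or S3 tables. The table names, column names, and status values are hypothetical.

```sql
-- Simulated DELETE: copy every row except the ones to discard.
-- The IS NULL test keeps rows whose status is NULL, which a bare != would drop.
CREATE TABLE orders_kept AS
  SELECT * FROM orders
  WHERE status IS NULL OR status != 'cancelled';

-- Simulated UPDATE: change the affected values while copying.
CREATE TABLE orders_fixed AS
  SELECT id,
         CASE WHEN status = 'canceled' THEN 'cancelled' ELSE status END AS status
  FROM orders;
```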

      - Although Impala currently does not have an UPDATE statement, you can achieve a similar - result by using Impala tables stored in HBase. When you insert a row into an HBase table, and the table + You can also achieve a result similar to UPDATE by using Impala tables stored in HBase. + When you insert a row into an HBase table, and the table already contains a row with the same value for the key column, the older row is hidden, effectively the same as a single-row UPDATE.

      -

      - Currently, Impala cannot perform DML operations for tables or partitions stored in the Amazon S3 filesystem. - See for details. +

      + Impala can perform DML operations for tables or partitions stored in the Amazon S3 filesystem + with and higher. See for details.

      http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_drop_database.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_drop_database.xml b/docs/topics/impala_drop_database.xml index fca7a60..fb36dca 100644 --- a/docs/topics/impala_drop_database.xml +++ b/docs/topics/impala_drop_database.xml @@ -37,7 +37,7 @@

      - In CDH 5.5 / Impala 2.3 and higher, you can include the CASCADE + In and higher, you can include the CASCADE clause to make Impala drop all tables and other objects in the database before dropping the database itself. The RESTRICT clause enforces the original requirement that the database be empty before being dropped. Because the RESTRICT behavior is still the default, this http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_drop_function.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_drop_function.xml b/docs/topics/impala_drop_function.xml index 0f6c33b..dfad51b 100644 --- a/docs/topics/impala_drop_function.xml +++ b/docs/topics/impala_drop_function.xml @@ -37,7 +37,7 @@

      The preceding syntax, which includes the function signature, also applies to Java UDFs that were created using the corresponding CREATE FUNCTION syntax that includes the argument and return types. - After upgrading to CDH 5.7 / Impala 2.5 or higher, consider re-creating all Java UDFs with the + After upgrading to or higher, consider re-creating all Java UDFs with the CREATE FUNCTION syntax that does not include the function signature. Java UDFs created this way are now persisted in the metastore database and do not need to be re-created after an Impala restart.

      @@ -84,7 +84,7 @@ DROP FUNCTION IF EXISTS bar;

      The following example shows how to drop Java functions created with the signatureless - CREATE FUNCTION syntax in CDH 5.7 / Impala 2.5 and higher. + CREATE FUNCTION syntax in and higher. Issuing DROP FUNCTION function_name removes all the overloaded functions under that name. (See for a longer example http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_drop_role.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_drop_role.xml b/docs/topics/impala_drop_role.xml index b60f465..7383b46 100644 --- a/docs/topics/impala_drop_role.xml +++ b/docs/topics/impala_drop_role.xml @@ -2,7 +2,7 @@ - DROP ROLE Statement (CDH 5.2 or higher only) + DROP ROLE Statement (<keyword keyref="impala20"/> or higher only) DROP ROLE http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_drop_table.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_drop_table.xml b/docs/topics/impala_drop_table.xml index 81ce8c6..8a75158 100644 --- a/docs/topics/impala_drop_table.xml +++ b/docs/topics/impala_drop_table.xml @@ -48,7 +48,7 @@

      The optional PURGE keyword, available in - CDH 5.5 / Impala 2.3 and higher, causes Impala to remove the associated + and higher, causes Impala to remove the associated HDFS data files immediately, rather than going through the HDFS trashcan mechanism. Use this keyword when dropping a table if it is crucial to remove the data as quickly as possible to free up space, or if there is a @@ -113,7 +113,7 @@ drop table temporary.trivial;

      The DROP TABLE statement can remove data files from S3 if the associated S3 table is an internal table. - In CDH 5.8 / Impala 2.6 and higher, as part of improved support for writing + In and higher, as part of improved support for writing to S3, Impala also removes the associated folder when dropping an internal table that resides on S3. See for details about working with S3 tables. http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_exec_single_node_rows_threshold.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_exec_single_node_rows_threshold.xml b/docs/topics/impala_exec_single_node_rows_threshold.xml index c677a64..a7cf102 100644 --- a/docs/topics/impala_exec_single_node_rows_threshold.xml +++ b/docs/topics/impala_exec_single_node_rows_threshold.xml @@ -2,7 +2,7 @@ - EXEC_SINGLE_NODE_ROWS_THRESHOLD Query Option (CDH 5.3 or higher only) + EXEC_SINGLE_NODE_ROWS_THRESHOLD Query Option (<keyword keyref="impala21"/> or higher only) EXEC_SINGLE_NODE_ROWS_THRESHOLD @@ -57,7 +57,7 @@

      - In CDH 5.5 / Impala 2.3 and higher, where Impala supports the complex data types STRUCT, + In and higher, where Impala supports the complex data types STRUCT, ARRAY, and MAP, if a query refers to any column of those types, the small-query optimization is turned off for that query regardless of the EXEC_SINGLE_NODE_ROWS_THRESHOLD setting. http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_faq.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_faq.xml b/docs/topics/impala_faq.xml index 94b0b33..852beaf 100644 --- a/docs/topics/impala_faq.xml +++ b/docs/topics/impala_faq.xml @@ -672,7 +672,7 @@ through the DEFAULT_ORDER_BY_LIMIT query option.

      - In Impala 2.2 / CDH 5.4 and higher, you can use the log redaction feature + In and higher, you can use the log redaction feature to obfuscate sensitive information in Impala log files. See @@ -1256,10 +1256,7 @@ Should consolidate since makes sense to have one faq_hive ID.

      You can enable HDFS HA for the Hive metastore. See the - CDH5 High Availability Guide - or the - CDH4 High Availability Guide for details.

      http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_fixed_issues.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_fixed_issues.xml b/docs/topics/impala_fixed_issues.xml index efc7a6b..f55d95c 100644 --- a/docs/topics/impala_fixed_issues.xml +++ b/docs/topics/impala_fixed_issues.xml @@ -3011,7 +3011,7 @@ IMPALA-1120: Fetch column statistics using Hive 0.13 bulk API - also in 5.2.4 / IMPALA-1674

      - +
      @@ -3032,7 +3032,7 @@ IMPALA-1120: Fetch column statistics using Hive 0.13 bulk API - also in 5.2.4 / IMPALA-1668

      - +
      http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_grant.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_grant.xml b/docs/topics/impala_grant.xml index 7204445..a14a717 100644 --- a/docs/topics/impala_grant.xml +++ b/docs/topics/impala_grant.xml @@ -2,7 +2,7 @@ - GRANT Statement (CDH 5.2 or higher only) + GRANT Statement (<keyword keyref="impala20"/> or higher only) GRANT @@ -73,7 +73,7 @@ object_type ::= TABLE | DATABASE | SERVER | URI

      The ability to grant or revoke SELECT privilege on specific columns is available - in CDH 5.5 / Impala 2.3 and higher. See + in and higher. See for details.

      http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_group_by.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_group_by.xml b/docs/topics/impala_group_by.xml index 055ae2a..2c6ade3 100644 --- a/docs/topics/impala_group_by.xml +++ b/docs/topics/impala_group_by.xml @@ -31,7 +31,7 @@

      - In CDH 5.5 / Impala 2.3 and higher, the complex data types STRUCT, + In and higher, the complex data types STRUCT, ARRAY, and MAP are available. These columns cannot be referenced directly in the ORDER BY clause. When you query a complex type column, you use join notation to unpack the elements http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_hbase.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_hbase.xml b/docs/topics/impala_hbase.xml index 0260545..8b8d013 100644 --- a/docs/topics/impala_hbase.xml +++ b/docs/topics/impala_hbase.xml @@ -35,14 +35,9 @@

      For background information on HBase, see the snapshot of the Apache HBase site (including documentation) for the level of HBase that comes with - CDH 4 or CDH 5. To install HBase on a CDH cluster, see the installation instructions for - CDH - 4 or - - CDH - 5. + CDH 5.

      @@ -750,10 +745,13 @@ set hbase_caching=1000;

    • Because the complex data types (ARRAY, STRUCT, and MAP) - available in CDH 5.5 / Impala 2.3 and higher are currently only supported in Parquet tables, you cannot + available in and higher are currently only supported in Parquet tables, you cannot use these types in HBase tables that are queried through Impala.

    • +
    • +

      +

    http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_hints.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_hints.xml b/docs/topics/impala_hints.xml index 3eef3d3..4c5145f 100644 --- a/docs/topics/impala_hints.xml +++ b/docs/topics/impala_hints.xml @@ -60,7 +60,7 @@ INSERT insert_clauses

    - In Impala 2.0 and higher, or CDH 5.2 and higher, you can also specify the hints inside comments that use + In and higher, you can also specify the hints inside comments that use either the /* */ or -- notation. Specify a + symbol immediately before the hint name.
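For illustration only (not part of the commit), a sketch of both comment notations for hints. The tables `t1` and `t2` and their columns are hypothetical.

```sql
-- Hint inside a /* */ comment, with + immediately before the hint name.
SELECT t1.id, t2.name
  FROM t1 JOIN /* +BROADCAST */ t2 ON t1.id = t2.id;

-- Hint inside a -- comment; the comment runs to the end of the line,
-- so the rest of the statement continues on the next line.
SELECT t1.id, t2.name
  FROM t1 JOIN -- +SHUFFLE
  t2 ON t1.id = t2.id;
```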

    http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_impala_shell.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_impala_shell.xml b/docs/topics/impala_impala_shell.xml index 58a96bc..afbfcaa 100644 --- a/docs/topics/impala_impala_shell.xml +++ b/docs/topics/impala_impala_shell.xml @@ -51,7 +51,7 @@ on the command line using the notation --var=variable_name=value. Within a SQL statement, you substitute the value by using the notation ${var:variable_name}. - This feature is available in CDH 5.7 / Impala 2.5 and higher. + This feature is available in and higher.
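For illustration only (not part of the commit), a sketch of the variable substitution described above. The variable name `tname`, the table `customers`, and the file name `query.sql` are hypothetical.

```sql
-- Contents of a hypothetical file query.sql, run for example as:
--   impala-shell --var=tname=customers -f query.sql
-- impala-shell replaces ${var:tname} with the value given on the command line.
SELECT COUNT(*) FROM ${var:tname};
```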
  • http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_incompatible_changes.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_incompatible_changes.xml b/docs/topics/impala_incompatible_changes.xml index a7bafcd..88a306f 100644 --- a/docs/topics/impala_incompatible_changes.xml +++ b/docs/topics/impala_incompatible_changes.xml @@ -243,7 +243,7 @@ Formerly, all Impala UDFs and UDAs required running the CREATE FUNCTION statements to re-create them after each catalogd restart. - In CDH 5.7 / Impala 2.5 and higher, functions written in C++ are persisted across + In and higher, functions written in C++ are persisted across restarts, and the requirement to re-create functions only applies to functions written in Java. Adapt any function-reloading logic that you have added to your Impala environment. http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_jdbc.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_jdbc.xml b/docs/topics/impala_jdbc.xml index 88b06aa..d667170 100644 --- a/docs/topics/impala_jdbc.xml +++ b/docs/topics/impala_jdbc.xml @@ -137,10 +137,7 @@

    To get the JAR files, install the Hive JDBC driver on each CDH-enabled host in the cluster that will run JDBC applications. Follow the instructions for - CDH - 5 or - CDH - 4. + CDH 5.

    @@ -358,7 +355,7 @@
  • - The complex types available in CDH 5.5 / Impala 2.3 and higher are supported by the + The complex types available in and higher are supported by the JDBC getColumns() API. Both MAP and ARRAY are reported as the JDBC SQL Type ARRAY, because this is the closest matching Java SQL type. This behavior is consistent with Hive. http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_joins.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_joins.xml b/docs/topics/impala_joins.xml index 0e807e8..ddd2f2b 100644 --- a/docs/topics/impala_joins.xml +++ b/docs/topics/impala_joins.xml @@ -177,7 +177,7 @@ SELECT * FROM t1 FULL OUTER JOIN t2 ON t1.id = t2.id; SELECT * FROM t1 CROSS JOIN t2 WHERE t1.total > t2.maximum_price;

    - In CDH 5.5 / Impala 2.3 and higher, additional non-equijoin queries are possible due to the addition + In and higher, additional non-equijoin queries are possible due to the addition of nested loop joins. These queries typically involve SEMI JOIN, ANTI JOIN, or FULL OUTER JOIN clauses. Impala sometimes also uses nested loop joins internally when evaluating OUTER JOIN @@ -234,12 +234,12 @@ ERROR: NotImplementedException: Join with 't2' requires at least one conjunctive SELECT t1.c1, t2.c2 FROM t1 JOIN t2 USING (id, type_flag, name, address);

    - Anti-joins (Impala 2.0 / CDH 5.2 and higher only): + Anti-joins ( and higher only):

    - Impala supports the LEFT ANTI JOIN and RIGHT ANTI JOIN clauses in Impala - 2.0 and higher on CDH 4, or CDH 5.2 and higher on CDH 5. The LEFT or RIGHT + Impala supports the LEFT ANTI JOIN and RIGHT ANTI JOIN clauses in + and higher. The LEFT or RIGHT keyword is required for this kind of join. For LEFT ANTI JOIN, this clause returns those values from the left-hand table that have no matching value in the right-hand table. RIGHT ANTI JOIN reverses the comparison and returns values from the right-hand table. You can express this @@ -310,7 +310,7 @@ are returned only if no matching values are present in another table.

  • - In CDH 5.5 / Impala 2.3 or higher, when referring to complex type columns in queries. + In or higher, when referring to complex type columns in queries. See for details.
@@ -481,7 +481,7 @@ Returned 1 row(s) in 1.00s By default, join key columns do not match if either one contains a NULL value. To treat such columns as equal if both contain NULL, you can use an expression such as A = B OR (A IS NULL AND B IS NULL). - In CDH 5.7 / Impala 2.5 and higher, the <=> operator (shorthand for + In and higher, the <=> operator (shorthand for IS NOT DISTINCT FROM) performs the same comparison in a concise and efficient form. The <=> operator is more efficient for comparing join keys in a NULL-safe manner, because the operator can use a hash join while the OR expression cannot. http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_kerberos.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_kerberos.xml b/docs/topics/impala_kerberos.xml index d23186f..6393580 100644 --- a/docs/topics/impala_kerberos.xml +++ b/docs/topics/impala_kerberos.xml @@ -20,11 +20,7 @@

Impala supports Kerberos authentication. For more information on enabling Kerberos authentication, see the topic on Configuring Hadoop Security in the - CDH4 - Security Guide or the - - CDH - 5 Security Guide. + CDH 5 Security Guide.

@@ -116,10 +112,7 @@

To enable Impala to work with Kerberos security on your Hadoop cluster, make sure you perform the installation and configuration steps in - - Authentication in the CDH 5 Security Guide - or - the topic on Configuring Hadoop Security in the CDH4 Security Guide. + Authentication in the CDH 5 Security Guide. Also note that when Kerberos security is enabled in Impala, a web browser that supports Kerberos HTTP SPNEGO is required to access the Impala web console (for example, Firefox, Internet Explorer, or Chrome). http://git-wip-us.apache.org/repos/asf/incubator-impala/blob/04cd1c16/docs/topics/impala_known_issues.xml ---------------------------------------------------------------------- diff --git a/docs/topics/impala_known_issues.xml b/docs/topics/impala_known_issues.xml index 7b9ec2b..e57ec62 100644 --- a/docs/topics/impala_known_issues.xml +++ b/docs/topics/impala_known_issues.xml @@ -531,7 +531,7 @@ impala-shell -s impala --ssl --ca_cert /path_to_truststore/truststore.pem

- The default heap size for Impala catalogd has changed in CDH 5.7 / Impala 2.5 and higher: + The default heap size for Impala catalogd has changed in and higher:

    @@ -613,12 +613,12 @@ impala-shell -s impala --ssl --ca_cert /path_to_truststore/truststore.pem

    - The initial release of CDH 5.8 / Impala 2.6 sometimes has a higher peak memory usage than in previous releases while reading + The initial release of sometimes has a higher peak memory usage than in previous releases while reading Parquet files.

    - CDH 5.8 / Impala 2.6 addresses the issue IMPALA-2736, which improves the efficiency of Parquet scans by up to 2x. The faster scans + addresses the issue IMPALA-2736, which improves the efficiency of Parquet scans by up to 2x. The faster scans may result in a higher peak memory consumption compared to earlier versions of Impala due to the new column-wise row materialization strategy. You are likely to experience higher memory consumption in any of the following scenarios: