Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id E4FAC200C65 for ; Wed, 22 Mar 2017 07:09:58 +0100 (CET) Received: by cust-asf.ponee.io (Postfix) id E37E0160B9E; Wed, 22 Mar 2017 06:09:58 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id 15A18160BA3 for ; Wed, 22 Mar 2017 07:09:55 +0100 (CET) Received: (qmail 82798 invoked by uid 500); 22 Mar 2017 06:09:55 -0000 Mailing-List: contact commits-help@bigtop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: bigtop-dev@bigtop.apache.org Delivered-To: mailing list commits@bigtop.apache.org Received: (qmail 81283 invoked by uid 99); 22 Mar 2017 06:09:52 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 22 Mar 2017 06:09:52 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id 9283AE965C; Wed, 22 Mar 2017 06:09:52 +0000 (UTC) Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit From: rvs@apache.org To: commits@bigtop.apache.org Date: Wed, 22 Mar 2017 06:10:13 -0000 Message-Id: In-Reply-To: <69863c16852949b9998cae3af818e0a6@git.apache.org> References: <69863c16852949b9998cae3af818e0a6@git.apache.org> X-Mailer: ASF-Git Admin Mailer Subject: [22/52] bigtop git commit: ODPI-193. Add Hive 1.2 to ODPi ambari reference implementation archived-at: Wed, 22 Mar 2017 06:09:59 -0000 http://git-wip-us.apache.org/repos/asf/bigtop/blob/ba8d7f50/bigtop-packages/src/common/ambari/ODPi/1.0/services/HIVE/configuration/hive-site.xml ---------------------------------------------------------------------- diff --git a/bigtop-packages/src/common/ambari/ODPi/1.0/services/HIVE/configuration/hive-site.xml b/bigtop-packages/src/common/ambari/ODPi/1.0/services/HIVE/configuration/hive-site.xml new file mode 100755 index 0000000..cddb624 --- /dev/null +++ b/bigtop-packages/src/common/ambari/ODPi/1.0/services/HIVE/configuration/hive-site.xml @@ -0,0 +1,2796 @@ + + hive.default.fileformat.managed + TextFile + + Default file format for CREATE TABLE statement applied to managed tables only. + External tables will be created with default file format. Leaving this null + will result in using the default file format for all tables. + + hive-site.xml + false + + + + + + + datanucleus.rdbms.datastoreAdapterClassName + Datanucleus Class, This property used only when hive db is SQL Anywhere + hive-site.xml + false + + + + + + hive_database + hive-env + + + + + atlas.hook.hive.minThreads + 1 + + Minimum number of threads maintained by Atlas hook. + + hive-site.xml + false + + + + + + + atlas.hook.hive.maxThreads + 1 + + Maximum number of threads used by Atlas hook. + + hive-site.xml + false + + + + + + + hive.cbo.enable + true + Flag to control enabling Cost Based Optimizations using Calcite framework. + Enable Cost Based Optimizer + hive-site.xml + false + + + + value-list + + + true + On + + + false + Off + + + 1 + + + + + hive.compute.query.using.stats + hive-site + + + hive.stats.fetch.partition.stats + hive-site + + + hive.stats.fetch.column.stats + hive-site + + + + hive.zookeeper.quorum + localhost:2181 + List of ZooKeeper servers to talk to. This is needed for: 1. + Read/write locks - when hive.lock.manager is set to + org.apache.hadoop.hive.ql.lockmgr.zookeeper.ZooKeeperHiveLockManager, + 2. When HiveServer2 supports service discovery via Zookeeper. + hive-site.xml + false + + + + multiLine + true + + + + + hive.metastore.connect.retries + 24 + Number of retries while opening a connection to metastore + hive-site.xml + false + + + + + + + hive.metastore.failure.retries + 24 + Number of retries upon failure of Thrift metastore calls + hive-site.xml + false + + + + + + + hive.metastore.client.connect.retry.delay + 5s + + Expects a time value with unit (d/day, h/hour, m/min, s/sec, ms/msec, us/usec, ns/nsec), which is sec if not specified. + Number of seconds for the client to wait between consecutive connection attempts + + hive-site.xml + false + + + + + + + hive.metastore.client.socket.timeout + 1800s + + Expects a time value with unit (d/day, h/hour, m/min, s/sec, ms/msec, us/usec, ns/nsec), which is sec if not specified. + MetaStore Client socket timeout in seconds + + hive-site.xml + false + + + + + + + hive.mapjoin.bucket.cache.size + 10000 + + hive-site.xml + false + + + + + + + hive.security.authorization.manager + org.apache.hadoop.hive.ql.security.authorization.plugin.sqlstd.SQLStdConfOnlyAuthorizerFactory + + The Hive client authorization manager class name. The user defined authorization class should implement + interface org.apache.hadoop.hive.ql.security.authorization.HiveAuthorizationProvider. + + hive-site.xml + false + + + + + + hive_security_authorization + hive-env + + + + + hive.cluster.delegation.token.store.class + org.apache.hadoop.hive.thrift.ZooKeeperTokenStore + The delegation token store implementation. + Set to org.apache.hadoop.hive.thrift.ZooKeeperTokenStore for load-balanced cluster. + hive-site.xml + false + + + + + + + hive.cluster.delegation.token.store.zookeeper.connectString + localhost:2181 + The ZooKeeper token store connect string. + hive-site.xml + false + + + + + + + hive.server2.support.dynamic.service.discovery + true + Whether HiveServer2 supports dynamic service discovery for its clients. + To support this, each instance of HiveServer2 currently uses ZooKeeper to register itself, + when it is brought up. JDBC/ODBC clients should use the ZooKeeper ensemble: hive.zookeeper.quorum + in their connection string. + + hive-site.xml + false + + + + boolean + + + + + hive.exec.scratchdir + /tmp/hive + HDFS root scratch dir for Hive jobs which gets created with write all (733) permission. For each connecting user, an HDFS scratch dir: ${hive.exec.scratchdir}/<username> is created, with ${hive.scratch.dir.permission}. + hive-site.xml + false + + + + + + + hive.exec.submitviachild + false + + hive-site.xml + false + + + + + + + hive.exec.submit.local.task.via.child + true + + Determines whether local tasks (typically mapjoin hashtable generation phase) runs in + separate JVM (true recommended) or not. + Avoids the overhead of spawning new JVM, but can lead to out-of-memory issues. + + hive-site.xml + false + + + + + + + hive.exec.compress.output + false + + This controls whether the final outputs of a query (to a local/HDFS file or a Hive table) is compressed. + The compression codec and other options are determined from Hadoop config variables mapred.output.compress* + + hive-site.xml + false + + + + + + + hive.exec.compress.intermediate + false + + This controls whether intermediate files produced by Hive between multiple map-reduce jobs are compressed. + The compression codec and other options are determined from Hadoop config variables mapred.output.compress* + + hive-site.xml + false + + + + + + + hive.exec.reducers.bytes.per.reducer + 67108864 + Defines the size per reducer. For example, if it is set to 64M, given 256M input size, 4 reducers will be used. + Data per Reducer + hive-site.xml + false + + + + int + 4294967296 + 64 + B + + + + + + hive.exec.reducers.max + 1009 + + max number of reducers will be used. If the one specified in the configuration parameter mapred.reduce.tasks is + negative, Hive will use this one as the max number of reducers when automatically determine number of reducers. + + hive-site.xml + false + + + + + + + hive.exec.pre.hooks + org.apache.hadoop.hive.ql.hooks.ATSHook + + Comma-separated list of pre-execution hooks to be invoked for each statement. + A pre-execution hook is specified as the name of a Java class which implements the + org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext interface. + + hive-site.xml + false + + + + + + hive_timeline_logging_enabled + hive-env + + + + + hive.exec.post.hooks + org.apache.hadoop.hive.ql.hooks.ATSHook + + Comma-separated list of post-execution hooks to be invoked for each statement. + A post-execution hook is specified as the name of a Java class which implements the + org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext interface. + + hive-site.xml + false + + + + + + atlas.server.https.port + application-properties + + + atlas.server.http.port + application-properties + + + hive_timeline_logging_enabled + hive-env + + + + + hive.exec.failure.hooks + org.apache.hadoop.hive.ql.hooks.ATSHook + + Comma-separated list of on-failure hooks to be invoked for each statement. + An on-failure hook is specified as the name of Java class which implements the + org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext interface. + + hive-site.xml + false + + + + + + hive_timeline_logging_enabled + hive-env + + + + + hive.exec.parallel + false + Whether to execute jobs in parallel + hive-site.xml + false + + + + + + + hive.exec.parallel.thread.number + 8 + How many jobs at most can be executed in parallel + hive-site.xml + false + + + + + + + hive.mapred.reduce.tasks.speculative.execution + false + Whether speculative execution for reducers should be turned on. + hive-site.xml + false + + + + + + + hive.exec.dynamic.partition + true + Whether or not to allow dynamic partitions in DML/DDL. + hive-site.xml + false + + + + + + + hive.exec.dynamic.partition.mode + nonstrict + + In strict mode, the user must specify at least one static partition + in case the user accidentally overwrites all partitions. + NonStrict allows all partitions of a table to be dynamic. + + Allow all partitions to be Dynamic + hive-site.xml + false + + + + value-list + + + nonstrict + On + + + strict + Off + + + 1 + + + + hive_txn_acid + hive-env + + + + + hive.exec.max.dynamic.partitions + 5000 + Maximum number of dynamic partitions allowed to be created in total. + hive-site.xml + false + + + + + + + hive.exec.max.dynamic.partitions.pernode + 2000 + Maximum number of dynamic partitions allowed to be created in each mapper/reducer node. + hive-site.xml + false + + + + + + + hive.exec.max.created.files + 100000 + Maximum number of HDFS files created by all mappers/reducers in a MapReduce job. + hive-site.xml + false + + + + + + + hive.metastore.warehouse.dir + /apps/hive/warehouse + location of default database for the warehouse + hive-site.xml + false + + NOT_MANAGED_HDFS_PATH + + + + + hive.metastore.uris + thrift://localhost:9083 + Thrift URI for the remote metastore. Used by metastore client to connect to remote metastore. + hive-site.xml + false + + + + + + + javax.jdo.option.ConnectionPassword + + password to use against metastore database + Database Password + hive-site.xml + false + + PASSWORD + + password + false + HIVE_CLIENT,WEBHCAT_SERVER,HCAT,CONFIG_DOWNLOAD + + + + + javax.jdo.option.ConnectionURL + jdbc:mysql://localhost/hive?createDatabaseIfNotExist=true + JDBC connect string for a JDBC metastore + Database URL + hive-site.xml + false + + + + false + + + + hive_database + hive-env + + + ambari.hive.db.schema.name + hive-site + + + + + hive.metastore.server.max.threads + 100000 + Maximum number of worker threads in the Thrift server's pool. + hive-site.xml + false + + + + + + + hive.metastore.kerberos.keytab.file + /etc/security/keytabs/hive.service.keytab + The path to the Kerberos Keytab file containing the metastore Thrift server's service principal. + hive-site.xml + false + + + + + + + hive.metastore.kerberos.principal + hive/_HOST@EXAMPLE.COM + + The service principal for the metastore Thrift server. + The special string _HOST will be replaced automatically with the correct host name. + + hive-site.xml + false + + + + + + + hive.cluster.delegation.token.store.zookeeper.znode + /hive/cluster/delegation + The root path for token store data. + hive-site.xml + false + + + + + + + hive.metastore.cache.pinobjtypes + Table,Database,Type,FieldSchema,Order + List of comma separated metastore object types that should be pinned in the cache + hive-site.xml + false + + + + + + + hive.metastore.pre.event.listeners + org.apache.hadoop.hive.ql.security.authorization.AuthorizationPreEventListener + List of comma separated listeners for metastore events. + hive-site.xml + false + + + + + + + hive.metastore.authorization.storage.checks + false + + Should the metastore do authorization checks against the underlying storage (usually hdfs) + for operations like drop-partition (disallow the drop-partition if the user in + question doesn't have permissions to delete the corresponding directory + on the storage). + + hive-site.xml + false + + + + + + + javax.jdo.option.ConnectionDriverName + com.mysql.jdbc.Driver + Driver class name for a JDBC metastore + JDBC Driver Class + hive-site.xml + false + + + + false + + + + hive_database + hive-env + + + + + javax.jdo.option.ConnectionUserName + hive + username to use against metastore database + Database Username + hive-site.xml + false + + + + db_user + false + + + + + hive.map.aggr + true + Whether to use map-side aggregation in Hive Group By queries + hive-site.xml + false + + + + + + + hive.cbo.enable + true + Flag to control enabling Cost Based Optimizations using Calcite framework. + hive-site.xml + false + + + + + + + hive.mapjoin.optimized.hashtable + true + + Whether Hive should use memory-optimized hash table for MapJoin. Only works on Tez, + because memory-optimized hashtable cannot be serialized. + + hive-site.xml + false + + + + + + + hive.smbjoin.cache.rows + 10000 + How many rows with the same key value should be cached in memory per smb joined table. + hive-site.xml + false + + + + + + + hive.map.aggr.hash.percentmemory + 0.5 + Portion of total memory to be used by map-side group aggregation hash table + hive-site.xml + false + + + + + + + hive.map.aggr.hash.force.flush.memory.threshold + 0.9 + + The max memory to be used by map-side group aggregation hash table. + If the memory usage is higher than this number, force to flush data + + hive-site.xml + false + + + + + + + hive.map.aggr.hash.min.reduction + 0.5 + + Hash aggregation will be turned off if the ratio between hash table size and input rows is bigger than this number. + Set to 1 to make sure hash aggregation is never turned off. + + hive-site.xml + false + + + + + + + hive.merge.mapfiles + true + Merge small files at the end of a map-only job + hive-site.xml + false + + + + + + + hive.merge.mapredfiles + false + Merge small files at the end of a map-reduce job + hive-site.xml + false + + + + + + + hive.merge.tezfiles + false + Merge small files at the end of a Tez DAG + hive-site.xml + false + + + + + + + hive.merge.size.per.task + 256000000 + Size of merged files at the end of the job + hive-site.xml + false + + + + + + + hive.merge.smallfiles.avgsize + 16000000 + + When the average output file size of a job is less than this number, Hive will start an additional + map-reduce job to merge the output files into bigger files. This is only done for map-only jobs + if hive.merge.mapfiles is true, and for map-reduce jobs if hive.merge.mapredfiles is true. + + hive-site.xml + false + + + + + + + hive.merge.rcfile.block.level + true + + hive-site.xml + false + + + + + + + hive.merge.orcfile.stripe.level + true + + When hive.merge.mapfiles or hive.merge.mapredfiles is enabled while writing a + table with ORC file format, enabling this config will do stripe level fast merge + for small ORC files. Note that enabling this config will not honor padding tolerance + config (hive.exec.orc.block.padding.tolerance). + + hive-site.xml + false + + + + + + + hive.exec.orc.default.stripe.size + 67108864 + Define the default ORC stripe size + Default ORC Stripe Size + hive-site.xml + false + + + + int + 268435456 + 8388608 + B + 8388608 + + + + + hive.exec.orc.default.compress + ZLIB + Define the default compression codec for ORC file + ORC Compression Algorithm + hive-site.xml + false + + + + value-list + + + ZLIB + zlib Compression Library + + + SNAPPY + Snappy Compression Library + + + 1 + + + + + hive.orc.splits.include.file.footer + false + + If turned on splits generated by orc will include metadata about the stripes in the file. This + data is read remotely (from the client or HS2 machine) and sent to all the tasks. + + hive-site.xml + false + + + + + + + hive.orc.compute.splits.num.threads + 10 + How many threads orc should use to create splits in parallel. + hive-site.xml + false + + + + + + + hive.auto.convert.join + true + Whether Hive enables the optimization about converting common join into mapjoin based on the input file size + hive-site.xml + false + + + + + + + hive.auto.convert.join.noconditionaltask + true + + Whether Hive enables the optimization about converting common join into mapjoin based on the input file size. + If this parameter is on, and the sum of size for n-1 of the tables/partitions for a n-way join is smaller than the + specified size, the join is directly converted to a mapjoin (there is no conditional task). + + hive-site.xml + false + + + + + + + hive.limit.optimize.enable + true + Whether to enable to optimization to trying a smaller subset of data for simple LIMIT first. + hive-site.xml + false + + + + + + + hive.tez.cpu.vcores + -1 + By default Tez will ask for however many cpus map-reduce is configured to use per container. This can be used to overwrite. + hive-site.xml + false + + + + + + + hive.tez.log.level + INFO + + The log level to use for tasks executing as part of the DAG. + Used only if hive.tez.java.opts is used to configure Java options. + + hive-site.xml + false + + + + + + + hive.enforce.bucketing + true + Whether bucketing is enforced. If true, while inserting into the table, bucketing is enforced. + Enforce bucketing + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + hive_txn_acid + hive-env + + + + + hive.enforce.sorting + true + Whether sorting is enforced. If true, while inserting into the table, sorting is enforced. + hive-site.xml + false + + + + + + + hive.enforce.sortmergebucketmapjoin + true + If the user asked for sort-merge bucketed map-side join, and it cannot be performed, should the query fail or not ? + hive-site.xml + false + + + + + + + hive.auto.convert.sortmerge.join + true + Will the join be automatically converted to a sort-merge join, if the joined tables pass the criteria for sort-merge join. + hive-site.xml + false + + + + + + + hive.auto.convert.sortmerge.join.to.mapjoin + false + + If hive.auto.convert.sortmerge.join is set to true, and a join was converted to a sort-merge join, + this parameter decides whether each table should be tried as a big table, and effectively a map-join should be + tried. That would create a conditional task with n+1 children for a n-way join (1 child for each table as the + big table), and the backup task will be the sort-merge join. In some cases, a map-join would be faster than a + sort-merge join, if there is no advantage of having the output bucketed and sorted. For example, if a very big sorted + and bucketed table with few files (say 10 files) are being joined with a very small sorter and bucketed table + with few files (10 files), the sort-merge join will only use 10 mappers, and a simple map-only join might be faster + if the complete small table can fit in memory, and a map-join can be performed. + + hive-site.xml + false + + + + + + + hive.optimize.constant.propagation + true + Whether to enable constant propagation optimizer + hive-site.xml + false + + + + + + + hive.optimize.metadataonly + true + + hive-site.xml + false + + + + + + + hive.optimize.null.scan + true + Dont scan relations which are guaranteed to not generate any rows + hive-site.xml + false + + + + + + + hive.optimize.bucketmapjoin + true + Whether to try bucket mapjoin + hive-site.xml + false + + + + + + + hive.optimize.reducededuplication + true + + Remove extra map-reduce jobs if the data is already clustered by the same key which needs to be used again. + This should always be set to true. Since it is a new feature, it has been made configurable. + + hive-site.xml + false + + + + + + + hive.optimize.reducededuplication.min.reducer + 4 + + Reduce deduplication merges two RSs by moving key/parts/reducer-num of the child RS to parent RS. + That means if reducer-num of the child RS is fixed (order by or forced bucketing) and small, it can make very slow, single MR. + The optimization will be automatically disabled if number of reducers would be less than specified value. + + hive-site.xml + false + + + + + + + hive.optimize.sort.dynamic.partition + false + + When enabled dynamic partitioning column will be globally sorted. + This way we can keep only one record writer open for each partition value + in the reducer thereby reducing the memory pressure on reducers. + + Sort Partitions Dynamically + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + + hive.stats.autogather + true + A flag to gather statistics automatically during the INSERT OVERWRITE command. + hive-site.xml + false + + + + + + + hive.stats.dbclass + fs + + Expects one of the pattern in [jdbc(:.*), hbase, counter, custom, fs]. + The storage that stores temporary Hive statistics. Currently, jdbc, hbase, counter and custom type are supported. + + hive-site.xml + false + + + + + + + hive.stats.fetch.partition.stats + true + + Annotation of operator tree with statistics information requires partition level basic + statistics like number of rows, data size and file size. Partition statistics are fetched from + metastore. Fetching partition statistics for each needed partition can be expensive when the + number of partitions is high. This flag can be used to disable fetching of partition statistics + from metastore. When this flag is disabled, Hive will make calls to filesystem to get file sizes + and will estimate the number of rows from row schema. + + Fetch partition stats at compiler + hive-site.xml + false + + + + value-list + + + true + On + + + false + Off + + + 1 + + + + hive.cbo.enable + hive-site + + + + + hive.stats.fetch.column.stats + false + + Annotation of operator tree with statistics information requires column statistics. + Column statistics are fetched from metastore. Fetching column statistics for each needed column + can be expensive when the number of columns is high. This flag can be used to disable fetching + of column statistics from metastore. + + Fetch column stats at compiler + hive-site.xml + false + + + + value-list + + + true + On + + + false + Off + + + 1 + + + + hive.cbo.enable + hive-site + + + + + hive.zookeeper.client.port + 2181 + The port of ZooKeeper servers to talk to. If the list of Zookeeper servers specified in hive.zookeeper.quorum,does not contain port numbers, this value is used. + hive-site.xml + false + + + + + + + hive.zookeeper.namespace + hive_zookeeper_namespace + The parent node under which all ZooKeeper nodes are created. + hive-site.xml + false + + + + + + + hive.txn.manager + org.apache.hadoop.hive.ql.lockmgr.DummyTxnManager + + Transaction Manager + hive-site.xml + false + + + + value-list + + + org.apache.hadoop.hive.ql.lockmgr.DummyTxnManager + org.apache.hadoop.hive.ql.lockmgr.DummyTxnManager (off) + + + org.apache.hadoop.hive.ql.lockmgr.DbTxnManager + org.apache.hadoop.hive.ql.lockmgr.DbTxnManager (on) + + + 1 + + + + hive_txn_acid + hive-env + + + + + hive.txn.max.open.batch + 1000 + + Maximum number of transactions that can be fetched in one call to open_txns(). + Increasing this will decrease the number of delta files created when + streaming data into Hive. But it will also increase the number of + open transactions at any given time, possibly impacting read performance. + + hive-site.xml + false + + + + + + + hive.support.concurrency + false + + Support concurrency and use locks, needed for Transactions. Requires Zookeeper. + + Use Locking + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + hive_txn_acid + hive-env + + + + + hive.cli.print.header + false + + Whether to print the names of the columns in query output. + + hive-site.xml + false + + + + + + + hive.compactor.worker.timeout + 86400L + + Expects a time value with unit (d/day, h/hour, m/min, s/sec, ms/msec, us/usec, ns/nsec), which is sec if not specified. + Time before a given compaction in working state is declared a failure + and returned to the initiated state. + + hive-site.xml + false + + + + + + + hive.compactor.check.interval + 300L + + Expects a time value with unit (d/day, h/hour, m/min, s/sec, ms/msec, us/usec, ns/nsec), which is sec if not specified. + Time between checks to see if any partitions need compacted. + This should be kept high because each check for compaction requires many calls against the NameNode. + + hive-site.xml + false + + + + + + + hive.compactor.delta.pct.threshold + 0.1f + Percentage (by size) of base that deltas can be before major compaction is initiated. + hive-site.xml + false + + + + + + + hive.fetch.task.conversion + more + + Expects one of [none, minimal, more]. + Some select queries can be converted to single FETCH task minimizing latency. + Currently the query should be single sourced not having any subquery and should not have + any aggregations or distincts (which incurs RS), lateral views and joins. + 0. none : disable hive.fetch.task.conversion + 1. minimal : SELECT STAR, FILTER on partition columns, LIMIT only + 2. more : SELECT, FILTER, LIMIT only (support TABLESAMPLE and virtual columns) + + hive-site.xml + false + + + + + + + hive.fetch.task.conversion.threshold + 1073741824 + + Input threshold for applying hive.fetch.task.conversion. If target table is native, input length + is calculated by summation of file lengths. If it's not native, storage handler for the table + can optionally implement org.apache.hadoop.hive.ql.metadata.InputEstimator interface. + + hive-site.xml + false + + + + + + + hive.fetch.task.aggr + false + + Aggregation queries with no group-by clause (for example, select count(*) from src) execute + final aggregations in single reduce task. If this is set true, Hive delegates final aggregation + stage to fetch task, possibly decreasing the query time. + + hive-site.xml + false + + + + + + + hive.security.authorization.enabled + false + enable or disable the Hive client authorization + Enable Authorization + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + hive_security_authorization + hive-env + + + + + hive.security.authenticator.manager + org.apache.hadoop.hive.ql.security.ProxyUserAuthenticator + + hive client authenticator manager class name. The user defined authenticator should implement + interface org.apache.hadoop.hive.ql.security.HiveAuthenticationProvider. + + hive-site.xml + false + + + + + + hive_security_authorization + hive-env + + + + + hive.security.metastore.authorization.manager + org.apache.hadoop.hive.ql.security.authorization.StorageBasedAuthorizationProvider + + authorization manager class name to be used in the metastore for authorization. + The user defined authorization class should implement interface + org.apache.hadoop.hive.ql.security.authorization.HiveMetastoreAuthorizationProvider. + + Hive Authorization Manager + hive-site.xml + false + + + + + + hive_security_authorization + hive-env + + + + + hive.security.metastore.authorization.auth.reads + true + If this is true, metastore authorizer authorizes read actions on database, table + hive-site.xml + false + + + + + + + hive.security.metastore.authenticator.manager + org.apache.hadoop.hive.ql.security.HadoopDefaultMetastoreAuthenticator + + authenticator manager class name to be used in the metastore for authentication. + The user defined authenticator should implement interface org.apache.hadoop.hive.ql.security.HiveAuthenticationProvider. + + hive-site.xml + false + + + + + + + hive.server2.logging.operation.enabled + true + When true, HS2 will save operation logs + hive-site.xml + false + + + + + + + hive.server2.logging.operation.log.location + /tmp/hive/operation_logs + Top level directory where operation logs are stored if logging functionality is enabled + hive-site.xml + false + + + + + + + hive.server2.zookeeper.namespace + hiveserver2 + The parent node in ZooKeeper used by HiveServer2 when supporting dynamic service discovery. + hive-site.xml + false + + + + + + + hive.server2.thrift.http.port + 10001 + Port number of HiveServer2 Thrift interface when hive.server2.transport.mode is 'http'. + hive-site.xml + false + + + + + + + hive.server2.thrift.http.path + cliservice + Path component of URL endpoint when in HTTP mode. + hive-site.xml + false + + + + + + + hive.server2.thrift.port + 10000 + + TCP port number to listen on, default 10000. + + HiveServer2 Port + hive-site.xml + false + + + + int + false + + + + + hive.server2.thrift.sasl.qop + auth + + Expects one of [auth, auth-int, auth-conf]. + Sasl QOP value; Set it to one of following values to enable higher levels of + protection for HiveServer2 communication with clients. + "auth" - authentication only (default) + "auth-int" - authentication plus integrity protection + "auth-conf" - authentication plus integrity and confidentiality protection + This is applicable only if HiveServer2 is configured to use Kerberos authentication. + + hive-site.xml + false + + + + + + + hive.server2.thrift.max.worker.threads + 500 + Maximum number of Thrift worker threads + hive-site.xml + false + + + + + + + hive.server2.allow.user.substitution + true + Allow alternate user to be specified as part of HiveServer2 open connection request. + hive-site.xml + false + + + + + + + hive.server2.authentication.spnego.keytab + HTTP/_HOST@EXAMPLE.COM + + keytab file for SPNego principal, optional, + typical value would look like /etc/security/keytabs/spnego.service.keytab, + This keytab would be used by HiveServer2 when Kerberos security is enabled and + HTTP transport mode is used. + This needs to be set only if SPNEGO is to be used in authentication. + SPNego authentication would be honored only if valid + hive.server2.authentication.spnego.principal + and + hive.server2.authentication.spnego.keytab + are specified. + + hive-site.xml + false + + + + + + + hive.server2.authentication + NONE + Authentication mode, default NONE. Options are NONE, NOSASL, KERBEROS, LDAP, PAM and CUSTOM + HiveServer2 Authentication + hive-site.xml + false + + + + value-list + + + NONE + None + + + LDAP + LDAP + + + KERBEROS + Kerberos + + + PAM + PAM + + + CUSTOM + Custom + + + 1 + + + + + hive.server2.custom.authentication.class + hive-site + + + hive.server2.authentication.kerberos.principal + hive-site + + + hive.server2.authentication.kerberos.keytab + hive-site + + + hive.server2.authentication.ldap.url + hive-site + + + hive.server2.authentication.ldap.baseDN + hive-site + + + hive.server2.authentication.pam.services + hive-site + + + + hive.server2.authentication.spnego.principal + /etc/security/keytabs/spnego.service.keytab + + SPNego service principal, optional, + typical value would look like HTTP/_HOST@EXAMPLE.COM + SPNego service principal would be used by HiveServer2 when Kerberos security is enabled + and HTTP transport mode is used. + This needs to be set only if SPNEGO is to be used in authentication. + + hive-site.xml + false + + + + + + + hive.server2.enable.doAs + true + + Setting this property to true will have HiveServer2 execute + Hive operations as the user making the calls to it. + + Run as end user instead of Hive user + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + hive_security_authorization + hive-env + + + + + hive.server2.table.type.mapping + CLASSIC + + Expects one of [classic, hive]. + This setting reflects how HiveServer2 will report the table types for JDBC and other + client implementations that retrieve the available tables and supported table types + HIVE : Exposes Hive's native table types like MANAGED_TABLE, EXTERNAL_TABLE, VIRTUAL_VIEW + CLASSIC : More generic types like TABLE and VIEW + + hive-site.xml + false + + + + + + + hive.server2.use.SSL + false + + Set this to true for using SSL encryption in HiveServer2. + + Use SSL + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + + hive.conf.restricted.list + hive.security.authenticator.manager,hive.security.authorization.manager,hive.users.in.admin.role + Comma separated list of configuration options which are immutable at runtime + hive-site.xml + false + + + + + + + hive.user.install.directory + /user/ + + If hive (in tez mode only) cannot find a usable hive jar in "hive.jar.directory", + it will upload the hive jar to "hive.user.install.directory/user.name" + and use it to run queries. + + hive-site.xml + false + + + + + + + hive.vectorized.groupby.maxentries + 100000 + + Max number of entries in the vector group by aggregation hashtables. + Exceeding this will trigger a flush irrelevant of memory pressure condition. + + hive-site.xml + false + + + + + + + hive.prewarm.enabled + false + Enables container prewarm for Tez (Hadoop 2 only) + Hold Containers to Reduce Latency + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + + hive.prewarm.numcontainers + 3 + Controls the number of containers to prewarm for Tez (Hadoop 2 only) + Number of Containers Held + hive-site.xml + false + + + + int + 20 + 1 + 1 + + + + + hive.convert.join.bucket.mapjoin.tez + false + + Whether joins can be automatically converted to bucket map joins in hive + when tez is used as the execution engine. + + hive-site.xml + false + + + + + + + hive.tez.auto.reducer.parallelism + false + + Turn on Tez' auto reducer parallelism feature. When enabled, Hive will still estimate data sizes + and set parallelism estimates. Tez will sample source vertices' output sizes and adjust the estimates at runtime as + necessary. + + Allow dynamic numbers of reducers + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + + hive.tez.max.partition.factor + 2.0 + When auto reducer parallelism is enabled this factor will be used to over-partition data in shuffle edges. + hive-site.xml + false + + + + + + + hive.tez.min.partition.factor + 0.25 + + When auto reducer parallelism is enabled this factor will be used to put a lower limit to the number + of reducers that tez specifies. + + hive-site.xml + false + + + + + + + hive.tez.dynamic.partition.pruning + true + When dynamic pruning is enabled, joins on partition keys will be processed by sending events from the processing vertices to the tez application master. These events will be used to prune unnecessary partitions. + Allow dynamic partition pruning + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + + hive.tez.dynamic.partition.pruning.max.event.size + 1048576 + Maximum size of events sent by processors in dynamic pruning. If this size is crossed no pruning will take place. + hive-site.xml + false + + + + + + + hive.tez.dynamic.partition.pruning.max.data.size + 104857600 + Maximum total data size of events in dynamic pruning. + hive-site.xml + false + + + + + + + hive.tez.smb.number.waves + 0.5 + The number of waves in which to run the SMB join. Account for cluster being occupied. Ideally should be 1 wave. + hive-site.xml + false + + + + + + + ambari.hive.db.schema.name + hive + Database name used as the Hive Metastore + Database Name + hive-site.xml + false + + + + database + false + + + + + javax.jdo.option.ConnectionURL + hive-site + + + + hive.vectorized.execution.enabled + true + + This flag should be set to true to enable vectorized mode of query execution. + The default value is false. + + Enable Vectorization and Map Vectorization + hive-site.xml + false + + + + value-list + + + true + True + + + false + False + + + 1 + + + + + hive.auto.convert.join.noconditionaltask.size + 52428800 + If hive.auto.convert.join.noconditionaltask is off, this parameter does not take affect. However, if it + is on, and the sum of size for n-1 of the tables/partitions for a n-way join is smaller than this size, the join is directly + converted to a mapjoin(there is no conditional task). + + For Map Join, per Map memory threshold + hive-site.xml + false + + + + int + 17179869184 +