Return-Path: X-Original-To: apmail-ambari-commits-archive@www.apache.org Delivered-To: apmail-ambari-commits-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DF5DA18BB0 for ; Mon, 7 Mar 2016 20:54:47 +0000 (UTC) Received: (qmail 39418 invoked by uid 500); 7 Mar 2016 20:54:47 -0000 Delivered-To: apmail-ambari-commits-archive@ambari.apache.org Received: (qmail 39385 invoked by uid 500); 7 Mar 2016 20:54:47 -0000 Mailing-List: contact commits-help@ambari.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: ambari-dev@ambari.apache.org Delivered-To: mailing list commits@ambari.apache.org Received: (qmail 39287 invoked by uid 99); 7 Mar 2016 20:54:47 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 07 Mar 2016 20:54:47 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id 667EFDFBD7; Mon, 7 Mar 2016 20:54:47 +0000 (UTC) Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit From: alejandro@apache.org To: commits@ambari.apache.org Date: Mon, 07 Mar 2016 20:54:47 -0000 Message-Id: X-Mailer: ASF-Git Admin Mailer Subject: [1/2] ambari git commit: AMBARI-15305. Move Hive Server Interactive related files from HDP 2.4 stack to HDP 2.6 (Swapan Shridhar via alejandro) Repository: ambari Updated Branches: refs/heads/trunk 7cc897bd6 -> 037d93385 http://git-wip-us.apache.org/repos/asf/ambari/blob/037d9338/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/hive-interactive-site.xml ---------------------------------------------------------------------- diff --git a/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/hive-interactive-site.xml b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/hive-interactive-site.xml new file mode 100644 index 0000000..e77b379 --- /dev/null +++ b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/hive-interactive-site.xml @@ -0,0 +1,2053 @@ + + + + + + + + hive.cbo.enable + true + Flag to control enabling Cost Based Optimizations using Calcite framework. + Enable Cost Based Optimizer + + value-list + + + true + + + + false + + + + 1 + + + + + hive.zookeeper.quorum + localhost:2181 + List of ZooKeeper servers to talk to. This is needed for: 1. + Read/write locks - when hive.lock.manager is set to + org.apache.hadoop.hive.ql.lockmgr.zookeeper.ZooKeeperHiveLockManager, + 2. When HiveServer2 supports service discovery via Zookeeper. + + multiLine + true + + + + + hive.metastore.connect.retries + 24 + Number of retries while opening a connection to metastore + + + + hive.metastore.failure.retries + 24 + Number of retries upon failure of Thrift metastore calls + + + + hive.metastore.client.connect.retry.delay + 5s + + Expects a time value with unit (d/day, h/hour, m/min, s/sec, ms/msec, us/usec, ns/nsec), which is sec if not specified. + Number of seconds for the client to wait between consecutive connection attempts + + + + + hive.heapsize + 1024 + HiveServer2 heap size + true + Hive Java heap size + + int + + + + + ambari.hive.db.schema.name + hive + Database Name + Database name used as the Hive Metastore + + database + false + + + + + javax.jdo.option.ConnectionURL + jdbc:mysql://localhost/hive?createDatabaseIfNotExist=true + Database URL + JDBC connect string for a JDBC metastore + + false + + + + + javax.jdo.option.ConnectionDriverName + com.mysql.jdbc.Driver + JDBC Driver Class + Driver class name for a JDBC metastore + + false + + + + hive_database + hive-env + + + + + + javax.jdo.option.ConnectionUserName + hive + Database Username + username to use against metastore database + + db_user + false + + + + + javax.jdo.option.ConnectionPassword + + PASSWORD + Database Password + password to use against metastore database + + password + false + + + + + javax.jdo.option.ConnectionURL + jdbc:mysql://localhost/hive?createDatabaseIfNotExist=true + Database URL + JDBC connect string for a JDBC metastore + + false + + + + hive_database + hive-env + + + ambari.hive.db.schema.name + hive-site + + + + + + hive.metastore.server.max.threads + 100000 + Maximum number of worker threads in the Thrift server's pool. + + + + hive.metastore.warehouse.dir + /apps/hive/warehouse + location of default database for the warehouse + + + + hive.metastore.sasl.enabled + false + If true, the metastore thrift interface will be secured with SASL. + Clients must authenticate with Kerberos. + + + + hive.metastore.kerberos.keytab.file + /etc/security/keytabs/hive.service.keytab + The path to the Kerberos Keytab file containing the metastore Thrift server's service principal. + + + + hive.metastore.kerberos.principal + hive/_HOST@EXAMPLE.COM + + The service principal for the metastore Thrift server. + The special string _HOST will be replaced automatically with the correct host name. + + + + + hive.cluster.delegation.token.store.zookeeper.znode + /hive/cluster/delegation + The root path for token store data. + + + + hive.metastore.cache.pinobjtypes + Table,Database,Type,FieldSchema,Order + List of comma separated metastore object types that should be pinned in the cache + + + + hive.metastore.uris + thrift://localhost:9083 + Thrift URI for the remote metastore. Used by metastore client to connect to remote metastore. + + + + hive.metastore.pre.event.listeners + org.apache.hadoop.hive.ql.security.authorization.AuthorizationPreEventListener + List of comma separated listeners for metastore events. + + + + hive.metastore.authorization.storage.checks + false + + Should the metastore do authorization checks against the underlying storage (usually hdfs) + for operations like drop-partition (disallow the drop-partition if the user in + question doesn't have permissions to delete the corresponding directory + on the storage). + + + + + datanucleus.autoCreateSchema + false + + + + datanucleus.fixedDatastore + true + + + + hive.metastore.client.socket.timeout + 1800s + + Expects a time value with unit (d/day, h/hour, m/min, s/sec, ms/msec, us/usec, ns/nsec), which is sec if not specified. + MetaStore Client socket timeout in seconds + + + + + hive.metastore.execute.setugi + true + In unsecure mode, setting this property to true will cause the metastore to execute DFS operations using the client's reported user and group permissions. Note that this property must be set on both the client and server sides. Further note that its best effort. If client sets its to true and server sets it to false, client setting will be ignored. + + + + hive.security.authorization.enabled + false + enable or disable the Hive client authorization + Enable Authorization + + value-list + + + true + + + + false + + + + 1 + + + + hive-env + hive_security_authorization + + + + + + hive.security.authorization.manager + org.apache.hadoop.hive.ql.security.authorization.plugin.sqlstd.SQLStdConfOnlyAuthorizerFactory + + The Hive client authorization manager class name. The user defined authorization class should implement + interface org.apache.hadoop.hive.ql.security.authorization.HiveAuthorizationProvider. + + + + hive-env + hive_security_authorization + + + + + + hive.cluster.delegation.token.store.class + org.apache.hadoop.hive.thrift.ZooKeeperTokenStore + The delegation token store implementation. + Set to org.apache.hadoop.hive.thrift.ZooKeeperTokenStore for load-balanced cluster. + + + + hive.cluster.delegation.token.store.zookeeper.connectString + localhost:2181 + The ZooKeeper token store connect string. + + + + hive.security.metastore.authorization.auth.reads + true + If this is true, metastore authorizer authorizes read actions on database, table + + + + hive.server2.logging.operation.log.location + /tmp/hive/operation_logs + Top level directory where operation logs are stored if logging functionality is enabled + + + + hive.server2.logging.operation.enabled + true + When true, HS2 will save operation logs + + + + hive.security.metastore.authenticator.manager + org.apache.hadoop.hive.ql.security.HadoopDefaultMetastoreAuthenticator + + authenticator manager class name to be used in the metastore for authentication. + The user defined authenticator should implement interface org.apache.hadoop.hive.ql.security.HiveAuthenticationProvider. + + + + + hive.security.metastore.authorization.manager + Hive Authorization Manager + org.apache.hadoop.hive.ql.security.authorization.StorageBasedAuthorizationProvider + + authorization manager class name to be used in the metastore for authorization. + The user defined authorization class should implement interface + org.apache.hadoop.hive.ql.security.authorization.HiveMetastoreAuthorizationProvider. + + + + hive-env + hive_security_authorization + + + + + + hive.security.authenticator.manager + org.apache.hadoop.hive.ql.security.ProxyUserAuthenticator + + hive client authenticator manager class name. The user defined authenticator should implement + interface org.apache.hadoop.hive.ql.security.HiveAuthenticationProvider. + + + + hive-env + hive_security_authorization + + + + + + hive.server2.enable.doAs + true + + Setting this property to true will have HiveServer2 execute + Hive operations as the user making the calls to it. + + Run as end user instead of Hive user + + value-list + + + true + + + + false + + + + 1 + + + + hive-env + hive_security_authorization + + + + + + hive.user.install.directory + /user/ + + If hive (in tez mode only) cannot find a usable hive jar in "hive.jar.directory", + it will upload the hive jar to "hive.user.install.directory/user.name" + and use it to run queries. + + + + + hive.conf.restricted.list + hive.security.authenticator.manager,hive.security.authorization.manager,hive.users.in.admin.role + Comma separated list of configuration options which are immutable at runtime + + + + hive.server2.use.SSL + false + + Use SSL + + value-list + + + true + + + + false + + + + 1 + + + + + hive.server2.table.type.mapping + CLASSIC + + Expects one of [classic, hive]. + This setting reflects how HiveServer2 will report the table types for JDBC and other + client implementations that retrieve the available tables and supported table types + HIVE : Exposes Hive's native table types like MANAGED_TABLE, EXTERNAL_TABLE, VIRTUAL_VIEW + CLASSIC : More generic types like TABLE and VIEW + + + + + hive.server2.enable.impersonation + true + true + Enable user impersonation for HiveServer2 + + + + fs.hdfs.impl.disable.cache + true + Disable HDFS filesystem cache. + + + + fs.file.impl.disable.cache + true + Disable local filesystem cache. + + + + hive.exec.scratchdir + /tmp/hive + HDFS root scratch dir for Hive jobs which gets created with write all (733) permission. For each connecting user, an HDFS scratch dir: ${hive.exec.scratchdir}/<username> is created, with ${hive.scratch.dir.permission}. + + + + hive.exec.submit.local.task.via.child + true + + Determines whether local tasks (typically mapjoin hashtable generation phase) runs in + separate JVM (true recommended) or not. + Avoids the overhead of spawning new JVM, but can lead to out-of-memory issues. + + + + + hive.exec.compress.intermediate + false + + This controls whether intermediate files produced by Hive between multiple map-reduce jobs are compressed. + The compression codec and other options are determined from Hadoop config variables mapred.output.compress* + + + + + hive.exec.reducers.bytes.per.reducer + 67108864 + Defines the size per reducer. For example, if it is set to 64M, given 256M input size, 4 reducers will be used. + Data per Reducer + + int + 64 + 4294967296 + B + + + + + + hive.exec.reducers.max + 1009 + + max number of reducers will be used. If the one specified in the configuration parameter mapred.reduce.tasks is + negative, Hive will use this one as the max number of reducers when automatically determine number of reducers. + + + + + hive.exec.compress.output + false + + This controls whether the final outputs of a query (to a local/HDFS file or a Hive table) is compressed. + The compression codec and other options are determined from Hadoop config variables mapred.output.compress* + + + + + hive.exec.submitviachild + false + + + + + hive.enforce.bucketing + true + Whether bucketing is enforced. If true, while inserting into the table, bucketing is enforced. + Enforce bucketing + + value-list + + + true + + + + false + + + + 1 + + + + hive-env + hive_txn_acid + + + + + + hive.enforce.sorting + true + Whether sorting is enforced. If true, while inserting into the table, sorting is enforced. + + + + hive.enforce.sortmergebucketmapjoin + true + If the user asked for sort-merge bucketed map-side join, and it cannot be performed, should the query fail or not ? + + + + hive.map.aggr + true + Whether to use map-side aggregation in Hive Group By queries + + + + hive.mapjoin.optimized.hashtable + true + + Whether Hive should use memory-optimized hash table for MapJoin. Only works on Tez, + because memory-optimized hashtable cannot be serialized. + + + + + hive.smbjoin.cache.rows + 10000 + How many rows with the same key value should be cached in memory per smb joined table. + + + + hive.map.aggr.hash.percentmemory + 0.5 + Portion of total memory to be used by map-side group aggregation hash table + + + + hive.map.aggr.hash.force.flush.memory.threshold + 0.9 + + The max memory to be used by map-side group aggregation hash table. + If the memory usage is higher than this number, force to flush data + + + + + hive.map.aggr.hash.min.reduction + 0.5 + + Hash aggregation will be turned off if the ratio between hash table size and input rows is bigger than this number. + Set to 1 to make sure hash aggregation is never turned off. + + + + + hive.merge.mapfiles + true + Merge small files at the end of a map-only job + + + + hive.merge.mapredfiles + false + Merge small files at the end of a map-reduce job + + + + hive.merge.tezfiles + false + Merge small files at the end of a Tez DAG + + + + hive.merge.size.per.task + 256000000 + Size of merged files at the end of the job + + + + hive.merge.smallfiles.avgsize + 16000000 + + When the average output file size of a job is less than this number, Hive will start an additional + map-reduce job to merge the output files into bigger files. This is only done for map-only jobs + if hive.merge.mapfiles is true, and for map-reduce jobs if hive.merge.mapredfiles is true. + + + + + hive.merge.rcfile.block.level + true + + + + + hive.merge.orcfile.stripe.level + true + + When hive.merge.mapfiles or hive.merge.mapredfiles is enabled while writing a + table with ORC file format, enabling this config will do stripe level fast merge + for small ORC files. Note that enabling this config will not honor padding tolerance + config (hive.exec.orc.block.padding.tolerance). + + + + + hive.exec.orc.default.stripe.size + 67108864 + Define the default ORC stripe size + Default ORC Stripe Size + + int + 8388608 + 268435456 + B + 8388608 + + + + + hive.optimize.bucketmapjoin + true + Whether to try bucket mapjoin + + + + hive.optimize.bucketmapjoin.sortedmerge + false + If the tables being joined are sorted and bucketized on the join columns, and they have the same number + of buckets, a sort-merge join can be performed by setting this parameter as true. + + + + + hive.mapred.reduce.tasks.speculative.execution + false + Whether speculative execution for reducers should be turned on. + + + + hive.exec.dynamic.partition + true + Whether or not to allow dynamic partitions in DML/DDL. + + + + hive.exec.dynamic.partition.mode + nonstrict + + In strict mode, the user must specify at least one static partition + in case the user accidentally overwrites all partitions. + NonStrict allows all partitions of a table to be dynamic. + + Allow all partitions to be Dynamic + + value-list + + + nonstrict + + + + strict + + + + 1 + + + + hive-env + hive_txn_acid + + + + + + hive.exec.max.dynamic.partitions.pernode + 2000 + Maximum number of dynamic partitions allowed to be created in each mapper/reducer node. + + + + hive.exec.max.created.files + 100000 + Maximum number of HDFS files created by all mappers/reducers in a MapReduce job. + + + + hive.exec.max.dynamic.partitions + 5000 + Maximum number of dynamic partitions allowed to be created in total. + + + + hive.auto.convert.join + true + Whether Hive enables the optimization about converting common join into mapjoin based on the input file size + + + + hive.auto.convert.sortmerge.join + true + Will the join be automatically converted to a sort-merge join, if the joined tables pass the criteria for sort-merge join. + + + + hive.optimize.constant.propagation + true + Whether to enable constant propagation optimizer + + + + hive.optimize.metadataonly + true + + + + + hive.optimize.null.scan + true + Dont scan relations which are guaranteed to not generate any rows + + + + hive.auto.convert.sortmerge.join.to.mapjoin + false + + If hive.auto.convert.sortmerge.join is set to true, and a join was converted to a sort-merge join, + this parameter decides whether each table should be tried as a big table, and effectively a map-join should be + tried. That would create a conditional task with n+1 children for a n-way join (1 child for each table as the + big table), and the backup task will be the sort-merge join. In some cases, a map-join would be faster than a + sort-merge join, if there is no advantage of having the output bucketed and sorted. For example, if a very big sorted + and bucketed table with few files (say 10 files) are being joined with a very small sorter and bucketed table + with few files (10 files), the sort-merge join will only use 10 mappers, and a simple map-only join might be faster + if the complete small table can fit in memory, and a map-join can be performed. + + + + + hive.auto.convert.sortmerge.join.noconditionaltask + true + true + Required to Enable the conversion of an SMB (Sort-Merge-Bucket) to a map-join SMB. + + + + hive.auto.convert.join.noconditionaltask + true + + Whether Hive enables the optimization about converting common join into mapjoin based on the input file size. + If this parameter is on, and the sum of size for n-1 of the tables/partitions for a n-way join is smaller than the + specified size, the join is directly converted to a mapjoin (there is no conditional task). + + + + + hive.auto.convert.join.noconditionaltask.size + 52428800 + If hive.auto.convert.join.noconditionaltask is off, this parameter does not take affect. However, if it + is on, and the sum of size for n-1 of the tables/partitions for a n-way join is smaller than this size, the join is directly + converted to a mapjoin(there is no conditional task). + + For Map Join, per Map memory threshold + + int + 8192 + 17179869184 + B + + + + + hive-site + hive.tez.container.size + + + + + + hive.optimize.reducededuplication.min.reducer + 4 + + Reduce deduplication merges two RSs by moving key/parts/reducer-num of the child RS to parent RS. + That means if reducer-num of the child RS is fixed (order by or forced bucketing) and small, it can make very slow, single MR. + The optimization will be automatically disabled if number of reducers would be less than specified value. + + + + + hive.optimize.sort.dynamic.partition + false + + When enabled dynamic partitioning column will be globally sorted. + This way we can keep only one record writer open for each partition value + in the reducer thereby reducing the memory pressure on reducers. + + Sort Partitions Dynamically + + value-list + + + true + + + + false + + + + 1 + + + + + hive.optimize.mapjoin.mapreduce + true + true + If hive.auto.convert.join is off, this parameter does not take + affect. If it is on, and if there are map-join jobs followed by a map-reduce + job (for e.g a group by), each map-only job is merged with the following + map-reduce job. + + + + + hive.mapjoin.bucket.cache.size + 10000 + + Size per reducer.The default is 1G, i.e if the input size is 10G, it + will use 10 reducers. + + + + + hive.vectorized.execution.enabled + true + + This flag should be set to true to enable vectorized mode of query execution. + The default value is false. + + Enable Vectorization and Map Vectorization + + value-list + + + true + + + + false + + + + 1 + + + + + hive.optimize.reducededuplication + true + + Remove extra map-reduce jobs if the data is already clustered by the same key which needs to be used again. + This should always be set to true. Since it is a new feature, it has been made configurable. + + + + + hive.optimize.index.filter + true + Whether to enable automatic use of indexes + Push Filters to Storage + + value-list + + + true + + + + false + + + + 1 + + + + + hive.execution.engine + tez + + Expects one of [mr, tez]. + Chooses execution engine. Options are: mr (Map reduce, default) or tez (hadoop 2 only) + + Execution Engine + + value-list + + + mr + + + + tez + + + + 1 + + + + + hive.exec.post.hooks + org.apache.hadoop.hive.ql.hooks.ATSHook + + Comma-separated list of post-execution hooks to be invoked for each statement. + A post-execution hook is specified as the name of a Java class which implements the + org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext interface. + + + + hive-env + hive_timeline_logging_enabled + + + application-properties + atlas.server.http.port + + + application-properties + atlas.server.https.port + + + + + + hive.exec.pre.hooks + org.apache.hadoop.hive.ql.hooks.ATSHook + + Comma-separated list of pre-execution hooks to be invoked for each statement. + A pre-execution hook is specified as the name of a Java class which implements the + org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext interface. + + + + hive-env + hive_timeline_logging_enabled + + + + + + hive.exec.failure.hooks + org.apache.hadoop.hive.ql.hooks.ATSHook + + Comma-separated list of on-failure hooks to be invoked for each statement. + An on-failure hook is specified as the name of Java class which implements the + org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext interface. + + + + hive-env + hive_timeline_logging_enabled + + + + + + hive.exec.parallel + false + Whether to execute jobs in parallel + + + + hive.exec.parallel.thread.number + 8 + How many jobs at most can be executed in parallel + + + + hive.vectorized.groupby.maxentries + 100000 + + Max number of entries in the vector group by aggregation hashtables. + Exceeding this will trigger a flush irrelevant of memory pressure condition. + + + + + hive.tez.smb.number.waves + 0.5 + The number of waves in which to run the SMB join. Account for cluster being occupied. Ideally should be 1 wave. + + + + hive.tez.dynamic.partition.pruning.max.data.size + 104857600 + Maximum total data size of events in dynamic pruning. + + + + hive.tez.dynamic.partition.pruning.max.event.size + 1048576 + Maximum size of events sent by processors in dynamic pruning. If this size is crossed no pruning will take place. + + + + hive.tez.dynamic.partition.pruning + true + When dynamic pruning is enabled, joins on partition keys will be processed by sending events from the processing vertices to the tez application master. These events will be used to prune unnecessary partitions. + Allow dynamic partition pruning + + value-list + + + true + + + + false + + + + 1 + + + + + hive.tez.min.partition.factor + 0.25 + + When auto reducer parallelism is enabled this factor will be used to put a lower limit to the number + of reducers that tez specifies. + + + + + hive.tez.max.partition.factor + 2.0 + When auto reducer parallelism is enabled this factor will be used to over-partition data in shuffle edges. + + + + hive.tez.auto.reducer.parallelism + false + + Turn on Tez' auto reducer parallelism feature. When enabled, Hive will still estimate data sizes + and set parallelism estimates. Tez will sample source vertices' output sizes and adjust the estimates at runtime as + necessary. + + Allow dynamic numbers of reducers + + value-list + + + true + + + + false + + + + 1 + + + + + hive.convert.join.bucket.mapjoin.tez + false + + Whether joins can be automatically converted to bucket map joins in hive + when tez is used as the execution engine. + + + + + hive.prewarm.numcontainers + 3 + Controls the number of containers to prewarm for Tez (Hadoop 2 only) + Number of Containers Held + + int + 1 + 20 + 1 + + + + + hive.prewarm.enabled + false + Enables container prewarm for Tez (Hadoop 2 only) + Hold Containers to Reduce Latency + + value-list + + + true + + + + false + + + + 1 + + + + + hive.vectorized.groupby.checkinterval + 4096 + Number of entries added to the group by aggregation hash before a recomputation of average entry size is performed. + + + + hive.vectorized.groupby.flush.percent + 0.1 + Percent of entries in the group by aggregation hash flushed when the memory threshold is exceeded. + + + + hive.stats.autogather + true + A flag to gather statistics automatically during the INSERT OVERWRITE command. + + + + hive.stats.dbclass + fs + + Expects one of the pattern in [jdbc(:.*), hbase, counter, custom, fs]. + The storage that stores temporary Hive statistics. Currently, jdbc, hbase, counter and custom type are supported. + + + + + hive.stats.fetch.partition.stats + true + + Annotation of operator tree with statistics information requires partition level basic + statistics like number of rows, data size and file size. Partition statistics are fetched from + metastore. Fetching partition statistics for each needed partition can be expensive when the + number of partitions is high. This flag can be used to disable fetching of partition statistics + from metastore. When this flag is disabled, Hive will make calls to filesystem to get file sizes + and will estimate the number of rows from row schema. + + Fetch partition stats at compiler + + value-list + + + true + + + + false + + + + 1 + + + + hive-site + hive.cbo.enable + + + + + + hive.zookeeper.client.port + 2181 + The port of ZooKeeper servers to talk to. If the list of Zookeeper servers specified in hive.zookeeper.quorum,does not contain port numbers, this value is used. + + + + hive.zookeeper.namespace + hive_zookeeper_namespace + The parent node under which all ZooKeeper nodes are created. + + + + hive.stats.fetch.column.stats + false + + Annotation of operator tree with statistics information requires column statistics. + Column statistics are fetched from metastore. Fetching column statistics for each needed column + can be expensive when the number of columns is high. This flag can be used to disable fetching + of column statistics from metastore. + + Fetch column stats at compiler + + value-list + + + true + + + + false + + + + 1 + + + + hive-site + hive.cbo.enable + + + + + + hive.tez.container.size + 682 + By default, Tez uses the java options from map tasks. Use this property to override that value. + Tez Container Size + + int + 682 + 6820 + MB + 682 + + + + yarn-site + yarn.scheduler.minimum-allocation-mb + + + yarn-site + yarn.scheduler.maximum-allocation-mb + + + + + + hive.tez.input.format + org.apache.hadoop.hive.ql.io.HiveInputFormat + The default input format for Tez. Tez groups splits in the Application Master. + + + + hive.tez.java.opts + -server -Xmx545m -Djava.net.preferIPv4Stack=true -XX:NewRatio=8 -XX:+UseNUMA -XX:+UseParallelGC -XX:+PrintGCDetails -verbose:gc -XX:+PrintGCTimeStamps + Java command line options for Tez. The -Xmx parameter value is generally 80% of hive.tez.container.size. + + + + hive.compute.query.using.stats + true + + When set to true Hive will answer a few queries like count(1) purely using stats + stored in metastore. For basic stats collection turn on the config hive.stats.autogather to true. + For more advanced stats collection need to run analyze table queries. + + Compute simple queries using stats only + + value-list + + + true + + + + false + + + + 1 + + + + hive-site + hive.cbo.enable + + + + + + hive.exec.orc.default.compress + ZLIB + Define the default compression codec for ORC file + ORC Compression Algorithm + + value-list + + + ZLIB + + + + SNAPPY + + + + 1 + + + + + hive.orc.splits.include.file.footer + false + + If turned on splits generated by orc will include metadata about the stripes in the file. This + data is read remotely (from the client or HS2 machine) and sent to all the tasks. + + + + + hive.orc.compute.splits.num.threads + 10 + How many threads orc should use to create splits in parallel. + + + + hive.limit.optimize.enable + true + Whether to enable to optimization to trying a smaller subset of data for simple LIMIT first. + + + + hive.tez.cpu.vcores + -1 + By default Tez will ask for however many cpus map-reduce is configured to use per container. This can be used to overwrite. + + + + hive.tez.log.level + INFO + + The log level to use for tasks executing as part of the DAG. + Used only if hive.tez.java.opts is used to configure Java options. + + + + + hive.limit.pushdown.memory.usage + 0.04 + The max memory to be used for hash in RS operator for top K selection. + + + + hive.exec.orc.encoding.strategy + SPEED + + Define the encoding strategy to use while writing data. Changing this + will only affect the light weight encoding for integers. This flag will not change + the compression level of higher level compression codec (like ZLIB). Possible + options are SPEED and COMPRESSION. + + ORC Encoding Strategy + + value-list + + + SPEED + + + + COMPRESSION + + + + 1 + + + + hive-env + hive_exec_orc_storage_strategy + + + + + + hive.exec.orc.compression.strategy + SPEED + + Define the compression strategy to use while writing data. This changes the + compression level of higher level compression codec (like ZLIB). + + ORC Compression Strategy + + value-list + + + SPEED + + + + COMPRESSION + + + + 1 + + + + hive-env + hive_exec_orc_storage_strategy + + + + + + hive.vectorized.execution.reduce.enabled + false + + This flag should be set to true to enable vectorized mode of the reduce-side of + query execution. + + Enable Reduce Vectorization + + value-list + + + true + + + + false + + + + 1 + + + + + hive.server2.authentication.ldap.baseDN + DONT_ADD_ON_UPGRADE + + + hive-site + hive.server2.authentication + + + + + + hive.server2.authentication.kerberos.principal + hive/_HOST@EXAMPLE.COM + DONT_ADD_ON_UPGRADE + + + hive-site + hive.server2.authentication + + + + + + hive.server2.custom.authentication.class + DONT_ADD_ON_UPGRADE + + + hive-site + hive.server2.authentication + + + + + + hive.server2.authentication.kerberos.keytab + /etc/security/keytabs/hive.service.keytab + DONT_ADD_ON_UPGRADE + + + hive-site + hive.server2.authentication + + + + + + hive.server2.authentication.ldap.url + + DONT_ADD_ON_UPGRADE + + + hive-site + hive.server2.authentication + + + + + + hive.server2.tez.default.queues + Default query queues + default + + A list of comma separated values corresponding to YARN queues of the same name. + When HiveServer2 is launched in Tez mode, this configuration needs to be set + for multiple Tez sessions to run in parallel on the cluster. + + + combo + + + default + + + + 1+ + + + + capacity-scheduler + yarn.scheduler.capacity.root.queues + + + + + + hive.server2.tez.sessions.per.default.queue + 1 + + A positive integer that determines the number of Tez sessions that should be + launched on each of the queues specified by "hive.server2.tez.default.queues". + Determines the parallelism on each queue. + + Session per queue + + int + 1 + 10 + 1 + + + + + hive.server2.tez.initialize.default.sessions + false + + This flag is used in HiveServer2 to enable a user to use HiveServer2 without + turning on Tez for HiveServer2. The user could potentially want to run queries + over Tez without the pool of sessions. + + Start Tez session at Initialization + + value-list + + + true + + + + false + + + + 1 + + + + + hive.txn.manager + org.apache.hadoop.hive.ql.lockmgr.DummyTxnManager + + Transaction Manager + + value-list + + + org.apache.hadoop.hive.ql.lockmgr.DummyTxnManager + + + + org.apache.hadoop.hive.ql.lockmgr.DbTxnManager + + + + 1 + + + + hive-env + hive_txn_acid + + + + + + hive.txn.timeout + 300 + Time after which transactions are declared aborted if the client has not sent a heartbeat, in seconds. + + + + hive.txn.max.open.batch + 1000 + + Maximum number of transactions that can be fetched in one call to open_txns(). + Increasing this will decrease the number of delta files created when + streaming data into Hive. But it will also increase the number of + open transactions at any given time, possibly impacting read performance. + + + + + hive.cli.print.header + false + + Whether to print the names of the columns in query output. + + + + + hive.support.concurrency + false + + Support concurrency and use locks, needed for Transactions. Requires Zookeeper. + + Use Locking + + value-list + + + true + + + + false + + + + 1 + + + + hive-env + hive_txn_acid + + + + + + hive.compactor.initiator.on + false + Whether to run the compactor's initiator thread in this metastore instance or not. If there is more than one instance of the thrift metastore this should be set to true on only one instance. Setting true on only one host can be achieved by creating a config-group containing the metastore host, and overriding the default value to true in it. + Run Compactor + + value-list + + + true + + + + false + + + + 1 + + + + hive-env + hive_txn_acid + + + + + + hive.compactor.worker.threads + 0 + Number of compactor worker threads to run on this metastore instance. Can be different values on different metastore instances. + Number of threads used by Compactor + + int + 0 + 20 + 1 + + + + hive-env + hive_txn_acid + + + + + + hive.compactor.worker.timeout + 86400L + + Expects a time value with unit (d/day, h/hour, m/min, s/sec, ms/msec, us/usec, ns/nsec), which is sec if not specified. + Time before a given compaction in working state is declared a failure + and returned to the initiated state. + + + + + hive.compactor.check.interval + 300L + + Expects a time value with unit (d/day, h/hour, m/min, s/sec, ms/msec, us/usec, ns/nsec), which is sec if not specified. + Time between checks to see if any partitions need compacted. + This should be kept high because each check for compaction requires many calls against the NameNode. + + + + + hive.fetch.task.conversion + more + + Expects one of [none, minimal, more]. + Some select queries can be converted to single FETCH task minimizing latency. + Currently the query should be single sourced not having any subquery and should not have + any aggregations or distincts (which incurs RS), lateral views and joins. + 0. none : disable hive.fetch.task.conversion + 1. minimal : SELECT STAR, FILTER on partition columns, LIMIT only + 2. more : SELECT, FILTER, LIMIT only (support TABLESAMPLE and virtual columns) + + + + + hive.fetch.task.aggr + false + + Aggregation queries with no group-by clause (for example, select count(*) from src) execute + final aggregations in single reduce task. If this is set true, Hive delegates final aggregation + stage to fetch task, possibly decreasing the query time. + + + + + hive.fetch.task.conversion.threshold + 1073741824 + + Input threshold for applying hive.fetch.task.conversion. If target table is native, input length + is calculated by summation of file lengths. If it's not native, storage handler for the table + can optionally implement org.apache.hadoop.hive.ql.metadata.InputEstimator interface. + + + + + hive.compactor.delta.num.threshold + 10 + Number of delta files that must exist in a directory before the compactor will attempt a minor compaction. + + + + hive.compactor.delta.pct.threshold + 0.1f + Percentage (by size) of base that deltas can be before major compaction is initiated. + + + + hive.compactor.abortedtxn.threshold + 1000 + Number of aborted transactions involving a particular table or partition before major compaction is initiated. + + + + datanucleus.cache.level2.type + none + Determines caching mechanism DataNucleus L2 cache will use. It is strongly recommended to use default value of 'none' as other values may cause consistency errors in Hive. + + + + hive.server2.thrift.port + 10500 + HiveServer2 Port + + TCP port number to listen on, default 10000. + + + false + int + + + + + hive.server2.allow.user.substitution + true + Allow alternate user to be specified as part of HiveServer2 open connection request. + + + + hive.server2.thrift.max.worker.threads + 500 + Maximum number of Thrift worker threads + + + + hive.server2.thrift.sasl.qop + auth + + Expects one of [auth, auth-int, auth-conf]. + Sasl QOP value; Set it to one of following values to enable higher levels of + protection for HiveServer2 communication with clients. + "auth" - authentication only (default) + "auth-int" - authentication plus integrity protection + "auth-conf" - authentication plus integrity and confidentiality protection + This is applicable only if HiveServer2 is configured to use Kerberos authentication. + + + + + hive.server2.authentication.spnego.principal + /etc/security/keytabs/spnego.service.keytab + + SPNego service principal, optional, + typical value would look like HTTP/_HOST@EXAMPLE.COM + SPNego service principal would be used by HiveServer2 when Kerberos security is enabled + and HTTP transport mode is used. + This needs to be set only if SPNEGO is to be used in authentication. + + + + + hive.server2.authentication.spnego.keytab + HTTP/_HOST@EXAMPLE.COM + + keytab file for SPNego principal, optional, + typical value would look like /etc/security/keytabs/spnego.service.keytab, + This keytab would be used by HiveServer2 when Kerberos security is enabled and + HTTP transport mode is used. + This needs to be set only if SPNEGO is to be used in authentication. + SPNego authentication would be honored only if valid + hive.server2.authentication.spnego.principal + and + hive.server2.authentication.spnego.keytab + are specified. + + + + + hive.server2.authentication + Authentication mode, default NONE. Options are NONE, NOSASL, KERBEROS, LDAP, PAM and CUSTOM + NONE + HiveServer2 Authentication + + value-list + + + NONE + + + + LDAP + + + + KERBEROS + + + + PAM + + + + CUSTOM + + + + 1 + + + + + hive.server2.support.dynamic.service.discovery + true + Whether HiveServer2 supports dynamic service discovery for its clients. + To support this, each instance of HiveServer2 currently uses ZooKeeper to register itself, + when it is brought up. JDBC/ODBC clients should use the ZooKeeper ensemble: hive.zookeeper.quorum + in their connection string. + + + boolean + + + + + hive.server2.zookeeper.namespace + hiveserver2 + The parent node in ZooKeeper used by HiveServer2 when supporting dynamic service discovery. + + + + hive.server2.thrift.http.port + 10501 + Port number of HiveServer2 Thrift interface when hive.server2.transport.mode is 'http'. + + + + hive.server2.transport.mode + binary + + Expects one of [binary, http]. + Transport mode of HiveServer2. + + + + + hive.default.fileformat + TextFile + Default file format for CREATE TABLE statement. + Default File Format + + value-list + + + ORC + The Optimized Row Columnar (ORC) file format provides a highly efficient way to store Hive data. It was designed to overcome limitations of the other Hive file formats. Using ORC files improves performance when Hive is reading, writing, and processing data. + + + TextFile + Text file format saves Hive data as normal text. + + + + + + + atlas.cluster.name + primary + + + application-properties + atlas.enableTLS + + + + + + atlas.rest.address + http://localhost:21000 + + + application-properties + atlas.enableTLS + + + application-properties + atlas.server.http.port + + + application-properties + atlas.server.https.port + + + + + + hive.default.fileformat.managed + TextFile + + Default file format for CREATE TABLE statement applied to managed tables only. + External tables will be created with default file format. Leaving this null + will result in using the default file format for all tables. + + + + + datanucleus.rdbms.datastoreAdapterClassName + Datanucleus Class, This property used only when hive db is SQL Anywhere + + + hive-env + hive_database + + + + + + atlas.hook.hive.maxThreads + 1 + + Maximum number of threads used by Atlas hook. + + + + + atlas.hook.hive.minThreads + 1 + + Minimum number of threads maintained by Atlas hook. + + + + \ No newline at end of file http://git-wip-us.apache.org/repos/asf/ambari/blob/037d9338/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/llap-daemon-log4j.xml ---------------------------------------------------------------------- diff --git a/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/llap-daemon-log4j.xml b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/llap-daemon-log4j.xml new file mode 100644 index 0000000..1c60285 --- /dev/null +++ b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/llap-daemon-log4j.xml @@ -0,0 +1,126 @@ + + + + + + + content + Custom log4j.properties + +# Licensed to the Apache Software Foundation (ASF) under one +# or more contributor license agreements. See the NOTICE file +# distributed with this work for additional information +# regarding copyright ownership. The ASF licenses this file +# to you under the Apache License, Version 2.0 (the +# "License"); you may not use this file except in compliance +# with the License. You may obtain a copy of the License at +# +# http://www.apache.org/licenses/LICENSE-2.0 +# +# Unless required by applicable law or agreed to in writing, software +# distributed under the License is distributed on an "AS IS" BASIS, +# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. +# See the License for the specific language governing permissions and +# limitations under the License. +status = WARN +name = LlapDaemonLog4j2 +packages = org.apache.hadoop.hive.ql.log + +# list of properties +property.llap.daemon.log.level = WARN +property.llap.daemon.root.logger = console +property.llap.daemon.log.dir = . +property.llap.daemon.log.file = llapdaemon.log +property.llap.daemon.historylog.file = llapdaemon_history.log +property.llap.daemon.log.maxfilesize = 256MB +property.llap.daemon.log.maxbackupindex = 20 + +# list of all appenders +appenders = console, RFA, HISTORYAPPENDER + +# console appender +appender.console.type = Console +appender.console.name = console +appender.console.target = SYSTEM_ERR +appender.console.layout.type = PatternLayout +appender.console.layout.pattern = %d{yy/MM/dd HH:mm:ss} [%t%x] %p %c{2} : %m%n + +# rolling file appender +appender.RFA.type = RollingFile +appender.RFA.name = RFA +appender.RFA.fileName = ${sys:llap.daemon.log.dir}/${sys:llap.daemon.log.file} +appender.RFA.filePattern = ${sys:llap.daemon.log.dir}/${sys:llap.daemon.log.file}_%i +appender.RFA.layout.type = PatternLayout +appender.RFA.layout.pattern = %d{ISO8601} %-5p [%t%x]: %c{2} (%F:%M(%L)) - %m%n +appender.RFA.policies.type = Policies +appender.RFA.policies.size.type = SizeBasedTriggeringPolicy +appender.RFA.policies.size.size = ${sys:llap.daemon.log.maxfilesize} +appender.RFA.strategy.type = DefaultRolloverStrategy +appender.RFA.strategy.max = ${sys:llap.daemon.log.maxbackupindex} + +# history file appender +appender.HISTORYAPPENDER.type = RollingFile +appender.HISTORYAPPENDER.name = HISTORYAPPENDER +appender.HISTORYAPPENDER.fileName = ${sys:llap.daemon.log.dir}/${sys:llap.daemon.historylog.file} +appender.HISTORYAPPENDER.filePattern = ${sys:llap.daemon.log.dir}/${sys:llap.daemon.historylog.file}_%i +appender.HISTORYAPPENDER.layout.type = PatternLayout +appender.HISTORYAPPENDER.layout.pattern = %m%n +appender.HISTORYAPPENDER.policies.type = Policies +appender.HISTORYAPPENDER.policies.size.type = SizeBasedTriggeringPolicy +appender.HISTORYAPPENDER.policies.size.size = ${sys:llap.daemon.log.maxfilesize} +appender.HISTORYAPPENDER.strategy.type = DefaultRolloverStrategy +appender.HISTORYAPPENDER.strategy.max = ${sys:llap.daemon.log.maxbackupindex} + +# list of all loggers +loggers = NIOServerCnxn, ClientCnxnSocketNIO, DataNucleus, Datastore, JPOX, HistoryLogger + +logger.NIOServerCnxn.name = org.apache.zookeeper.server.NIOServerCnxn +logger.NIOServerCnxn.level = WARN + +logger.ClientCnxnSocketNIO.name = org.apache.zookeeper.ClientCnxnSocketNIO +logger.ClientCnxnSocketNIO.level = WARN + +logger.DataNucleus.name = DataNucleus +logger.DataNucleus.level = ERROR + +logger.Datastore.name = Datastore +logger.Datastore.level = ERROR + +logger.JPOX.name = JPOX +logger.JPOX.level = ERROR + +logger.HistoryLogger.name = org.apache.hadoop.hive.llap.daemon.HistoryLogger +logger.HistoryLogger.level = WARN +logger.HistoryLogger.additivity = false +logger.HistoryLogger.appenderRefs = HistoryAppender +logger.HistoryLogger.appenderRef.HistoryAppender.ref = HISTORYAPPENDER + +# root logger +rootLogger.level = ${sys:llap.daemon.log.level} +rootLogger.appenderRefs = root +rootLogger.appenderRef.root.ref = ${sys:llap.daemon.root.logger} + + + content + false + + + http://git-wip-us.apache.org/repos/asf/ambari/blob/037d9338/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/llap-env.xml ---------------------------------------------------------------------- diff --git a/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/llap-env.xml b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/llap-env.xml new file mode 100644 index 0000000..24a95cf --- /dev/null +++ b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/configuration/llap-env.xml @@ -0,0 +1,72 @@ + + + + + + enable_hive_interactive + false + Enable or disable interactive query in this cluster. + Enable Interactive Query + + value-list + + + true + + + + false + + + + 1 + + + + llap_queue_name + default + LLAP Queue Name. + LLAP Queue Name + + combo + + + default + + + + 1 + + + + llap_am_queue_name + default + LLAP AM Queue Name. + LLAP AM Queue Name + + combo + + + default + + + + 1 + + + \ No newline at end of file http://git-wip-us.apache.org/repos/asf/ambari/blob/037d9338/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/metainfo.xml ---------------------------------------------------------------------- diff --git a/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/metainfo.xml b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/metainfo.xml index 54c92d8..459845b 100644 --- a/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/metainfo.xml +++ b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/metainfo.xml @@ -21,6 +21,97 @@ HIVE 1.2.1.2.6 + + + HIVE_SERVER_INTERACTIVE + HiveServer2 Interactive + MASTER + 0-1 + true + + + + ZOOKEEPER/ZOOKEEPER_SERVER + cluster + + true + HIVE/HIVE_SERVER_INTERACTIVE + + + + YARN/YARN_CLIENT + host + + true + HIVE/HIVE_SERVER_INTERACTIVE + + + + HDFS/HDFS_CLIENT + host + + true + HIVE/HIVE_SERVER_INTERACTIVE + + + + MAPREDUCE2/MAPREDUCE2_CLIENT + host + + true + HIVE/HIVE_SERVER_INTERACTIVE + + + + TEZ/TEZ_CLIENT + host + + true + HIVE/HIVE_SERVER_INTERACTIVE + + + + PIG/PIG + host + + true + HIVE/HIVE_SERVER_INTERACTIVE + + + + SLIDER/SLIDER + host + + true + HIVE/HIVE_SERVER_INTERACTIVE + + + + + + PYTHON + + + hive-site + hive-interactive-site + + + + + + theme.json + true + + + + + ZOOKEEPER + HDFS + YARN + TEZ + PIG + SLIDER + - + \ No newline at end of file http://git-wip-us.apache.org/repos/asf/ambari/blob/037d9338/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/themes/theme.json ---------------------------------------------------------------------- diff --git a/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/themes/theme.json b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/themes/theme.json new file mode 100644 index 0000000..26b9532 --- /dev/null +++ b/ambari-server/src/main/resources/stacks/HDP/2.6/services/HIVE/themes/theme.json @@ -0,0 +1,76 @@ +{ + "configuration" : { + "placement" : { + "configs" : [ + { + "config": "llap-env/enable_hive_interactive", + "subsection-name": "interactive-query-row1-col1" + }, + { + "config": "llap-env/llap_queue_name", + "subsection-name": "interactive-query-row1-col1", + "depends-on": [ + { + "configs":[ + "llap-env/enable_hive_interactive" + ], + "if": "${llap-env/enable_hive_interactive}", + "then": { + "property_value_attributes": { + "visible": true + } + }, + "else": { + "property_value_attributes": { + "visible": false + } + } + } + ] + }, + { + "config": "llap-env/llap_am_queue_name", + "subsection-name": "interactive-query-row1-col1", + "depends-on": [ + { + "configs":[ + "llap-env/enable_hive_interactive" + ], + "if": "${llap-env/enable_hive_interactive}", + "then": { + "property_value_attributes": { + "visible": true + } + }, + "else": { + "property_value_attributes": { + "visible": false + } + } + } + ] + } + ] + }, + "widgets" : [ + { + "config": "llap-env/enable_hive_interactive", + "widget": { + "type": "toggle" + } + }, + { + "config": "llap-env/llap_queue_name", + "widget": { + "type": "list" + } + }, + { + "config": "llap-env/llap_am_queue_name", + "widget": { + "type": "list" + } + } + ] + } +} \ No newline at end of file