hive-user mailing list archives

From "Riesland, Zack" <>
Subject RE: How to Query running in background in tez
Date Mon, 29 Feb 2016 14:00:17 GMT
There is also a “TezViewInstance” view for Ambari that will help you drill into Tez queries
and see things like DAG counters, etc.

It didn’t work out of the box for us – some permissions had to be tweaked manually. But
it’s worth investigating.

From: Riesland, Zack
Sent: Monday, February 29, 2016 8:46 AM
To: ''
Subject: RE: How to Query running in background in tez

1) Your cluster should have a YARN ResourceManager UI, usually located at <ResourceManager
host>:8088/cluster. This will list all the running queries/applications and allow you to
drill in for statistics like mappers, reducers, etc.
2) You want to use the YARN capacity scheduler. You can create queues and allocate a % of
resources to each queue. The easiest way to use the capacity scheduler is to download the
Ambari view, assuming that you’re using Ambari. I’m not sure what the Cloudera equivalent is.
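
As a rough illustration of both points, here is a minimal Python sketch. It assumes that port 8088 is the YARN ResourceManager web address (the default), that the ResourceManager REST endpoint /ws/v1/cluster/apps is reachable, and that Hive-on-Tez jobs are reported with application type TEZ. The host name rm.example.com, the queue names "batch" and "adhoc", and the 70/30 split are hypothetical placeholders, not values from this thread.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def running_apps_url(rm_host, port=8088, app_type="TEZ"):
    """Build the ResourceManager REST URL that lists running applications."""
    query = urlencode({"states": "RUNNING", "applicationTypes": app_type})
    return f"http://{rm_host}:{port}/ws/v1/cluster/apps?{query}"


def summarize_apps(payload):
    """Return (id, name, queue, trackingUrl) for each app in a REST response.

    The ResourceManager returns {"apps": null} when nothing matches,
    so a missing or empty "apps" value yields an empty list.
    """
    apps = (payload.get("apps") or {}).get("app") or []
    return [(a["id"], a.get("name"), a.get("queue"), a.get("trackingUrl"))
            for a in apps]


def fetch_running_apps(rm_host, port=8088):
    """Query a live ResourceManager (needs network access to the cluster)."""
    with urlopen(running_apps_url(rm_host, port), timeout=10) as resp:
        return summarize_apps(json.load(resp))


def capacity_properties(queues):
    """Build capacity-scheduler.xml properties for queues directly under root.

    queues maps a hypothetical queue name to (capacity, maximum-capacity),
    both in percent. Sibling capacities must add up to 100.
    """
    if sum(cap for cap, _ in queues.values()) != 100:
        raise ValueError("queue capacities must sum to 100")
    props = {"yarn.scheduler.capacity.root.queues": ",".join(queues)}
    for name, (cap, max_cap) in queues.items():
        if max_cap < cap:
            raise ValueError(f"maximum-capacity below capacity for {name}")
        prefix = f"yarn.scheduler.capacity.root.{name}"
        props[f"{prefix}.capacity"] = str(cap)
        # maximum-capacity caps how far the queue may grow into idle resources.
        props[f"{prefix}.maximum-capacity"] = str(max_cap)
    return props


if __name__ == "__main__":
    # Hypothetical split: large queries capped at 70%, small ones get the rest.
    for key, value in capacity_properties(
            {"batch": (70, 70), "adhoc": (30, 50)}).items():
        print(f"{key}={value}")
```

With queues like these in place, a Hive session would pick one with `set tez.queue.name=adhoc;` (again, a hypothetical queue name). For the log question, `yarn logs -applicationId <application ID>` prints the aggregated container logs, provided log aggregation is enabled on the cluster.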

From: mahender bigdata []
Sent: Sunday, February 28, 2016 2:10 PM
Subject: How to Query running in background in tez


I have two questions about Hive queries:

  1.  Is there a way to find out, by application ID, which Hive query is running in the background?
I would also like to know where the logs are located while a Hive query is running in Tez mode.
  2.  I have a 20-node cluster. When I submit a query, it takes all of the cluster's resources,
so any query submitted later stays in Pending. Is there a way to limit a single query to
50%-70% of the cluster's resources, so that other queries, especially smaller ones, can complete?