From: William Cox
Date: Wed, 1 Feb 2017 17:30:23 -0500
Subject: Re: queries not being submitted in Impala cluster despite free resources
To: user@impala.incubator.apache.org

Thanks. We've removed HA Proxy so that queries are admitted through a single
coordinator, and we've reduced the default query memory limit to about 5GB.
Things are looking pretty good. Y'all have been very helpful.
Thanks!
-William
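
For reference, the same cap can also be applied per session or per query through
the MEM_LIMIT query option, independent of the pool default - a minimal sketch
(the 5gb value simply mirrors the limit mentioned above; the table name is a
placeholder):

    -- impala-shell: cap every query in this session at ~5GB per node
    SET MEM_LIMIT=5gb;
    -- queries issued afterwards in this session run under that cap
    SELECT count(*) FROM web_logs;   -- web_logs is a placeholder table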


On Wed, Feb 1, 2017 at 3:18 PM, Matthew Jacobs <mj@cloudera.com> wrote:
Yes, your understanding is correct. This is a limitation of the
current distributed design of the admission control mechanism, and one
that we aim to improve by creating a centralized node for admission.
In the meantime, you may need to avoid using a load balancer if you're
sensitive to overadmission.

On Wed, Feb 1, 2017 at 9:23 AM, William Cox
<william.cox@distilnetworks.com> wrote:
> Tim,
>
> I have a 7 node cluster with 159.71 GB available to each Impala node (1.1TB
> available total) - the default resource allocation pool has 700GB allocated
> - so 100GB per node.
>
> We have a "default query memory limit" set to 25GB. From reading
> (https://www.cloudera.com/documentation/enterprise/5-8-x/topics/impala_mem_limit.html)
> it would seem that this means each node can only run 4 queries at once, since
> Impala is requesting 25GB per query regardless of the estimate (100/25 = 4).

There is not a hard reservation yet, so the queries will consume as
much as they can up to the mem limit. If there is 'overadmission',
then more queries will begin executing and thus some may run OOM.

>
> What I *don't* understand is how this works with running more than 4 queries
> *total* at any time - wouldn't Impala be asking for 25GB for each query on
> each node?
>
> It should also be noted that we set up HA proxy in front of Impala
> (http://www.cloudera.com/documentation/enterprise/5-8-x/topics/impala_proxy.html)
> because we have a lot of ad hoc users. From reading the Admission Control
> docs it seems that maybe that's part of the problem: "Note that admission
> control currently offers only soft limits when multiple coordinators are
> being used."
>
> So while I can only seem to run 4 queries per node, I can run more than 4
> total because of the multiple coordinators?
>
> -William
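
A back-of-the-envelope way to reconcile the two numbers, assuming each query runs
fragments on all 7 nodes and the pool's 700GB cap is accounted across the whole
cluster:

    per node:                100GB / 25GB per query     = 4 queries
    cluster, 1 coordinator:  700GB / (25GB x 7 nodes)   = 4 queries

So a single coordinator would also stop at roughly 4 concurrent queries; with
several coordinators each deciding on slightly stale cluster state, the total can
temporarily drift above that - the soft-limit behaviour quoted above.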
>
>
>
> On Tue, Jan 31, 2017 at 2:08 PM, Tim Armstrong <tarmstrong@cloudera.com>
> wrote:
>>
>> Do you have a default query memory limit set? Admission control does not
>> generally work well if it's relying on the estimated memory requirement -
>> you really need to have query memory limits set. If you have the default
>> query memory limit set to 25GB, then admission control assumes that the
>> query will use that amount on each node. I assume you mean 700GB memory
>> total across all nodes - how much memory do you have per node?
>>
>> On Tue, Jan 31, 2017 at 7:31 AM, Jeszy <jeszyb@gmail.com> wrote:
>>>
>>> That would be good. If they eventually run successfully, a query profile
>>> would also be welcome.
>>>
>>> Thanks
>>>
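
For completeness, a profile can usually be grabbed right after the statement
finishes, either from impala-shell or from the coordinator's debug web UI - a
minimal sketch (default web UI port assumed, table name is a placeholder):

    -- impala-shell: run the statement, then dump the runtime profile of the last query
    SELECT count(*) FROM web_logs;
    PROFILE;
    -- or browse http://<coordinator-host>:25000/queries and open the query's profile link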
>>> On Tue, Jan 31, 2017 at 4:28 PM, William Cox
>>> <william.cox@distilnetworks.com> wrote:
>>>>
>>>> Jeszy,
>>>>
>>>> Thanks for the suggestion. We also have a 25GB per-query limit set up.
>>>> Queries that estimate a large size are rejected with an error stating they
>>>> exceeded the memory limit. The queries I'm having trouble with are ones that
>>>> have no such error but simply wait in the CREATED state. Next time it
>>>> happens I'll see if I can grab the memory estimates and check.
>>>> Thanks.
>>>> -William
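
One way to see those estimates before a query runs is EXPLAIN - a minimal sketch
(placeholder table name; the exact wording of the estimate header varies between
Impala releases):

    -- impala-shell: ask for a more detailed plan, including per-host memory estimates
    SET EXPLAIN_LEVEL=2;
    EXPLAIN SELECT count(*) FROM web_logs;
    -- look for the "Estimated Per-Host Requirements: Memory=..." line; when no
    -- mem_limit is set, it is this estimate (not actual usage) that admission
    -- control compares against the pool limits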
>>>>
>>>>
>>>> On Tue, Jan 31, 2017 at 7:08 AM, Jeszy <jeszyb@gmail.com> wrote:
>>>>>
>>>>> Hey William,
>>>>>
>>>>> IIUC you have configured both a memory-based upper bound and a #
>>>>> queries upper bound for the default pool. A query can get queued if it would
>>>>> exceed either of these limits. If you're not hitting the number of queries
>>>>> one, then it's probably memory, which can happen even if not fully utilized
>>>>> - unless you specify a mem_limit for the query, the estimated memory
>>>>> requirement will be used for deciding whether the query should be admitted.
>>>>> This can get out of hand when the cardinality estimation is off, either due
>>>>> to a very complex query or because of missing / old stats.
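
When stale or missing stats are the suspect, checking and refreshing them is cheap
to try - a minimal sketch with a placeholder table name:

    -- missing stats show up as -1 in the #Rows column
    SHOW TABLE STATS web_logs;
    SHOW COLUMN STATS web_logs;
    -- recompute so the planner's cardinality (and memory) estimates are realistic again
    COMPUTE STATS web_logs;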
>>>>>
>>>>> This is about memory-based admission control exclusively, but I think
>>>>> it will be helpful:
>>>>> http://www.cloudera.com/documentation/enterprise/latest/topics/impala_admission.html#admission_memory
>>>>>
>>>>> HTH
>>>>>
>>>>> On Mon, Jan 30, 2017 at 8:31 PM, William Cox
>>>>> <william.cox@distilnetworks.com> wrote:
>>>>>>
>>>>>> I'm running CDH 5.8.0-1 and Impala version 2.6.0-cdh5.8.0 RELEASE
>>>>>> (build 8d8652f69461f0dd8d5f474573fb5de7ceb0ee6b). We have enabled resource
>>>>>> management and allocated ~700GB of memory with 30 running queries for the
>>>>>> default pool. Our background data jobs are Unlimited.
>>>>>>
>>>>>> In spite of this setup, we still encounter times where queries will be
>>>>>> marked as CREATED and waiting for allocation when the number of running
>>>>>> queries is well below 30 and the amount of used memory, as listed in the CDH
>>>>>> UI, is well below 700GB.
>>>>>>
>>>>>> This is seemingly unpredictable. We've created extensive monitors to
>>>>>> track # of running queries and memory usage, but there seems to be no pattern
>>>>>> to why/when these queries won't be submitted to the cluster.
>>>>>>
>>>>>> Is there some key metric that I might be missing, or are there any
>>>>>> suggestions folks have for tracking down these queries that won't be
>>>>>> submitted?
>>>>>> Thanks.
>>>>>> -William
>>>>>>
>>>>>
>>>>
>>>
>>
>
