drill-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Jacques Nadeau <jacq...@apache.org>
Subject Re: TestDrillbitResilience broken? assertion errors; now slow/hung, with 278 threads!
Date Wed, 29 Apr 2015 16:14:34 GMT
The thread count doesn't seem that surprising given the nature of the
test.  It's starting up three distinct Drillbits plus a DrillClient.  That
is a large number of RPC pools (3 server and 2 client pools for each
Drillbit plus a client pool for the DrillClient).

I'd focus on the two actual failures.

On Wed, Apr 29, 2015 at 12:13 AM, Daniel Barclay <dbarclay@maprtech.com>
wrote:

> Does anyone know what's going on with TestDrillbitResilience (rebased
> from master today)?  (Is it working right?)
>
>
> One run, via "mvn install", yielded assertion errors:
>
> ...
> Error shutting down Drillbit "beta".
> Tests run: 11, Failures: 2, Errors: 0, Skipped: 0, Time elapsed: 33.811
> sec <<< FAILURE! - in org.apache.drill.exec.server.TestDrillbitResilience
> cancelAfterEverythingIsCompleted(org.apache.drill.exec.server.TestDrillbitResilience)
> Time elapsed: 1.468 sec  <<< FAILURE!
> java.lang.AssertionError: null
>         at
> org.apache.drill.exec.server.TestDrillbitResilience.assertCancelled(TestDrillbitResilience.java:459)
>         at
> org.apache.drill.exec.server.TestDrillbitResilience.cancelAfterEverythingIsCompleted(TestDrillbitResilience.java:565)
>
> cancelInMiddleOfFetchingResults(org.apache.drill.exec.server.TestDrillbitResilience)
> Time elapsed: 1.496 sec  <<< FAILURE!
> java.lang.AssertionError: null
>         at
> org.apache.drill.exec.server.TestDrillbitResilience.assertCancelled(TestDrillbitResilience.java:459)
>         at
> org.apache.drill.exec.server.TestDrillbitResilience.cancelInMiddleOfFetchingResults(TestDrillbitResilience.java:510)
>
> Running <next test>
> ...
>
>
> A second run, run individually (but still via Maven) died with different
> errors.
>
>
>
> A third run, via "mvn install" again, seems hung after reporting this
> (maybe expected) exception:
>
> Exception (no rows returned):
> org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR:
> run-try-end
>
>
> [fb9cfe61-af6e-4c9c-b6ab-8a1b8725c6e9 on dev-linux2:31010]
>
>
> The process is using only about 5% CPU--but has 278 threads!
> (That includes about 35 threads all with the same name of "BitClient-1".)
>
>
> Daniel
>
>
>
>
>
>
> --
> Daniel Barclay
> MapR Technologies
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message