Date: Wed, 14 Jun 2017 13:58:00 +0000 (UTC)
From: "Greg Hogan (JIRA)"
To: issues@flink.apache.org
Reply-To: dev@flink.apache.org
Subject: [jira] [Comment Edited] (FLINK-6918) Failing tests: ChainLengthDecreaseTest and ChainLengthIncreaseTest

    [ https://issues.apache.org/jira/browse/FLINK-6918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16049202#comment-16049202 ]

Greg Hogan edited comment on FLINK-6918 at 6/14/17 1:57 PM:
------------------------------------------------------------

Yes, also 1.3. I'm wondering if we aren't cleaning up {{/tmp}} as the error seems to be alternating and not an issue on TravisCI. Also, as you noted, the fast failure (< 1s this time).

{noformat}
testMigrationAndRestore[Migrate Savepoint: nonKeyed-flink1.3](org.apache.flink.test.state.operator.restore.unkeyed.ChainLengthIncreaseTest) Time elapsed: 0.595 sec <<< ERROR!
java.lang.Exception: java.lang.Exception: Failed to trigger savepoint.
	at org.apache.flink.test.state.operator.restore.AbstractOperatorRestoreTestBase.migrateJob(AbstractOperatorRestoreTestBase.java:202)
{noformat}


was (Author: greghogan):
    Yes, also 1.3.
I'm wondering if we aren't cleaning up {{/tmp}} as the error seems to be alternating and not an issue on TravisCI.

{noformat}
testMigrationAndRestore[Migrate Savepoint: nonKeyed-flink1.3](org.apache.flink.test.state.operator.restore.unkeyed.ChainLengthIncreaseTest) Time elapsed: 0.595 sec <<< ERROR!
java.lang.Exception: java.lang.Exception: Failed to trigger savepoint.
	at org.apache.flink.test.state.operator.restore.AbstractOperatorRestoreTestBase.migrateJob(AbstractOperatorRestoreTestBase.java:202)
{noformat}

> Failing tests: ChainLengthDecreaseTest and ChainLengthIncreaseTest
> ------------------------------------------------------------------
>
>                 Key: FLINK-6918
>                 URL: https://issues.apache.org/jira/browse/FLINK-6918
>             Project: Flink
>          Issue Type: Bug
>          Components: Tests
>    Affects Versions: 1.4.0
>            Reporter: Greg Hogan
>
> While running {{mvn clean verify}} on Linux with {{commit 3bad77c0ae932a926260b769efb151a89fc309ab}}.
> {noformat}
> Tests in error:
>   ChainLengthDecreaseTest>AbstractOperatorRestoreTestBase.testMigrationAndRestore:164->AbstractOperatorRestoreTestBase.migrateJob:202 »
>   ChainLengthIncreaseTest>AbstractOperatorRestoreTestBase.testMigrationAndRestore:164->AbstractOperatorRestoreTestBase.migrateJob:202 »
> {noformat}
> {noformat}
> Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 6.497 sec <<< FAILURE! - in org.apache.flink.test.state.operator.restore.unkeyed.ChainLengthDecreaseTest
> testMigrationAndRestore[Migrate Savepoint: nonKeyed-flink1.3](org.apache.flink.test.state.operator.restore.unkeyed.ChainLengthDecreaseTest) Time elapsed: 0.361 sec <<< ERROR!
> java.lang.Exception: java.lang.Exception: Failed to trigger savepoint.
> 	at org.apache.flink.test.state.operator.restore.AbstractOperatorRestoreTestBase.migrateJob(AbstractOperatorRestoreTestBase.java:202)
> 	at org.apache.flink.test.state.operator.restore.AbstractOperatorRestoreTestBase.testMigrationAndRestore(AbstractOperatorRestoreTestBase.java:164)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:498)
> 	at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
> 	at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
> 	at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
> 	at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
> 	at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48)
> 	at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55)
> 	at org.junit.rules.RunRules.evaluate(RunRules.java:20)
> 	at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
> 	at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
> 	at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
> 	at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> 	at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> 	at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
> 	at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> 	at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
> 	at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> 	at org.junit.runners.Suite.runChild(Suite.java:128)
> 	at org.junit.runners.Suite.runChild(Suite.java:27)
> 	at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> 	at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> 	at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
> 	at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> 	at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
> 	at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
> 	at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
> 	at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> 	at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:283)
> 	at org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:173)
> 	at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:153)
> 	at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:128)
> 	at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:203)
> 	at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:155)
> 	at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)
> Caused by: java.lang.Exception: Failed to trigger savepoint.
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anon$6.apply(JobManager.scala:639)
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anon$6.apply(JobManager.scala:629)
> 	at org.apache.flink.runtime.concurrent.impl.FlinkFuture$5.onComplete(FlinkFuture.java:272)
> 	at akka.dispatch.OnComplete.internal(Future.scala:247)
> 	at akka.dispatch.OnComplete.internal(Future.scala:245)
> 	at akka.dispatch.japi$CallbackBridge.apply(Future.scala:175)
> 	at akka.dispatch.japi$CallbackBridge.apply(Future.scala:172)
> 	at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:32)
> 	at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:55)
> 	at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:91)
> 	at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)
> 	at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)
> 	at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72)
> 	at akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:90)
> 	at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:40)
> 	at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:397)
> 	at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> 	at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> 	at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> 	at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> Caused by: java.lang.Exception: Checkpoint was declined (tasks not ready)
> 	at org.apache.flink.runtime.checkpoint.PendingCheckpoint.abortDeclined(PendingCheckpoint.java:510)
> 	at org.apache.flink.runtime.checkpoint.CheckpointCoordinator.receiveDeclineMessage(CheckpointCoordinator.java:698)
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$org$apache$flink$runtime$jobmanager$JobManager$$handleCheckpointMessage$2.apply$mcV$sp(JobManager.scala:1491)
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$org$apache$flink$runtime$jobmanager$JobManager$$handleCheckpointMessage$2.apply(JobManager.scala:1490)
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$org$apache$flink$runtime$jobmanager$JobManager$$handleCheckpointMessage$2.apply(JobManager.scala:1490)
> 	at scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
> 	at scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
> 	at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:40)
> 	at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:397)
> 	at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> 	at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> 	at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> 	at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> Tests run: 2, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 6.722 sec <<< FAILURE! - in org.apache.flink.test.state.operator.restore.unkeyed.ChainLengthIncreaseTest
> testMigrationAndRestore[Migrate Savepoint: nonKeyed-flink1.3](org.apache.flink.test.state.operator.restore.unkeyed.ChainLengthIncreaseTest) Time elapsed: 0.458 sec <<< ERROR!
> java.lang.Exception: java.lang.Exception: Failed to trigger savepoint.
> 	at org.apache.flink.test.state.operator.restore.AbstractOperatorRestoreTestBase.migrateJob(AbstractOperatorRestoreTestBase.java:202)
> 	at org.apache.flink.test.state.operator.restore.AbstractOperatorRestoreTestBase.testMigrationAndRestore(AbstractOperatorRestoreTestBase.java:164)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> 	at java.lang.reflect.Method.invoke(Method.java:498)
> 	at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
> 	at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
> 	at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
> 	at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
> 	at org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48)
> 	at org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55)
> 	at org.junit.rules.RunRules.evaluate(RunRules.java:20)
> 	at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
> 	at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
> 	at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
> 	at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> 	at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> 	at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
> 	at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> 	at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
> 	at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> 	at org.junit.runners.Suite.runChild(Suite.java:128)
> 	at org.junit.runners.Suite.runChild(Suite.java:27)
> 	at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
> 	at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
> 	at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
> 	at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
> 	at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
> 	at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
> 	at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
> 	at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
> 	at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:283)
> 	at org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:173)
> 	at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:153)
> 	at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:128)
> 	at org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:203)
> 	at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:155)
> 	at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:103)
> Caused by: java.lang.Exception: Failed to trigger savepoint.
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anon$6.apply(JobManager.scala:639)
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$handleMessage$1$$anon$6.apply(JobManager.scala:629)
> 	at org.apache.flink.runtime.concurrent.impl.FlinkFuture$5.onComplete(FlinkFuture.java:272)
> 	at akka.dispatch.OnComplete.internal(Future.scala:247)
> 	at akka.dispatch.OnComplete.internal(Future.scala:245)
> 	at akka.dispatch.japi$CallbackBridge.apply(Future.scala:175)
> 	at akka.dispatch.japi$CallbackBridge.apply(Future.scala:172)
> 	at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:32)
> 	at akka.dispatch.BatchingExecutor$AbstractBatch.processBatch(BatchingExecutor.scala:55)
> 	at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply$mcV$sp(BatchingExecutor.scala:91)
> 	at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)
> 	at akka.dispatch.BatchingExecutor$BlockableBatch$$anonfun$run$1.apply(BatchingExecutor.scala:91)
> 	at scala.concurrent.BlockContext$.withBlockContext(BlockContext.scala:72)
> 	at akka.dispatch.BatchingExecutor$BlockableBatch.run(BatchingExecutor.scala:90)
> 	at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:40)
> 	at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:397)
> 	at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> 	at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> 	at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> 	at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> Caused by: java.lang.Exception: Checkpoint was declined (tasks not ready)
> 	at org.apache.flink.runtime.checkpoint.PendingCheckpoint.abortDeclined(PendingCheckpoint.java:510)
> 	at org.apache.flink.runtime.checkpoint.CheckpointCoordinator.receiveDeclineMessage(CheckpointCoordinator.java:698)
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$org$apache$flink$runtime$jobmanager$JobManager$$handleCheckpointMessage$2.apply$mcV$sp(JobManager.scala:1491)
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$org$apache$flink$runtime$jobmanager$JobManager$$handleCheckpointMessage$2.apply(JobManager.scala:1490)
> 	at org.apache.flink.runtime.jobmanager.JobManager$$anonfun$org$apache$flink$runtime$jobmanager$JobManager$$handleCheckpointMessage$2.apply(JobManager.scala:1490)
> 	at scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)
> 	at scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)
> 	at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:40)
> 	at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:397)
> 	at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
> 	at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
> 	at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
> 	at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
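
Note on the traces: the innermost cause in both is "Checkpoint was declined (tasks not ready)", and the failures take well under a second. One possible reading, which the report does not confirm, is that the savepoint is requested before every task is running. Under that assumption, a test could retry the trigger a few times instead of failing on the first decline. The Java sketch below only illustrates that idea. The names SavepointRetry, retryWhileNotReady and isNotReady are hypothetical and are not Flink APIs, and matching on the exception message is a shortcut chosen because the trace shows a plain java.lang.Exception.

```java
import java.util.concurrent.Callable;

/** Hypothetical helper (not part of Flink): retries an action while it is declined as "tasks not ready". */
final class SavepointRetry {

    private SavepointRetry() {}

    /** Returns true if the throwable or any of its causes has a message containing "tasks not ready". */
    static boolean isNotReady(Throwable t) {
        for (Throwable c = t; c != null; c = c.getCause()) {
            String msg = c.getMessage();
            if (msg != null && msg.contains("tasks not ready")) {
                return true;
            }
        }
        return false;
    }

    /**
     * Calls the action up to maxAttempts times, sleeping pauseMillis between attempts.
     * Any failure that is not a "tasks not ready" decline is rethrown immediately;
     * if every attempt is declined, the last decline is rethrown.
     */
    static <T> T retryWhileNotReady(Callable<T> action, int maxAttempts, long pauseMillis)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        Exception last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return action.call();
            } catch (Exception e) {
                if (!isNotReady(e)) {
                    throw e; // unrelated failure: do not mask it with retries
                }
                last = e;
                if (attempt < maxAttempts) {
                    Thread.sleep(pauseMillis);
                }
            }
        }
        throw last;
    }
}
```

A test might wrap its own trigger call as retryWhileNotReady(() -> triggerSavepoint(), 20, 50L), where triggerSavepoint stands in for whatever the test already does to request the savepoint. This would not address the {{/tmp}} cleanup theory above; it only covers the case where the decline is a startup race.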