aurora-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bill Farner (JIRA)" <j...@apache.org>
Subject [jira] [Comment Edited] (AURORA-1395) RescheduleCalculator precondition fails when using DbTaskStore
Date Sat, 11 Jul 2015 04:41:04 GMT

    [ https://issues.apache.org/jira/browse/AURORA-1395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14623066#comment-14623066
] 

Bill Farner edited comment on AURORA-1395 at 7/11/15 4:40 AM:
--------------------------------------------------------------

I realized we implement an update with delete/insert in DbTaskStore, which would cause a cascading
delete in the task_events table.  We should be doing an UPDATE or MERGE instead to keep the
child table in tact.  I'm not sure if that's causing this issue, though.

*edit* While the above isn't nice, it should actually work fine (though since we allow dirty
reads, there's a window where other readers could get stale data.  The resulting consistency
should be preserved, though, since we re-insert into child tables.


was (Author: wfarner):
I realized we implement an update with delete/insert in DbTaskStore, which would cause a cascading
delete in the task_events table.  We should be doing an UPDATE or MERGE instead to keep the
child table in tact.  I'm not sure if that's causing this issue, though.

*edit* While the above isn't nice, it should actually work find (though since we allow dirty
reads, there's a window where other readers could get stale data.  The resulting consistency
should be preserved, though, since we re-insert into child tables.

> RescheduleCalculator precondition fails when using DbTaskStore
> --------------------------------------------------------------
>
>                 Key: AURORA-1395
>                 URL: https://issues.apache.org/jira/browse/AURORA-1395
>             Project: Aurora
>          Issue Type: Bug
>          Components: Scheduler
>            Reporter: Bill Farner
>            Assignee: Bill Farner
>            Priority: Critical
>
> When enabling the DB task store, i frequently encounter this exception due to a precondition
check fail in RescheduleCalculator:
> {noformat}
> E0710 22:33:48.688 THREAD138 org.apache.aurora.scheduler.events.PubsubEventModule$1.handleException:
Failed to dispatch event to public void org.apache.aurora.scheduler.async.TaskThrottler.taskChangedState(org.apache.aurora.scheduler.events.PubsubEvent$TaskStateChange):
java.lang.IllegalStateException
> java.lang.IllegalStateException
>         at com.google.common.base.Preconditions.checkState(Preconditions.java:161)
>         at org.apache.aurora.scheduler.async.RescheduleCalculator$RescheduleCalculatorImpl$1.apply(RescheduleCalculator.java:103)
>         at org.apache.aurora.scheduler.async.RescheduleCalculator$RescheduleCalculatorImpl$1.apply(RescheduleCalculator.java:85)
>         at org.apache.aurora.scheduler.async.RescheduleCalculator$RescheduleCalculatorImpl.getFlappingPenaltyMs(RescheduleCalculator.java:159)
>         at org.apache.aurora.scheduler.async.TaskThrottler.taskChangedState(TaskThrottler.java:72)
>         at sun.reflect.GeneratedMethodAccessor89.invoke(Unknown Source)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:497)
>         at com.google.common.eventbus.EventSubscriber.handleEvent(EventSubscriber.java:74)
>         at com.google.common.eventbus.SynchronizedEventSubscriber.handleEvent(SynchronizedEventSubscriber.java:47)
>         at com.google.common.eventbus.EventBus.dispatch(EventBus.java:322)
>         at com.google.common.eventbus.AsyncEventBus.access$001(AsyncEventBus.java:34)
>         at com.google.common.eventbus.AsyncEventBus$1.run(AsyncEventBus.java:117)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>         at java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message