Subject: Re: Review Request 27654: Add introspection of long running assignments
From: "Josh Elser"
To: "accumulo", "Josh Elser", keith@deenlo.com
Date: Thu, 06 Nov 2014 18:06:06 -0000
Message-ID: <20141106180606.10454.94999@reviews.apache.org>
In-Reply-To: <20141106174751.10454.96318@reviews.apache.org>
X-ReviewRequest-URL: https://reviews.apache.org/r/27654/


> On Nov. 6, 2014, 5:47 p.m., kturner wrote:
> > core/src/main/java/org/apache/accumulo/core/conf/Property.java, line 276
> >
> > I think this default may be too low for reasons you mentioned on IRC (like a walog recovery taking a while). I am thinking 5 or 10 min would be better. We don't want people to ignore the warning, if it happens more than it should.

Yeah, I knew I needed to come back to this. I'm unsure what a reasonable value is. I even considered 15 minutes.


> On Nov. 6, 2014, 5:47 p.m., kturner wrote:
> > server/tserver/src/main/java/org/apache/accumulo/tserver/ActiveAssignmentRunnable.java, line 28
> >
> > It seems LoggingRunnable's run method is never executed?

Good catch. Looking back at this, I don't think ActiveAssignmentRunnable should be a LoggingRunnable; it should just accept a LoggingRunnable instead.


> On Nov. 6, 2014, 5:47 p.m., kturner wrote:
> > server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServerResourceManager.java, line 203
> >
> > Why have two maps?

Oh, true. I was just keeping metadata and normal assignments separate, but the key spaces will never overlap.


> On Nov. 6, 2014, 5:47 p.m., kturner wrote:
> > server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServerResourceManager.java, line 248
> >
> > 15 seconds?

Heh, thanks. I had the timeout really small for testing, before I added the configuration.


> On Nov. 6, 2014, 5:47 p.m., kturner wrote:
> > server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServerResourceManager.java, line 250
> >
> > The compaction code remembers when it logged an exception and does not do it again. It also logs a message if the compaction becomes unstuck. An advantage I thought of w/ repeatedly logging, is that you could see the stack trace changing (or not).
> >
> > The stack trace is a possible trace. By the time logging happens, the assignment could have completed and the thread could have moved on to other things.

Yeah, since these checks run fairly regularly (on the order of seconds), a stuck assignment could get really spammy. As you point out, there could be value in printing the stack more than once. Maybe I could add some backoff so that it only warns every so often?

> > By the time logging happens, the assignment could have completed and the thread could have moved on to other things.

Do you think the message should be updated to be more clear about this? A "Maybe you should look into this" type message?


- Josh


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/27654/#review60185
-----------------------------------------------------------


On Nov. 6, 2014, 12:58 a.m., Josh Elser wrote:
>
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/27654/
> -----------------------------------------------------------
>
> (Updated Nov. 6, 2014, 12:58 a.m.)
>
>
> Review request for accumulo.
>
>
> Bugs: ACCUMULO-3304
>     https://issues.apache.org/jira/browse/ACCUMULO-3304
>
>
> Repository: accumulo
>
>
> Description
> -------
>
> Watches assignments and reports when an assignment is running for longer than a configured time.
>
>
> Diffs
> -----
>
>   core/src/main/java/org/apache/accumulo/core/conf/Property.java 56f3d9c
>   server/tserver/src/main/java/org/apache/accumulo/tserver/ActiveAssignmentRunnable.java PRE-CREATION
>   server/tserver/src/main/java/org/apache/accumulo/tserver/RunnableStartedAt.java PRE-CREATION
>   server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServer.java 94be0bb
>   server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServerResourceManager.java 935ffeb
>
> Diff: https://reviews.apache.org/r/27654/diff/
>
>
> Testing
> -------
>
> Very minimal.
>
>
> Thanks,
>
> Josh Elser
>
>
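
For reference, the backoff idea raised in the replies above could look roughly like the following. This is only a sketch under assumptions, not the code in the patch: the class `AssignmentWatchSketch`, its method names, the string key standing in for a tablet extent, and the 10 minute / 5 minute values are all hypothetical. It keeps one map for every in-flight assignment, warns once an assignment passes the configured threshold, and then stays quiet until a minimum interval has elapsed. The warning text also hedges that the assignment may already have finished by the time the check runs.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical sketch of a long-running-assignment watcher with warning backoff. */
final class AssignmentWatchSketch {

  /** Start time and last-warning time for one in-flight assignment. */
  private static final class Tracked {
    final long startMillis;
    long lastWarnMillis = -1; // -1 means no warning has been issued yet

    Tracked(long startMillis) {
      this.startMillis = startMillis;
    }
  }

  // A single map covers metadata and user tablets, since their keys do not overlap.
  private final Map<String, Tracked> active = new ConcurrentHashMap<>();
  private final long warnAfterMillis;
  private final long minWarnIntervalMillis;

  AssignmentWatchSketch(long warnAfterMillis, long minWarnIntervalMillis) {
    this.warnAfterMillis = warnAfterMillis;
    this.minWarnIntervalMillis = minWarnIntervalMillis;
  }

  /** Record that an assignment began at the given time. */
  void started(String extent, long nowMillis) {
    active.put(extent, new Tracked(nowMillis));
  }

  /** Record that an assignment completed, so it is no longer watched. */
  void finished(String extent) {
    active.remove(extent);
  }

  /**
   * Returns the warnings that are due at nowMillis; a caller would log them.
   * Assumes a single watcher thread calls this method.
   */
  List<String> check(long nowMillis) {
    List<String> warnings = new ArrayList<>();
    for (Map.Entry<String, Tracked> e : active.entrySet()) {
      Tracked t = e.getValue();
      long running = nowMillis - t.startMillis;
      if (running < warnAfterMillis) {
        continue; // not long-running yet
      }
      boolean warnedRecently =
          t.lastWarnMillis >= 0 && nowMillis - t.lastWarnMillis < minWarnIntervalMillis;
      if (warnedRecently) {
        continue; // back off instead of repeating the warning on every check
      }
      t.lastWarnMillis = nowMillis;
      warnings.add("Assignment for " + e.getKey() + " has been running for at least "
          + running + " ms; it may have completed since this check began.");
    }
    return warnings;
  }

  public static void main(String[] args) {
    // Illustrative values only: warn after 10 minutes, then at most every 5 minutes.
    AssignmentWatchSketch watch = new AssignmentWatchSketch(10 * 60 * 1000L, 5 * 60 * 1000L);
    long start = System.currentTimeMillis();
    watch.started("hypothetical-extent", start);
    for (String warning : watch.check(start + 11 * 60 * 1000L)) {
      System.out.println(warning);
    }
  }
}
```

Passing the current time into `check` rather than reading the clock inside it keeps the sketch easy to test. Whether a repeated warning should also carry a fresh stack trace is left open here, as it is in the thread.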