Return-Path: X-Original-To: apmail-accumulo-dev-archive@www.apache.org Delivered-To: apmail-accumulo-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 4733717E4E for ; Thu, 6 Nov 2014 18:18:12 +0000 (UTC) Received: (qmail 92619 invoked by uid 500); 6 Nov 2014 18:18:12 -0000 Delivered-To: apmail-accumulo-dev-archive@accumulo.apache.org Received: (qmail 92573 invoked by uid 500); 6 Nov 2014 18:18:12 -0000 Mailing-List: contact dev-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@accumulo.apache.org Delivered-To: mailing list dev@accumulo.apache.org Received: (qmail 92556 invoked by uid 99); 6 Nov 2014 18:18:11 -0000 Received: from reviews-vm.apache.org (HELO reviews.apache.org) (140.211.11.40) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 Nov 2014 18:18:11 +0000 Received: from reviews.apache.org (localhost [127.0.0.1]) by reviews.apache.org (Postfix) with ESMTP id 36AF61DFC2A; Thu, 6 Nov 2014 18:18:15 +0000 (UTC) Content-Type: multipart/alternative; boundary="===============6495231374956962066==" MIME-Version: 1.0 Subject: Re: Review Request 27654: Add introspection of long running assignments From: keith@deenlo.com To: "accumulo" , "Josh Elser" , keith@deenlo.com Date: Thu, 06 Nov 2014 18:18:15 -0000 Message-ID: <20141106181815.10454.79400@reviews.apache.org> X-ReviewBoard-URL: https://reviews.apache.org Auto-Submitted: auto-generated Sender: noreply@reviews.apache.org X-ReviewGroup: accumulo X-ReviewRequest-URL: https://reviews.apache.org/r/27654/ X-Sender: noreply@reviews.apache.org References: <20141106174751.10454.96318@reviews.apache.org> In-Reply-To: <20141106174751.10454.96318@reviews.apache.org> Reply-To: keith@deenlo.com X-ReviewRequest-Repository: accumulo --===============6495231374956962066== MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit > On Nov. 6, 2014, 5:47 p.m., kturner wrote: > > server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServerResourceManager.java, line 250 > > > > > > The compaction code remembers when it logged an exception and does not do it again. It also logs a message if the compaction becomes unstuck. An advantage I thought of w/ repeatedly logging, is that you could see the stack trace changing (or not). > > > > > > The stack trace is a possible trace. By the time logging happens, the assignment could have completed and the thread could have moved on to other things. > > Josh Elser wrote: > Yeah, since these are running fairly regularly (order of seconds) a stuck assignment could get really spammy. Like you point out, there could be value gained from printing out the stack more than once. Maybe I could add some backoff which only warns so often? > > bq. By the time logging happens, the assignment could have completed and the thread could have moved on to other things. > > Do you think the message should be updated to be more clear about this? A "Maybe you should look into this" type message? > a stuck assignment could get really spammy I think that spam is probably ok as long as the default is high enough such that when it does happen, its something to be concerned about. Could make the timer check a little less frequently. > Do you think the message should be updated to be more clear about this? I think compaction code just says its a possible stack trace. I suppose a good solution would be to have error codes, then user can look up error code and get nitty gritty details. Can't really put too much info in log message. - kturner ----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/27654/#review60185 ----------------------------------------------------------- On Nov. 6, 2014, 12:58 a.m., Josh Elser wrote: > > ----------------------------------------------------------- > This is an automatically generated e-mail. To reply, visit: > https://reviews.apache.org/r/27654/ > ----------------------------------------------------------- > > (Updated Nov. 6, 2014, 12:58 a.m.) > > > Review request for accumulo. > > > Bugs: ACCUMULO-3304 > https://issues.apache.org/jira/browse/ACCUMULO-3304 > > > Repository: accumulo > > > Description > ------- > > Watches assignments and reports when an assignment is running for longer than a configured time. > > > Diffs > ----- > > core/src/main/java/org/apache/accumulo/core/conf/Property.java 56f3d9c > server/tserver/src/main/java/org/apache/accumulo/tserver/ActiveAssignmentRunnable.java PRE-CREATION > server/tserver/src/main/java/org/apache/accumulo/tserver/RunnableStartedAt.java PRE-CREATION > server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServer.java 94be0bb > server/tserver/src/main/java/org/apache/accumulo/tserver/TabletServerResourceManager.java 935ffeb > > Diff: https://reviews.apache.org/r/27654/diff/ > > > Testing > ------- > > Very minimal. > > > Thanks, > > Josh Elser > > --===============6495231374956962066==--