Return-Path: X-Original-To: apmail-accumulo-notifications-archive@minotaur.apache.org Delivered-To: apmail-accumulo-notifications-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id AE1D211829 for ; Tue, 15 Apr 2014 19:12:31 +0000 (UTC) Received: (qmail 77892 invoked by uid 500); 15 Apr 2014 19:12:23 -0000 Delivered-To: apmail-accumulo-notifications-archive@accumulo.apache.org Received: (qmail 77686 invoked by uid 500); 15 Apr 2014 19:12:20 -0000 Mailing-List: contact notifications-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@apache.org Delivered-To: mailing list notifications@accumulo.apache.org Received: (qmail 77665 invoked by uid 99); 15 Apr 2014 19:12:19 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 15 Apr 2014 19:12:19 +0000 Date: Tue, 15 Apr 2014 19:12:19 +0000 (UTC) From: "ASF subversion and git services (JIRA)" To: notifications@accumulo.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (ACCUMULO-2621) Masters not restarting during concurrent randomwalk MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/ACCUMULO-2621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969930#comment-13969930 ] ASF subversion and git services commented on ACCUMULO-2621: ----------------------------------------------------------- Commit ef12e59ecf918299574eef94deb62464fb3f54bd in accumulo's branch refs/heads/master from [~bhavanki] [ https://git-wip-us.apache.org/repos/asf?p=accumulo.git;h=ef12e59 ] ACCUMULO-2621 Add delay to Shutdown step of continuous randomwalk, update README The Concurrent.xml randomwalk module can shut down and restart the Accumulo cluster during the test. This commit adds a 10 second delay after shutdown, to give the master plenty of time to exit before the test attempts to start it up again. Also, it is essential that the walker(s) running the test have passwordless SSH access to the cluster to run $ACCUMULO_HOME/bin/start-all.sh. Otherwise, the masters will not restart and the test will eventually get stuck. The randomwalk README.md now has information about that. > Masters not restarting during concurrent randomwalk > --------------------------------------------------- > > Key: ACCUMULO-2621 > URL: https://issues.apache.org/jira/browse/ACCUMULO-2621 > Project: Accumulo > Issue Type: Test > Components: test > Reporter: Bill Havanki > Assignee: Bill Havanki > Priority: Critical > Labels: 16_qa_bug, randomwalk, test > Fix For: 1.6.1 > > Attachments: ACCUMULO-2621.v1.patch.txt > > > The Concurrent randomwalk test can stop and restart the masters. Under 1.6.0-SNAPSHOT, the stopped masters are not restarting, and eventually the test becomes stuck reporting "No matchers..." forever. > Tested on 7-node CentOS 6.4 cluster, 2 masters. The active master seems to die first, then the standby that becomes the new master. -- This message was sent by Atlassian JIRA (v6.2#6252)