Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id A70AF200B3B for ; Mon, 11 Jul 2016 15:11:14 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id A592F160A62; Mon, 11 Jul 2016 13:11:14 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id C6831160A85 for ; Mon, 11 Jul 2016 15:11:13 +0200 (CEST) Received: (qmail 33616 invoked by uid 500); 11 Jul 2016 13:11:12 -0000 Mailing-List: contact notifications-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@apache.org Delivered-To: mailing list notifications@accumulo.apache.org Received: (qmail 33263 invoked by uid 99); 11 Jul 2016 13:11:11 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 11 Jul 2016 13:11:11 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 1458C2C02AB for ; Mon, 11 Jul 2016 13:11:11 +0000 (UTC) Date: Mon, 11 Jul 2016 13:11:11 +0000 (UTC) From: "Shawn Walker (JIRA)" To: notifications@accumulo.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (ACCUMULO-4365) ShellServerIT#trace() failing intermittently due to missing "sendMutations" block MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 archived-at: Mon, 11 Jul 2016 13:11:14 -0000 [ https://issues.apache.org/jira/browse/ACCUMULO-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15370722#comment-15370722 ] Shawn Walker commented on ACCUMULO-4365: ---------------------------------------- [~elserj]: Busy with ACCUMULO-4353 at the moment. I'll post a "me too", though. Actually, I've given up on ShellServerIT at present -- I always end up killing it after 10 minutes of waiting on it to finish. > ShellServerIT#trace() failing intermittently due to missing "sendMutations" block > --------------------------------------------------------------------------------- > > Key: ACCUMULO-4365 > URL: https://issues.apache.org/jira/browse/ACCUMULO-4365 > Project: Accumulo > Issue Type: Bug > Components: test > Reporter: Josh Elser > Fix For: 2.0.0 > > > Noticed this on master, but not sure if it also affects other branches. > {noformat} > trace(org.apache.accumulo.test.ShellServerIT) Time elapsed: 5.166 sec <<< FAILURE! > java.lang.AssertionError > at org.apache.accumulo.test.ShellServerIT.trace(ShellServerIT.java:1630) > {noformat} > This is a trace that was observed when the test case failed. > {noformat} > Trace started at 2016/07/10 22:43:38.277 > Time Start Service@Location Name > 3446+0 shell@localhost shell:root > 1+160 master@0.0.0.0 beginFateOperation > 4+167 master@0.0.0.0 executeFateOperation > 3+173 master@0.0.0.0 CreateTable > 2+176 master@0.0.0.0 CreateTable > 16+181 master@0.0.0.0 SetupPermissions > 4+200 master@0.0.0.0 PopulateZookeeper > 19+204 master@0.0.0.0 PopulateZookeeper > 1+694 master@0.0.0.0 ChooseDir > 1+709 master@0.0.0.0 CreateDir > 2+712 master@0.0.0.0 PopulateMetadata > 1+713 tserver@0.0.0.0 update > 1+713 tserver@0.0.0.0 prep > 5+716 master@0.0.0.0 FinishCreateTable > 563+172 master@0.0.0.0 waitForFateOperation > 2+736 master@0.0.0.0 finishFateOperation > 1513+745 shell@localhost close > 13+746 shell@localhost BinMutations 1 > 5+746 shell@localhost binMutations > 2+748 tserver@0.0.0.0 startScan > 1+748 tserver@0.0.0.0 metadata tablets read ahead 5 > 3+2259 tserver@0.0.0.0 getTableConfiguration > 3+2263 tserver@0.0.0.0 getTableConfiguration > 3+2267 tserver@0.0.0.0 getTableConfiguration > 3+2270 tserver@0.0.0.0 getTableConfiguration > 3+2281 shell@localhost scan > 2+2282 shell@localhost scan:location > 2+2282 tserver@0.0.0.0 startScan > 2+2282 tserver@0.0.0.0 tablet read ahead 6 > 7+2285 master@0.0.0.0 beginFateOperation > 2+2293 master@0.0.0.0 executeFateOperation > 3+2297 master@0.0.0.0 DeleteTable > 1+2300 master@0.0.0.0 DeleteTable > 4+2413 master@0.0.0.0 CleanUp > 2+2415 master@0.0.0.0 scan > 1+2415 master@0.0.0.0 scan:location > 1+2415 tserver@0.0.0.0 startScan > 1+2415 tserver@0.0.0.0 metadata tablets read ahead 6 > 20+2417 master@0.0.0.0 CleanUp > 2+2417 master@0.0.0.0 batch scanner 555- 1 > 1+2417 master@0.0.0.0 client:startMultiScan > 1+2418 tserver@0.0.0.0 startMultiScan > 1+2418 tserver@0.0.0.0 metadata tablets read ahead 7 > 1+2420 master@0.0.0.0 scan > 1+2420 master@0.0.0.0 scan:location > 1+2420 tserver@0.0.0.0 startScan > 1+2420 tserver@0.0.0.0 metadata tablets read ahead 1 > 2+2421 master@0.0.0.0 close > 1+2421 master@0.0.0.0 BinMutations 1 > 1+2421 master@0.0.0.0 binMutations > 1+2423 master@0.0.0.0 scan > 1+2423 master@0.0.0.0 scan:location > 1+2423 tserver@0.0.0.0 startScan > 1+2423 tserver@0.0.0.0 metadata tablets read ahead 8 > 145+2296 master@0.0.0.0 waitForFateOperation > 1+2441 master@0.0.0.0 finishFateOperation > {noformat} > In another run where the test did not fail: > {noformat} > Trace started at 2016/07/10 22:48:06.432 > Time Start Service@Location Name > 3066+0 shell@localhost shell:root > 5+210 master@0.0.0.0 beginFateOperation > 4+222 master@0.0.0.0 executeFateOperation > 2+228 master@0.0.0.0 CreateTable > 2+230 master@0.0.0.0 CreateTable > 15+235 master@0.0.0.0 SetupPermissions > 1+252 master@0.0.0.0 PopulateZookeeper > 10+253 master@0.0.0.0 PopulateZookeeper > 2+266 master@0.0.0.0 ChooseDir > 70+227 master@0.0.0.0 waitForFateOperation > 2+298 master@0.0.0.0 finishFateOperation > 1511+306 shell@localhost close > 9+306 shell@localhost BinMutations 1 > 5+306 shell@localhost binMutations > 2+308 tserver@0.0.0.0 startScan > 1+308 tserver@0.0.0.0 metadata tablets read ahead 5 > 6+1818 tserver@0.0.0.0 getTableConfiguration > 3+1825 tserver@0.0.0.0 getTableConfiguration > 4+1828 tserver@0.0.0.0 getTableConfiguration > 3+1833 tserver@0.0.0.0 getTableConfiguration > 1+1836 shell@localhost client:getUserAuthorizations > 3+1845 shell@localhost scan > 2+1846 shell@localhost scan:location > 2+1846 tserver@0.0.0.0 startScan > 1+1847 tserver@0.0.0.0 tablet read ahead 8 > 7+1849 master@0.0.0.0 beginFateOperation > 3+1856 master@0.0.0.0 executeFateOperation > 2+1860 master@0.0.0.0 DeleteTable > 1+1862 master@0.0.0.0 DeleteTable > 5+2027 master@0.0.0.0 CleanUp > 4+2028 master@0.0.0.0 scan > 4+2028 master@0.0.0.0 scan:location > 2+2029 tserver@0.0.0.0 startScan > 1+2030 tserver@0.0.0.0 metadata tablets read ahead 4 > 24+2032 master@0.0.0.0 CleanUp > 3+2032 master@0.0.0.0 batch scanner 560- 1 > 1+2032 master@0.0.0.0 client:startMultiScan > 1+2033 tserver@0.0.0.0 startMultiScan > 1+2033 tserver@0.0.0.0 metadata tablets read ahead 5 > 2+2035 master@0.0.0.0 scan > 2+2035 master@0.0.0.0 scan:location > 1+2036 tserver@0.0.0.0 startScan > 1+2036 tserver@0.0.0.0 metadata tablets read ahead 6 > 2+2037 master@0.0.0.0 close > 2+2039 master@0.0.0.0 scan > 2+2039 master@0.0.0.0 scan:location > 1+2040 tserver@0.0.0.0 startScan > 1+2040 tserver@0.0.0.0 metadata tablets read ahead 7 > 200+1859 master@0.0.0.0 waitForFateOperation > 1+2060 master@0.0.0.0 finishFateOperation > The following spans are not rooted (probably due to a parent span of length 0ms): > 2+273 master@0.0.0.0 PopulateMetadata > 1+274 tserver@0.0.0.0 update > 1+274 tserver@0.0.0.0 wal > 5+278 master@0.0.0.0 FinishCreateTable > 1+2038 master@0.0.0.0 sendMutations > 1+2038 tserver@0.0.0.0 update > 1+2038 master@0.0.0.0 org.apache.accumulo.core.client.impl.TabletServerBatchWriter$MutationWriter 1 > 1+2038 tserver@0.0.0.0 prep > {noformat} > Note how the only sendMutations is actually coming from the Master (and is unrooted for some reason...), not actually from the BatchWriter as we'd expected. > [~ShawnWalker], maybe this is related to your changes in ACCUMULO-4191? Do you have any time to look into this? -- This message was sent by Atlassian JIRA (v6.3.4#6332)