Date: Wed, 13 Nov 2013 18:05:26 +0000 (UTC)
From: "ASF subversion and git services (JIRA)"
To: notifications@accumulo.apache.org
Reply-To: jira@apache.org
Subject: [jira] [Commented] (ACCUMULO-1867) Split failed during conditional randomwalk test


    [ https://issues.apache.org/jira/browse/ACCUMULO-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13821608#comment-13821608 ]

ASF subversion and git services commented on ACCUMULO-1867:
-----------------------------------------------------------

Commit 5ded019232528880fa1746134799571abde19a85 in branch refs/heads/master from [~ecn]
[ https://git-wip-us.apache.org/repos/asf?p=accumulo.git;h=5ded019 ]

ACCUMULO-1867 use the correct file system when reading the indexes of existing files


> Split failed during conditional randomwalk test
> -----------------------------------------------
>
>                 Key: ACCUMULO-1867
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-1867
>             Project: Accumulo
>          Issue Type: Bug
>            Reporter: Keith Turner
>            Assignee: Eric Newton
>            Priority: Blocker
>             Fix For: 1.6.0
>
>
> I left the conditional random walk test running overnight against 1.6.0-SNAPSHOT configured to use two namenodes. After running for a few hours, a client saw a split operation fail, and I saw the following corresponding error message in the tserver logs.
> {noformat}
> 2013-11-08 12:31:59,227 [util.FileUtil] DEBUG: Too many indexes (33) to open at once for null null, reducing in tmpDir = /accumulo-1.6/tmp/idxReduce_1116774712
> 2013-11-08 12:31:59,369 [thrift.ProcessFunction] ERROR: Internal error processing splitTablet
> java.lang.IllegalArgumentException: Wrong FS: hdfs://nn2:9001/accumulo-1.6/tables/2/t-0000ew3/F0000ex7.rf, expected: hdfs://nn1:6093
>         at org.apache.hadoop.fs.FileSystem.checkPath(FileSystem.java:381)
>         at org.apache.hadoop.hdfs.DistributedFileSystem.getPathName(DistributedFileSystem.java:129)
>         at org.apache.hadoop.hdfs.DistributedFileSystem.open(DistributedFileSystem.java:154)
>         at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:427)
>         at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.getBCFile(CachableBlockFile.java:256)
>         at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.access$000(CachableBlockFile.java:143)
>         at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader$MetaBlockLoader.get(CachableBlockFile.java:212)
>         at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.getBlock(CachableBlockFile.java:313)
>         at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.getMetaBlock(CachableBlockFile.java:367)
>         at org.apache.accumulo.core.file.blockfile.impl.CachableBlockFile$Reader.getMetaBlock(CachableBlockFile.java:143)
>         at org.apache.accumulo.core.file.rfile.RFile$Reader.<init>(RFile.java:825)
>         at org.apache.accumulo.core.file.rfile.RFileOperations.openIndex(RFileOperations.java:63)
>         at org.apache.accumulo.core.file.DispatchingFileFactory.openIndex(FileOperations.java:66)
>         at org.apache.accumulo.server.util.FileUtil.reduceFiles(FileUtil.java:135)
>         at org.apache.accumulo.server.util.FileUtil.estimatePercentageLTE(FileUtil.java:207)
>         at org.apache.accumulo.tserver.Tablet.split(Tablet.java:3527)
>         at org.apache.accumulo.tserver.TabletServer.splitTablet(TabletServer.java:2648)
>         at org.apache.accumulo.tserver.TabletServer.access$1600(TabletServer.java:237)
>         at org.apache.accumulo.tserver.TabletServer$ThriftClientHandler.splitTablet(TabletServer.java:2095)
>         at sun.reflect.GeneratedMethodAccessor34.invoke(Unknown Source)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.apache.accumulo.trace.instrument.thrift.TraceWrap$1.invoke(TraceWrap.java:63)
>         at $Proxy10.splitTablet(Unknown Source)
>         at org.apache.accumulo.core.tabletserver.thrift.TabletClientService$Processor$splitTablet.getResult(TabletClientService.java:2531)
>         at org.apache.accumulo.core.tabletserver.thrift.TabletClientService$Processor$splitTablet.getResult(TabletClientService.java:2515)
>         at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:39)
>         at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:39)
>         at org.apache.accumulo.server.util.TServerUtils$TimedProcessor.process(TServerUtils.java:159)
>         at org.apache.thrift.server.AbstractNonblockingServer$FrameBuffer.invoke(AbstractNonblockingServer.java:478)
>         at org.apache.accumulo.server.util.TServerUtils$THsHaServer$Invocation.run(TServerUtils.java:214)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>         at org.apache.accumulo.trace.instrument.TraceRunnable.run(TraceRunnable.java:47)
>         at org.apache.accumulo.core.util.LoggingRunnable.run(LoggingRunnable.java:34)
>         at java.lang.Thread.run(Thread.java:662)
> {noformat}
> nn1 is the default namenode. The "Too many indexes" message may be important. That message indicates the split code entered a special code path that handles tablets with lots of files.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
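
The "Wrong FS" error in the trace above is what happens when a file that lives on one namenode (nn2) is opened through a file system object bound to another (nn1, the default). The following is a minimal, self-contained sketch of that mismatch and of the general remedy named in the commit message, choosing the file system from the file's own URI. It is not Accumulo's or Hadoop's code: `ToyFileSystem`, `fileSystemFor`, and `WrongFsSketch` are hypothetical names invented for illustration, and the check only approximates what `FileSystem.checkPath` does. In Hadoop itself, `Path.getFileSystem(Configuration)` performs this kind of per-path lookup; whether the commit uses it is not stated in this message.

```java
import java.net.URI;
import java.util.Objects;

public class WrongFsSketch {

    /** Hypothetical stand-in for a file system bound to a single namenode URI. */
    static final class ToyFileSystem {
        private final URI root;

        ToyFileSystem(URI root) {
            this.root = root;
        }

        /** Rejects paths whose scheme or authority differs from this file system's. */
        void checkPath(URI path) {
            boolean sameScheme = Objects.equals(root.getScheme(), path.getScheme());
            boolean sameAuthority = Objects.equals(root.getAuthority(), path.getAuthority());
            if (!sameScheme || !sameAuthority) {
                throw new IllegalArgumentException("Wrong FS: " + path + ", expected: " + root);
            }
        }

        /** Pretends to open the file; only succeeds if the path belongs to this file system. */
        String open(URI path) {
            checkPath(path);
            return "opened " + path.getPath() + " on " + root.getAuthority();
        }
    }

    /** Picks the file system from the path's own scheme and authority (assumed remedy). */
    static ToyFileSystem fileSystemFor(URI path) {
        return new ToyFileSystem(URI.create(path.getScheme() + "://" + path.getAuthority()));
    }

    public static void main(String[] args) {
        // Path and namenode URIs are taken from the log excerpt in the issue.
        URI file = URI.create("hdfs://nn2:9001/accumulo-1.6/tables/2/t-0000ew3/F0000ex7.rf");
        ToyFileSystem defaultFs = new ToyFileSystem(URI.create("hdfs://nn1:6093"));

        // Opening an nn2 file through the default (nn1) file system fails the check.
        try {
            defaultFs.open(file);
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }

        // Resolving the file system from the path itself succeeds.
        System.out.println(fileSystemFor(file).open(file));
    }
}
```

In a multi-namenode setup, any code path that opens existing files through a single cached default file system can fail this way, which may be why the error appeared only once the tablet had enough files to enter the index-reducing path.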