Return-Path: X-Original-To: apmail-accumulo-notifications-archive@minotaur.apache.org Delivered-To: apmail-accumulo-notifications-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6B40E10A1F for ; Thu, 23 Jan 2014 20:07:45 +0000 (UTC) Received: (qmail 35474 invoked by uid 500); 23 Jan 2014 20:07:42 -0000 Delivered-To: apmail-accumulo-notifications-archive@accumulo.apache.org Received: (qmail 35409 invoked by uid 500); 23 Jan 2014 20:07:40 -0000 Mailing-List: contact notifications-help@accumulo.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: jira@apache.org Delivered-To: mailing list notifications@accumulo.apache.org Received: (qmail 35383 invoked by uid 99); 23 Jan 2014 20:07:40 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 Jan 2014 20:07:40 +0000 Date: Thu, 23 Jan 2014 20:07:40 +0000 (UTC) From: "Christopher Tubbs (JIRA)" To: notifications@accumulo.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Reopened] (ACCUMULO-2234) Cannot run offline mapreduce over non-default instance.dfs.dir value MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/ACCUMULO-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christopher Tubbs reopened ACCUMULO-2234: ----------------------------------------- Implementation should not add a dependency on server configuration files which cannot assumed to be known by the launching process. It should use conn.instanceOperations().getSiteConfiguration() to get the configuration via thrift, without additional classpath dependencies on server configuration files. Also, is this really a blocker? > Cannot run offline mapreduce over non-default instance.dfs.dir value > -------------------------------------------------------------------- > > Key: ACCUMULO-2234 > URL: https://issues.apache.org/jira/browse/ACCUMULO-2234 > Project: Accumulo > Issue Type: Bug > Affects Versions: 1.4.4, 1.5.0 > Reporter: Josh Elser > Assignee: Josh Elser > Priority: Blocker > Fix For: 1.4.5, 1.5.1, 1.6.0 > > > The javadoc for setting up offline scans over RFiles (InputFormatBase.setScanOffline in 1.4 or InputFormatBase.setOfflineTableScan in 1.5) includes a nice little comment to the effect that if a "non-standard" directory is used for Accumulo in HDFS (read as, if the default value for instance.dfs.dir), accumulo-site.xml may need to be on the classpath for the mappers. > Best as I can tell, even if accumulo-site.xml is on the classpath, it makes no difference as InputFormatBase is creating a new ZooKeeperInstance which, in turn, will only ever make a DefaultConfiguration and never try to check if an accumulo-site.xml file is available. This would make it impossible for a non-default value for instance.dfs.dir to ever be used. -- This message was sent by Atlassian JIRA (v6.1.5#6160)