accumulo-commits mailing list archives

Subject [2/3] accumulo git commit: ACCUMULO-3766 Add some documentation about using FQDN in config files
Date Fri, 01 May 2015 20:40:14 GMT
ACCUMULO-3766 Add some documentation about using FQDN in config files


Branch: refs/heads/master
Commit: 2436eec237dc23608e91cae56777a323e2f77295
Parents: fae7a82
Author: Josh Elser <>
Authored: Fri May 1 16:39:24 2015 -0400
Committer: Josh Elser <>
Committed: Fri May 1 16:39:24 2015 -0400

 docs/src/main/asciidoc/chapters/administration.txt | 17 +++++++++++++++++
 1 file changed, 17 insertions(+)
diff --git a/docs/src/main/asciidoc/chapters/administration.txt b/docs/src/main/asciidoc/chapters/administration.txt
index 857d948..cbe7ed2 100644
--- a/docs/src/main/asciidoc/chapters/administration.txt
+++ b/docs/src/main/asciidoc/chapters/administration.txt
@@ -238,6 +238,23 @@ to load Hadoop from multiple potential locations depending on how Hadoop was ins
 recommended that `general.classpaths` is defined and limited to only the necessary jars to
 extra jars from being unintentionally loaded into Accumulo processes.
+==== Hostnames in configuration files
+Accumulo has a number of configuration files which can contain references to other hosts in your
+network. All of the "host" configuration files for Accumulo (+gc+, +masters+, +slaves+, +monitor+,
++tracers+) as well as +instance.volumes+ in accumulo-site.xml must contain some host reference.
+While IP address, short hostnames, or fully qualified domain names (FQDN) are all technically valid, it
+is good practice to always use FQDNs for both Accumulo and other processes in your Hadoop
+Failing to consistently use FQDNs can have unexpected consequences in how Accumulo uses the
+A common way for this problem can be observed is via applications that use Bulk Ingest. The
+Master coordinates moving the input files to Bulk Ingest to an Accumulo-managed directory.
+Accumulo cannot safely move files across different Hadoop FileSystems. This is problematic
+Accumulo also cannot make reliable assertions across what is the same FileSystem which is
+with different names. Naively, while might be a valid identifier for an HDFS
+Accumulo identifies +localhost:8020+ as a different HDFS instance than +
 ==== Deploy Configuration
 Copy the masters, slaves,, and if necessary, accumulo-site.xml
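
The hostname issue described in the added documentation can be illustrated with a minimal sketch. This is not Accumulo's implementation; it only shows why two names for the same HDFS instance look like two filesystems when the URI authority is compared literally. The helper `same_filesystem` and the host `namenode.example.com` are hypothetical and exist only for this example.

```python
from urllib.parse import urlsplit


def same_filesystem(uri_a: str, uri_b: str) -> bool:
    """Naive check (illustration only): two URIs are treated as the same
    filesystem only if scheme and literal authority (host:port) match."""
    a, b = urlsplit(uri_a), urlsplit(uri_b)
    return (a.scheme, a.netloc) == (b.scheme, b.netloc)


# Hypothetical values: a volume configured with a short name and a
# bulk-ingest directory referenced by FQDN on the same NameNode.
volume = "hdfs://localhost:8020/accumulo"
bulk_dir = "hdfs://namenode.example.com:8020/tmp/bulk"

# Even if both names resolve to the same machine, the literal
# comparison reports different filesystems.
print(same_filesystem(volume, bulk_dir))
```

Using the same FQDN in every configuration file and client URI avoids this mismatch, which is the practice the documentation above recommends.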
