hadoop-common-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Apache Wiki <wikidi...@apache.org>
Subject [Hadoop Wiki] Update of "Hbase/Troubleshooting" by stack
Date Mon, 15 Dec 2008 21:20:58 GMT
Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Hadoop Wiki" for change notification.

The following page has been changed by stack:
http://wiki.apache.org/hadoop/Hbase/Troubleshooting

The comment on the change is:
Removed dup.

------------------------------------------------------------------------------
   1. [#1 Problem: Master initializes, but Region Servers do not]
   1. [#2 Problem: Created Root Directory for HBase through Hadoop DFS]
   1. [#3 Problem: Replay of hlog required, forcing regionserver restart]
-  2. [#4 Problem: Master initializes, but Region Servers do not]
-  1. [#5 Problem: On migration, no files in root directory]
+  1. [#4 Problem: On migration, no files in root directory]
-  1. [#6 Problem: "xceiverCount 258 exceeds the limit of concurrent xcievers 256"]
+  1. [#5 Problem: "xceiverCount 258 exceeds the limit of concurrent xcievers 256"]
-  1. [#7 Problem: "No live nodes contain current block"]
+  1. [#6 Problem: "No live nodes contain current block"]
  
  [[Anchor(1)]]
  == Problem: Master initializes, but Region Servers do not ==
@@ -67, +66 @@

  === Causes ===
   * RPC timeouts may happen because of a IO contention which blocks processes during file
swapping.
  === Resolution ===
-  * Either reduce the load or add more memory/machines.
+  * Eith
  
  [[Anchor(4)]]
- == Problem: Master initializes, but Region Servers do not ==
-  * Master's log contains repeated instances of the following block:
-   ~-INFO org.apache.hadoop.ipc.Client: Retrying connect to server: /127.0.0.1:60020. Already
tried 1 time(s).[[BR]]
-   INFO org.apache.hadoop.ipc.Client: Retrying connect to server: /127.0.0.1:60020. Already
tried 2 time(s).[[BR]]
-   ...[[BR]]
-   INFO org.apache.hadoop.ipc.RPC: Server at /127.0.0.1:60020 not available yet, Zzzzz...-~
-  * Region Servers' logs contains repeated instances of the following block:
-   ~-INFO org.apache.hadoop.ipc.Client: Retrying connect to server: masternode/192.168.100.50:60000.
Already tried 1 time(s).[[BR]]
-   INFO org.apache.hadoop.ipc.Client: Retrying connect to server: masternode/192.168.100.50:60000.
Already tried 2 time(s).[[BR]]
-   ...[[BR]]
-   INFO org.apache.hadoop.ipc.RPC: Server at masternode/192.168.100.50:60000 not available
yet, Zzzzz...-~
-  * Note that the Master believes the Region Servers have the IP of 127.0.0.1 - which is
localhost and resolves to the master's own localhost.
- === Causes ===
-  * The Region Servers are erroneously informing the Master that their IP addresses are 127.0.0.1.
- === Resolution ===
-  * Modify '''/etc/hosts''' on the region servers, from
-   {{{
- # Do not remove the following line, or various programs
- # that require network functionality will fail.
- 127.0.0.1		fully.qualified.regionservername regionservername  localhost.localdomain localhost
- ::1		localhost6.localdomain6 localhost6
- }}}
- 
-  * To (removing the master node's name from localhost)
-   {{{
- # Do not remove the following line, or various programs
- # that require network functionality will fail.
- 127.0.0.1		localhost.localdomain localhost
- ::1		localhost6.localdomain6 localhost6
- }}}
- 
- [[Anchor(5)]]
  == Problem: On migration, no files in root directory ==
   * On Startup, Master says that you need to run the hbase migrations script. Upon running
that, the hbase migrations script says no files in root directory.
  === Causes ===
@@ -118, +85 @@

  === Resolution ===
   * Either reduce the load or set dfs.datanode.max.xcievers (hadoop-site.xml) to a larger
value than the default (256). Note that in order to change the tunable, you need 0.17.2 or
0.18.0 (HADOOP-3859).
  
- [[Anchor(6)]]
+ [[Anchor(5)]]
  == Problem: "xceiverCount 258 exceeds the limit of concurrent xcievers 256" ==
   * See an exception with above message in logs (usually hadoop 0.18.x).
  === Causes ===
@@ -127, +94 @@

   * Up the maximum by setting '''dfs.datanode.max.xcievers''' (sic).  See [http://mail-archives.apache.org/mod_mbox/hadoop-hbase-user/200810.mbox/%3C20126171.post@talk.nabble.com%3E
message from jean-adrien] for some background.
  
  
- [[Anchor(7)]]
+ [[Anchor(6)]]
  == Problem: "No live nodes contain current block" ==
   * See an exception with above message in logs (usually hadoop 0.18.x).
  === Causes ===

Mime
View raw message