flume-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From iain wright <iainw...@gmail.com>
Subject Re: Flume HbaseSink ZK woes
Date Mon, 08 Oct 2012 21:14:32 GMT
Hi Hari,

Getting similar (but seemingly more serious/frequent) results with the
ASync Sink, I'll hit the hbase list as well. Thank you

iating client connection,
connectString=hbasemaster0.hadoop.domain.tvsessionTimeout=5000
watcher=org.hbase.async.HBaseClient$ZKCl
:509)] EventThread shut
down

2012-10-08 14:09:14,978 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
2012-10-08 14:09:14,986 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
:2181, initiating
session

11.30:2181, sessionid = 0x138a0620045bd47, negotiated timeout =
6000
on: 0x138a0620045bd47
closed

iating client connection,
connectString=hbasemaster0.hadoop.domain.tvsessionTimeout=5000
watcher=org.hbase.async.HBaseClient$ZKCl
:509)] EventThread shut
down

2012-10-08 14:09:19,108 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
2012-10-08 14:09:19,110 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
:2181, initiating
session

11.30:2181, sessionid = 0x138a0620045bd48, negotiated timeout =
6000
on: 0x138a0620045bd48
closed

iating client connection,
connectString=hbasemaster0.hadoop.domain.tvsessionTimeout=5000
watcher=org.hbase.async.HBaseClient$ZKCl
:509)] EventThread shut
down

2012-10-08 14:09:23,235 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
2012-10-08 14:09:23,238 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
:2181, initiating
session

11.30:2181, sessionid = 0x138a0620045bd49, negotiated timeout =
6000
on: 0x138a0620045bd49
closed

iating client connection,
connectString=hbasemaster0.hadoop.domain.tvsessionTimeout=5000
watcher=org.hbase.async.HBaseClient$ZKCl
:509)] EventThread shut
down

2012-10-08 14:09:27,368 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
2012-10-08 14:09:27,373 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
:2181, initiating
session

11.30:2181, sessionid = 0x138a0620045bd4d, negotiated timeout =
6000
on: 0x138a0620045bd4d
closed

iating client connection,
connectString=hbasemaster0.hadoop.domain.tvsessionTimeout=5000
watcher=org.hbase.async.HBaseClient$ZKCl
:509)] EventThread shut
down

2012-10-08 14:09:31,500 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
2012-10-08 14:09:31,504 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
:2181, initiating
session

11.30:2181, sessionid = 0x138a0620045bd4e, negotiated timeout =
6000
on: 0x138a0620045bd4e
closed

iating client connection,
connectString=hbasemaster0.hadoop.domain.tvsessionTimeout=5000
watcher=org.hbase.async.HBaseClient$ZKCl
:509)] EventThread shut
down

2012-10-08 14:09:35,660 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
2012-10-08 14:09:35,668 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
:2181, initiating
session

11.30:2181, sessionid = 0x138a0620045bd4f, negotiated timeout =
6000
on: 0x138a0620045bd4f
closed

iating client connection,
connectString=hbasemaster0.hadoop.domain.tvsessionTimeout=5000
watcher=org.hbase.async.HBaseClient$ZKCl
:509)] EventThread shut
down

2012-10-08 14:09:39,815 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
2012-10-08 14:09:39,818 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
:2181, initiating
session

11.30:2181, sessionid = 0x138a0620045bd50, negotiated timeout =
6000
on: 0x138a0620045bd50
closed

iating client connection,
connectString=hbasemaster0.hadoop.domain.tvsessionTimeout=5000
watcher=org.hbase.async.HBaseClient$ZKCl
:509)] EventThread shut
down

2012-10-08 14:09:43,947 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
2012-10-08 14:09:43,952 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
ntCnxn$SendThread.primeConnection(ClientCnxn.java:849)] Socket connection
established to hbasemaster0.hadoop.domain.tv/10.10.11.30
:2181, initiating
session

2012-10-08 14:09:43,968 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
ntCnxn$SendThread.onConnected(ClientCnxn.java:1207)] Session establishment
complete on server hbasemaster0.hadoop.domain.tv/10.10.
11.30:2181, sessionid = 0x138a0620045bd51, negotiated timeout =
6000
2012-10-08 14:09:48,069 (lifecycleSupervisor-1-1-EventThread) [INFO -
org.apache.zookeeper.ZooKeeper.close(ZooKeeper.java:679)] Sessi
on: 0x138a0620045bd51
closed

2012-10-08 14:09:48,069 (lifecycleSupervisor-1-1-EventThread) [INFO -
org.apache.zookeeper.ZooKeeper.<init>(ZooKeeper.java:433)] Init
iating client connection,
connectString=hbasemaster0.hadoop.domain.tvsessionTimeout=5000
watcher=org.hbase.async.HBaseClient$ZKCl
ient@66e43eb8

2012-10-08 14:09:48,070 (lifecycleSupervisor-1-1-EventThread) [INFO -
org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java
:509)] EventThread shut
down

2012-10-08 14:09:48,070 (lifecycleSupervisor-1-1-EventThread) [ERROR -
org.hbase.async.HBaseClient$ZKClient$2.processResult(HBaseClie
nt.java:2407)] Looks like our ZK session expired or is broken, rc=-4:
CONNECTIONLOSS
2012-10-08 14:09:48,071 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
ntCnxn$SendThread.logStartConnect(ClientCnxn.java:966)] Opening socket
connection to server hbasemaster0.hadoop.domain.tv/10.10.11
.30:2181. Will not attempt to authenticate using SASL (unknown
error)
2012-10-08 14:09:48,072 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
ntCnxn$SendThread.primeConnection(ClientCnxn.java:849)] Socket connection
established to hbasemaster0.hadoop.domain.tv/10.10.11.30
:2181, initiating
session

2012-10-08 14:09:48,080 (lifecycleSupervisor-1-1-SendThread(
hbasemaster0.hadoop.domain.tv:2181)) [INFO - org.apache.zookeeper.Clie
ntCnxn$SendThread.onConnected(ClientCnxn.java:1207)] Session establishment
complete on server hbasemaster0.hadoop.domain.tv/10.10.
11.30:2181, sessionid = 0x138a0620045bd52, negotiated timeout =
6000

-- 
Iain Wright
Cell: (562) 852-5916

<http://www.labctsi.org/>
This email message is confidential, intended only for the recipient(s)
named above and may contain information that is privileged, exempt from
disclosure under applicable law. If you are not the intended recipient, do
not disclose or disseminate the message to anyone except the intended
recipient. If you have received this message in error, or are not the named
recipient(s), please immediately notify the sender by return email, and
delete all copies of this message.


On Mon, Oct 8, 2012 at 1:37 PM, Hari Shreedharan
<hshreedharan@cloudera.com>wrote:

> Hi Iain,
>
> I am not too sure of this issue. It looks like something to do with the
> HBaseClient. Can you try pinging HBase user list and see if this is a known
> issue? Did you try the async hbase sink? That is the recommended sink, I's
> suggest using that.
>
> Thanks,
> Hari
>
> --
> Hari Shreedharan
>
> On Monday, October 8, 2012 at 12:29 PM, iain wright wrote:
>
> We're having some trouble with the Flume & the HbaseSink. Seems we cannot
> hang on to zookeeper sessions. Using 1.3 ng, hbase 0.94. Hbase & Zoo both
> look fine, Don't think we are hitting our
> hbase.zookeeper.property.maxClientCnxns as we only have about 95 sesions
> and I believe it defaults to 2k. I can establish new sessions using the CLI
> from the same server and idle there for 10 minutes without getting dropped.
>
> Flume & zookeeper logs below, using the same hadoop & hbase directories
> from our regionserver's.
>
> *Searched the list, this user appeared to have the same problem, not sure
> how he fixed it though:*
>
> http://mail-archives.apache.org/mod_mbox/flume-user/201207.mbox/%3CCADGpRhcgztwhuUJi1izMRX2UOKeAWwhYOkKSKkHYmKopAkwHAQ@mail.gmail.com%3E
>
> *startup cmd
> */app/apache-flume-1.3.0-SNAPSHOT/bin/flume-ng agent -n agent1 -c ./conf
> -f conf/brian.properties  -Dflume.root.logger=INFO,console*
> *
> *flume-env.sh*
> $ cat conf/flume-env.sh
>
> # Licensed to the Apache Software Foundation (ASF) under one
> # or more contributor license agreements.  See the NOTICE file
> # distributed with this work for additional information
> # regarding copyright ownership.  The ASF licenses this file
> # to you under the Apache License, Version 2.0 (the
> # "License"); you may not use this file except in compliance
> # with the License.  You may obtain a copy of the License at
> #
> #     http://www.apache.org/licenses/LICENSE-2.0
> #
> # Unless required by applicable law or agreed to in writing, software
> # distributed under the License is distributed on an "AS IS" BASIS,
> # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
> # See the License for the specific language governing permissions and
> # limitations under the License.
>
> # If this file is placed at FLUME_CONF_DIR/flume-env.sh, it will be
> sourced
> # during Flume startup.
>
> # Enviroment variables can be set here.
>
> #JAVA_HOME=/usr/lib/jvm/java-6-sun
>
> # Give Flume more memory and pre-allocate, enable remote monitoring via JMX
> #JAVA_OPTS="-Xms100m -Xmx200m -Dcom.sun.management.jmxremote"
>
> # Note that the Flume conf directory is always included in the classpath.
> #FLUME_CLASSPATH="/app/flume/lib"
>
> FLUME_CLASSPATH="/app/apache-flume-1.3.0-SNAPSHOT/lib"
>
> HBASE_HOME="/app/hbase-0.94.0"
>
> HADOOP_HOME="/app/hadoop-1.0.1"
>
>
> *config*
> $ cat conf/brian.properties
> #example.conf: A single-node Flume configuration
>
> # Name the components on this agent
> agent1.sources = source1
> agent1.sinks = sink1
> agent1.channels = channel1
>
> # Describe/configure source1
> agent1.sources.source1.type = exec
> agent1.sources.source1.command = tail -F /tank/log_dev.log
> agent1.sources.source1.batchSize = 1
> # Describe sink1
> #agent1.sinks.sink1.type = logger
> agent1.sinks.sink1.type = org.apache.flume.sink.hbase.HBaseSink
> agent1.sinks.sink1.table = brian_test
> agent1.sinks.sink1.columnFamily = f1
> #agent1.sinks.sink1.serializer =
> org.apache.flume.sink.hbase.SimpleHBaseEventSerializer
> #agent1.sinks.sink1.serializer =
> org.apache.flume.sink.hbase.SimpleHbaseEventSerializer
>
>
> # Use a channel which buffers events in memory
> agent1.channels.channel1.type = memory
> agent1.channels.channel1.capacity = 1000
> agent1.channels.channel1.transactionCapactiy = 100
>
> # Bind the source and sink to the channel
> agent1.sources.source1.channels = channel1
> agent1.sinks.sink1.channel = channel1
>
>
> *flume console log*
> 2012-10-08 12:06:19,007 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.Environment.logEnv(Environment.java:100)] Client
> environment:java.library.path=:/app/hadoop-1.0.1/libexec/../lib/native/FreeBSD-amd64-64:/app/hbase-0.94.0/bin/../lib/native/FreeBSD-amd64-64
> 2012-10-08 12:06:19,008 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.Environment.logEnv(Environment.java:100)] Client
> environment:java.io.tmpdir=/var/tmp/
> 2012-10-08 12:06:19,008 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.Environment.logEnv(Environment.java:100)] Client
> environment:java.compiler=<NA>
> 2012-10-08 12:06:19,008 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.Environment.logEnv(Environment.java:100)] Client
> environment:os.name=FreeBSD
> 2012-10-08 12:06:19,008 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.Environment.logEnv(Environment.java:100)] Client
> environment:os.arch=amd64
> 2012-10-08 12:06:19,008 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.Environment.logEnv(Environment.java:100)] Client
> environment:os.version=9.0-RELEASE
> 2012-10-08 12:06:19,009 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.Environment.logEnv(Environment.java:100)] Client
> environment:user.name=czhang
> 2012-10-08 12:06:19,009 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.Environment.logEnv(Environment.java:100)] Client
> environment:user.home=/users/czhang
> 2012-10-08 12:06:19,009 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.Environment.logEnv(Environment.java:100)] Client
> environment:user.dir=/app/apache-flume-1.3.0-SNAPSHOT
> 2012-10-08 12:06:19,010 (lifecycleSupervisor-1-1) [INFO -
> org.apache.zookeeper.ZooKeeper.<init>(ZooKeeper.java:433)] Initiating
> client connection, connectString=hbasemaster0.hadoop.domain.com:2181sessionTimeout=180000
watcher=hconnection
> 2012-10-08 12:06:19,036 (lifecycleSupervisor-1-1) [INFO -
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.<init>(RecoverableZooKeeper.java:97)]
> The identifier of this process is 12494@app14.app.domain.com
> 2012-10-08 12:06:19,041 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.logStartConnect(ClientCnxn.java:966)]
> Opening socket connection to server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181. Will not attempt to
> authenticate using SASL (unknown error)
> 2012-10-08 12:06:19,049 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.primeConnection(ClientCnxn.java:849)]
> Socket connection established to
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, initiating session
> 2012-10-08 12:06:19,072 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.onConnected(ClientCnxn.java:1207)]
> Session establishment complete on server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, sessionid =
> 0x138a0620045bbce, negotiated timeout = 180000
> 2012-10-08 12:08:19,070 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1083)]
> Client session timed out, have not heard from server in 120001ms for
> sessionid 0x138a0620045bbce, closing socket connection and attempting
> reconnect
> 2012-10-08 12:08:19,233 (lifecycleSupervisor-1-1) [WARN -
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.retryOrThrow(RecoverableZooKeeper.java:195)]
> Possibly transient ZooKeeper exception:
> org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for /hbase/master
> 2012-10-08 12:08:19,234 (lifecycleSupervisor-1-1) [INFO -
> org.apache.hadoop.hbase.util.RetryCounter.sleepUntilNextRetry(RetryCounter.java:53)]
> Sleeping 2000ms before retry #1...
> 2012-10-08 12:08:20,691 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.logStartConnect(ClientCnxn.java:966)]
> Opening socket connection to server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181. Will not attempt to
> authenticate using SASL (unknown error)
> 2012-10-08 12:08:20,692 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.primeConnection(ClientCnxn.java:849)]
> Socket connection established to
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, initiating session
> 2012-10-08 12:08:20,695 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.onConnected(ClientCnxn.java:1207)]
> Session establishment complete on server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, sessionid =
> 0x138a0620045bbce, negotiated timeout = 180000
> 2012-10-08 12:10:20,695 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1083)]
> Client session timed out, have not heard from server in 120000ms for
> sessionid 0x138a0620045bbce, closing socket connection and attempting
> reconnect
> 2012-10-08 12:10:20,796 (lifecycleSupervisor-1-1) [WARN -
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.retryOrThrow(RecoverableZooKeeper.java:195)]
> Possibly transient ZooKeeper exception:
> org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for /hbase/master
> 2012-10-08 12:10:20,796 (lifecycleSupervisor-1-1) [INFO -
> org.apache.hadoop.hbase.util.RetryCounter.sleepUntilNextRetry(RetryCounter.java:53)]
> Sleeping 4000ms before retry #2...
> 2012-10-08 12:10:22,792 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.logStartConnect(ClientCnxn.java:966)]
> Opening socket connection to server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181. Will not attempt to
> authenticate using SASL (unknown error)
> 2012-10-08 12:10:22,794 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.primeConnection(ClientCnxn.java:849)]
> Socket connection established to
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, initiating session
> 2012-10-08 12:10:22,799 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.onConnected(ClientCnxn.java:1207)]
> Session establishment complete on server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, sessionid =
> 0x138a0620045bbce, negotiated timeout = 180000
> 2012-10-08 12:12:22,800 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1083)]
> Client session timed out, have not heard from server in 120001ms for
> sessionid 0x138a0620045bbce, closing socket connection and attempting
> reconnect
> 2012-10-08 12:12:22,902 (lifecycleSupervisor-1-1) [WARN -
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.retryOrThrow(RecoverableZooKeeper.java:195)]
> Possibly transient ZooKeeper exception:
> org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for /hbase/master
> 2012-10-08 12:12:22,903 (lifecycleSupervisor-1-1) [INFO -
> org.apache.hadoop.hbase.util.RetryCounter.sleepUntilNextRetry(RetryCounter.java:53)]
> Sleeping 8000ms before retry #3...
> 2012-10-08 12:12:24,289 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.logStartConnect(ClientCnxn.java:966)]
> Opening socket connection to server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181. Will not attempt to
> authenticate using SASL (unknown error)
> 2012-10-08 12:12:24,291 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.primeConnection(ClientCnxn.java:849)]
> Socket connection established to
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, initiating session
> 2012-10-08 12:12:24,294 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.onConnected(ClientCnxn.java:1207)]
> Session establishment complete on server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, sessionid =
> 0x138a0620045bbce, negotiated timeout = 180000
> 2012-10-08 12:14:24,294 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1083)]
> Client session timed out, have not heard from server in 120000ms for
> sessionid 0x138a0620045bbce, closing socket connection and attempting
> reconnect
> 2012-10-08 12:14:24,395 (lifecycleSupervisor-1-1) [WARN -
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.retryOrThrow(RecoverableZooKeeper.java:195)]
> Possibly transient ZooKeeper exception:
> org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for /hbase/master
> 2012-10-08 12:14:24,395 (lifecycleSupervisor-1-1) [ERROR -
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.retryOrThrow(RecoverableZooKeeper.java:197)]
> ZooKeeper exists failed after 3 retries
> 2012-10-08 12:14:24,398 (lifecycleSupervisor-1-1) [WARN -
> org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndCheckExists(ZKUtil.java:239)]
> hconnection-0x138a0620045bbce-0x138a0620045bbce-0x138a0620045bbce-0x138a0620045bbce
> Unable to set watcher on znode /hbase/master
> org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for /hbase/master
>         at
> org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>         at
> org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>         at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1036)
>         at
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:150)
>         at
> org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndCheckExists(ZKUtil.java:230)
>         at
> org.apache.hadoop.hbase.zookeeper.ZooKeeperNodeTracker.start(ZooKeeperNodeTracker.java:82)
>         at
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.ensureZookeeperTrackers(HConnectionManager.java:590)
>         at
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.locateRegion(HConnectionManager.java:818)
>         at
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.locateRegion(HConnectionManager.java:801)
>         at
> org.apache.hadoop.hbase.client.HTable.finishSetup(HTable.java:234)
>         at org.apache.hadoop.hbase.client.HTable.<init>(HTable.java:174)
>         at org.apache.hadoop.hbase.client.HTable.<init>(HTable.java:133)
>         at org.apache.flume.sink.hbase.HBaseSink.start(HBaseSink.java:107)
>         at
> org.apache.flume.sink.DefaultSinkProcessor.start(DefaultSinkProcessor.java:46)
>         at org.apache.flume.SinkRunner.start(SinkRunner.java:79)
>         at
> org.apache.flume.lifecycle.LifecycleSupervisor$MonitorRunnable.run(LifecycleSupervisor.java:236)
>         at
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
>         at
> java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:351)
>         at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:178)
>         at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)
>         at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
>         at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
>         at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
>         at java.lang.Thread.run(Thread.java:722)
> 2012-10-08 12:14:24,405 (lifecycleSupervisor-1-1) [ERROR -
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.keeperException(ZooKeeperWatcher.java:408)]
> hconnection-0x138a0620045bbce-0x138a0620045bbce-0x138a0620045bbce-0x138a0620045bbce
> Received unexpected KeeperException, re-throwing exception
> org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for /hbase/master
>         at
> org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>         at
> org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>         at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1036)
>         at
> org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:150)
>         at
> org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndCheckExists(ZKUtil.java:230)
>         at
> org.apache.hadoop.hbase.zookeeper.ZooKeeperNodeTracker.start(ZooKeeperNodeTracker.java:82)
>         at
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.ensureZookeeperTrackers(HConnectionManager.java:590)
>         at
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.locateRegion(HConnectionManager.java:818)
>         at
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.locateRegion(HConnectionManager.java:801)
>         at
> org.apache.hadoop.hbase.client.HTable.finishSetup(HTable.java:234)
>         at org.apache.hadoop.hbase.client.HTable.<init>(HTable.java:174)
>         at org.apache.hadoop.hbase.client.HTable.<init>(HTable.java:133)
>         at org.apache.flume.sink.hbase.HBaseSink.start(HBaseSink.java:107)
>         at
> org.apache.flume.sink.DefaultSinkProcessor.start(DefaultSinkProcessor.java:46)
>         at org.apache.flume.SinkRunner.start(SinkRunner.java:79)
>         at
> org.apache.flume.lifecycle.LifecycleSupervisor$MonitorRunnable.run(LifecycleSupervisor.java:236)
>         at
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
>         at
> java.util.concurrent.FutureTask$Sync.innerRunAndReset(FutureTask.java:351)
>         at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:178)
>         at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:178)
>         at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
>         at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
>         at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
>         at java.lang.Thread.run(Thread.java:722)
> 2012-10-08 12:14:24,407 (lifecycleSupervisor-1-1) [INFO -
> org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.abort(HConnectionManager.java:1654)]
> This client just lost it's session with ZooKeeper, will automatically
> reconnect when needed.
> 2012-10-08 12:14:26,166 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.logStartConnect(ClientCnxn.java:966)]
> Opening socket connection to server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181. Will not attempt to
> authenticate using SASL (unknown error)
> 2012-10-08 12:14:26,167 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.primeConnection(ClientCnxn.java:849)]
> Socket connection established to
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, initiating session
> 2012-10-08 12:14:26,170 (lifecycleSupervisor-1-1-SendThread(
> hbasemaster0.hadoop.domain.com:2181)) [INFO -
> org.apache.zookeeper.ClientCnxn$SendThread.onConnected(ClientCnxn.java:1207)]
> Session establishment complete on server
> hbasemaster0.hadoop.domain.com/10.10.11.30:2181, sessionid =
> 0x138a0620045bbce, negotiated timeout = 180000
>
> *zookeeper log*
> 2012-10-08 19:06:19,049 INFO
> org.apache.zookeeper.server.NIOServerCnxnFactory: Accepted socket
> connection from /10.10.10.64:16755
> 2012-10-08 19:06:19,053 INFO org.apache.zookeeper.server.ZooKeeperServer:
> Client attempting to establish new session at /10.10.10.64:16755
> 2012-10-08 19:06:19,069 INFO org.apache.zookeeper.server.ZooKeeperServer:
> Established session 0x138a0620045bbce with negotiated timeout 180000 for
> client /10.10.10.64:16755
> 2012-10-08 19:08:19,072 INFO org.apache.zookeeper.server.NIOServerCnxn:
> Closed socket connection for client /10.10.10.64:16755 which had
> sessionid 0x138a0620045bbce
> 2012-10-08 19:08:20,692 INFO
> org.apache.zookeeper.server.NIOServerCnxnFactory: Accepted socket
> connection from /10.10.10.64:33245
> 2012-10-08 19:08:20,693 INFO org.apache.zookeeper.server.ZooKeeperServer:
> Client attempting to renew session 0x138a0620045bbce at /10.10.10.64:33245
> 2012-10-08 19:08:20,694 INFO org.apache.zookeeper.server.ZooKeeperServer:
> Established session 0x138a0620045bbce with negotiated timeout 180000 for
> client /10.10.10.64:33245
> 2012-10-08 19:10:20,695 WARN org.apache.zookeeper.server.NIOServerCnxn:
> caught end of stream exception
> EndOfStreamException: Unable to read additional data from client sessionid
> 0x138a0620045bbce, likely client has closed socket
>     at
> org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:220)
>     at
> org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:224)
>     at java.lang.Thread.run(Thread.java:619)
> 2012-10-08 19:10:20,698 INFO org.apache.zookeeper.server.NIOServerCnxn:
> Closed socket connection for client /10.10.10.64:33245 which had
> sessionid 0x138a0620045bbce
> 2012-10-08 19:10:22,794 INFO
> org.apache.zookeeper.server.NIOServerCnxnFactory: Accepted socket
> connection from /10.10.10.64:44993
> 2012-10-08 19:10:22,796 INFO org.apache.zookeeper.server.ZooKeeperServer:
> Client attempting to renew session 0x138a0620045bbce at /10.10.10.64:44993
> 2012-10-08 19:10:22,798 INFO org.apache.zookeeper.server.ZooKeeperServer:
> Established session 0x138a0620045bbce with negotiated timeout 180000 for
> client /10.10.10.64:44993
>
>
> Thanks for the continued help from the community on getting us going
> w/flume
>
> Cheers,
>
> --
> Iain Wright
>
>
> This email message is confidential, intended only for the recipient(s)
> named above and may contain information that is privileged, exempt from
> disclosure under applicable law. If you are not the intended recipient, do
> not disclose or disseminate the message to anyone except the intended
> recipient. If you have received this message in error, or are not the named
> recipient(s), please immediately notify the sender by return email, and
> delete all copies of this message.
>
> --
> Iain Wright
>
>
> <http://www.labctsi.org/>
> This email message is confidential, intended only for the recipient(s)
> named above and may contain information that is privileged, exempt from
> disclosure under applicable law. If you are not the intended recipient, do
> not disclose or disseminate the message to anyone except the intended
> recipient. If you have received this message in error, or are not the named
> recipient(s), please immediately notify the sender by return email, and
> delete all copies of this message.
>
>
>

Mime
View raw message