Return-Path: X-Original-To: apmail-ambari-dev-archive@www.apache.org Delivered-To: apmail-ambari-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A1859173EE for ; Wed, 11 Mar 2015 00:46:38 +0000 (UTC) Received: (qmail 45754 invoked by uid 500); 11 Mar 2015 00:46:38 -0000 Delivered-To: apmail-ambari-dev-archive@ambari.apache.org Received: (qmail 45713 invoked by uid 500); 11 Mar 2015 00:46:38 -0000 Mailing-List: contact dev-help@ambari.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ambari.apache.org Delivered-To: mailing list dev@ambari.apache.org Received: (qmail 45700 invoked by uid 99); 11 Mar 2015 00:46:38 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 11 Mar 2015 00:46:38 +0000 Date: Wed, 11 Mar 2015 00:46:38 +0000 (UTC) From: "Jonathan Hurley (JIRA)" To: dev@ambari.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (AMBARI-10021) Python Does Not Close Alert TCP Connections Reliably MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Jonathan Hurley created AMBARI-10021: ---------------------------------------- Summary: Python Does Not Close Alert TCP Connections Reliably Key: AMBARI-10021 URL: https://issues.apache.org/jira/browse/AMBARI-10021 Project: Ambari Issue Type: Bug Components: ambari-agent Affects Versions: 2.0.0 Reporter: Jonathan Hurley Assignee: Jonathan Hurley Priority: Critical Fix For: 2.0.0 During installs, we've seen a process bound to port 50070. This causes the NN to abort startup. This is with build: 1129 {noformat} root@hdp2-02-01 hdfs]# netstat -anp | grep 50070 tcp 0 0 192.168.1.141:50070 192.168.1.141:50070 ESTABLISHED 1630/python2.6 [root@hdp2-02-01 hdfs]# ps aux | grep 1630 root 1630 2.7 1.0 837364 50508 ? Sl Mar07 114:13 /usr/bin/python2.6 /usr/lib/python2.6/site-packages/ambari_agent/main.py start restart root 16057 0.0 0.0 103252 820 pts/0 S+ 08:54 0:00 grep 1630 {noformat} The NN Log is: {noformat} 2015-03-10 08:50:13,046 FATAL namenode.NameNode (NameNode.java:main(1509)) - Failed to start namenode. java.net.BindException: Port in use: 192.168.1.141:50070 at org.apache.hadoop.http.HttpServer2.openListeners(HttpServer2.java:891) at org.apache.hadoop.http.HttpServer2.start(HttpServer2.java:827) at org.apache.hadoop.hdfs.server.namenode.NameNodeHttpServer.start(NameNodeHtt pServer.java:142) at org.apache.hadoop.hdfs.server.namenode.NameNode.startHttpServer(NameNode.ja va:703) at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:59 0) at org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:762) at org.apache.hadoop.hdfs.server.namenode.NameNode.(NameNode.java:746) at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.jav a:1438) at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1504) Caused by: java.net.BindException: Address already in use at sun.nio.ch.Net.bind0(Native Method) at sun.nio.ch.Net.bind(Net.java:444) at sun.nio.ch.Net.bind(Net.java:436) at sun.nio.ch.ServerSocketChannelImpl.bind(ServerSocketChannelImpl.java:214) at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:74) at org.mortbay.jetty.nio.SelectChannelConnector.open(SelectChannelConnector.ja va:216) at org.apache.hadoop.http.HttpServer2.openListeners(HttpServer2.java:886) ... 8 more 2015-03-10 08:50:13,056 INFO util.ExitUtil (ExitUtil.java:terminate(124)) - Exiting with status 1 2015-03-10 08:50:13,068 INFO namenode.NameNode (StringUtils.java:run(659)) - SHUTDOWN_MSG: /************************************************************ SHUTDOWN_MSG: Shutting down NameNode at 192.168.1.141/192.168.1.141 ************************************************************/ {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)