ambari-dev mailing list archives

From "Jonathan Hurley" <jhur...@hortonworks.com>
Subject Review Request 31920: Python Does Not Close Alert TCP Connections Reliably
Date Wed, 11 Mar 2015 00:50:15 GMT

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/31920/
-----------------------------------------------------------

Review request for Ambari, Nate Cole and Tom Beerbower.


Bugs: AMBARI-10021
    https://issues.apache.org/jira/browse/AMBARI-10021


Repository: ambari


Description
-------

During installs, we've seen the Ambari agent's Python process holding port 50070, the NameNode (NN) HTTP port. The NN then cannot bind to the port and aborts startup.

```
[root@hdp2-02-01 hdfs]# netstat -anp | grep 50070
tcp 0 0 192.168.1.141:50070 192.168.1.141:50070 ESTABLISHED 1630/python2.6

[root@hdp2-02-01 hdfs]# ps aux | grep 1630
root 1630 2.7 1.0 837364 50508 ? Sl Mar07 114:13 /usr/bin/python2.6 /usr/lib/python2.6/site-packages/ambari_agent/main.py start restart
root 16057 0.0 0.0 103252 820 pts/0 S+ 08:54 0:00 grep 1630
```

It looks like Python does not always release connections opened through the urllib2 library, so the agent's alert checks can leave TCP connections open.
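
A minimal sketch of the kind of explicit cleanup this points toward, assuming an alert check that calls urllib2.urlopen directly. The function name fetch_status_code, the opener parameter, and the timeout default are hypothetical and are not taken from the patch; the diff linked below is the authoritative change.

```python
try:
    import urllib2  # Python 2, as used by the Ambari agent
except ImportError:
    import urllib.request as urllib2  # Python 3 fallback so the sketch runs anywhere


def fetch_status_code(url, timeout=5.0, opener=urllib2.urlopen):
    """Return the HTTP status code for url and always close the response.

    Hypothetical helper: 'opener' exists only so the sketch can be
    exercised without a network; it defaults to urllib2.urlopen.
    """
    response = None
    try:
        response = opener(url, timeout=timeout)
        return response.getcode()
    finally:
        # Close explicitly instead of relying on garbage collection
        # to release the underlying socket.
        if response is not None:
            try:
                response.close()
            except Exception:
                # A failure while closing should not mask the real result.
                pass
```

The try/finally form is used rather than a with statement because urllib2 responses on Python 2.6 do not support the context manager protocol directly.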


Diffs
-----

  ambari-agent/src/main/python/ambari_agent/alerts/metric_alert.py 83dc54d 
  ambari-agent/src/main/python/ambari_agent/alerts/web_alert.py d7a833e 
  ambari-server/src/main/resources/common-services/HDFS/2.1.0.2.0/package/alerts/alert_checkpoint_time.py 410608f
  ambari-server/src/main/resources/common-services/HDFS/2.1.0.2.0/package/alerts/alert_ha_namenode_health.py fc1541d
  ambari-server/src/main/resources/common-services/HIVE/0.12.0.2.0/package/alerts/alert_hive_metastore.py 20d8abe
  ambari-server/src/main/resources/common-services/HIVE/0.12.0.2.0/package/alerts/alert_hive_thrift_port.py 5e9ab1f
  ambari-server/src/main/resources/common-services/HIVE/0.12.0.2.0/package/alerts/alert_webhcat_server.py 0b7535c
  ambari-server/src/main/resources/common-services/YARN/2.1.0.2.0/package/alerts/alert_nodemanager_health.py 12b1bd7
  ambari-server/src/main/resources/common-services/YARN/2.1.0.2.0/package/alerts/alert_nodemanagers_summary.py 5f7abbd
  ambari-server/src/main/resources/stacks/BIGTOP/0.8/services/HDFS/package/files/alert_checkpoint_time.py 410608f
  ambari-server/src/main/resources/stacks/BIGTOP/0.8/services/HDFS/package/files/alert_ha_namenode_health.py fc1541d
  ambari-server/src/main/resources/stacks/BIGTOP/0.8/services/HIVE/package/files/alert_hive_thrift_port.py bd3f276
  ambari-server/src/main/resources/stacks/BIGTOP/0.8/services/WEBHCAT/package/files/alert_webhcat_server.py c484f0b
  ambari-server/src/main/resources/stacks/BIGTOP/0.8/services/YARN/package/files/alert_nodemanager_health.py b1de951

Diff: https://reviews.apache.org/r/31920/diff/


Testing
-------

mvn clean test

[INFO] ------------------------------------------------------------------------
[INFO] BUILD SUCCESS
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 15.011 s
[INFO] Finished at: 2015-03-10T20:42:57-04:00
[INFO] Final Memory: 10M/81M

Deployed a clean cluster with all services and verified alerts were working. Kerberized
the cluster and verified alerts were working.


Thanks,

Jonathan Hurley

