ambari-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Aravindan Vijayan (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (AMBARI-20071) Hadoop metrics sink prints lots of logs if collector is unavailable
Date Fri, 17 Feb 2017 20:04:41 GMT

     [ https://issues.apache.org/jira/browse/AMBARI-20071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Aravindan Vijayan updated AMBARI-20071:
---------------------------------------
    Status: Patch Available  (was: Open)

> Hadoop metrics sink prints lots of logs if collector is unavailable
> -------------------------------------------------------------------
>
>                 Key: AMBARI-20071
>                 URL: https://issues.apache.org/jira/browse/AMBARI-20071
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-metrics
>    Affects Versions: 2.5.0
>            Reporter: Aravindan Vijayan
>            Assignee: Aravindan Vijayan
>            Priority: Critical
>             Fix For: 2.5.0
>
>         Attachments: AMBARI-20071.patch
>
>
> The metrics sink prints lots of such messages in Hadoop daemons log every second, which
makes logs rotates and purge fast.
> {code}
> 2017-02-16 19:05:48,896 INFO  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419))
- Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. Removing it
from list of know live collector hosts : []
> 2017-02-16 19:05:48,896 WARN  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399))
- Couldn't find any live collectors. Returning null
> 2017-02-16 19:05:48,896 WARN  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:emitMetrics(227))
- No live collector to send metrics to. Metrics to be sent will be discarded.
> 2017-02-16 19:05:50,901 WARN  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476))
- Unable to connect to collector to find live nodes.
> 2017-02-16 19:05:50,901 INFO  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419))
- Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. Removing it
from list of know live collector hosts : []
> 2017-02-16 19:05:50,901 WARN  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399))
- Couldn't find any live collectors. Returning null
> 2017-02-16 19:05:50,902 WARN  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:emitMetrics(227))
- No live collector to send metrics to. Metrics to be sent will be discarded.
> 2017-02-16 19:06:48,896 WARN  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476))
- Unable to connect to collector to find live nodes.
> 2017-02-16 19:06:48,897 INFO  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419))
- Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. Removing it
from list of know live collector hosts : []
> 2017-02-16 19:06:48,897 INFO  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:findPreferredCollectHost(359))
- No live collectors from configuration. Requesting zookeeper...
> 2017-02-16 19:06:48,988 INFO  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:findPreferredCollectHost(369))
- No new collector was found from Zookeeper. Will not request zookeeper for 120000 millis
> 2017-02-16 19:06:48,989 INFO  availability.MetricSinkWriteShardHostnameHashingStrategy
(MetricSinkWriteShardHostnameHashingStrategy.java:findCollectorShard(42)) - Calculated collector
shard ctr-e129-1487033772569-2546-01-000004.hwx.site based on hostname: ctr-e129-1487033772569-2546-01-000003.hwx.site
> 2017-02-16 19:06:59,004 INFO  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:emitMetrics(217))
- Removing collector ctr-e129-1487033772569-2546-01-000004.hwx.site from allKnownLiveCollectors.
> 2017-02-16 19:07:01,009 WARN  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:findLiveCollectorHostsFromKnownCollector(476))
- Unable to connect to collector to find live nodes.
> 2017-02-16 19:07:01,010 INFO  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:refreshCollectorsFromConfigured(419))
- Collector ctr-e129-1487033772569-2546-01-000004.hwx.site is not longer live. Removing it
from list of know live collector hosts : []
> 2017-02-16 19:07:01,010 WARN  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:findPreferredCollectHost(399))
- Couldn't find any live collectors. Returning null
> 2017-02-16 19:07:01,010 WARN  timeline.HadoopTimelineMetricsSink (AbstractTimelineMetricsSink.java:emitMetrics(227))
- No live collector to send metrics to. Metrics to be sent will be discarded.
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Mime
View raw message