hadoop-hdfs-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Elek, Marton (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (HDDS-343) Containers are stuck in closing state in scm
Date Thu, 16 Aug 2018 07:10:00 GMT

     [ https://issues.apache.org/jira/browse/HDDS-343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Elek, Marton updated HDDS-343:
------------------------------
    Attachment: HDDS-343.003.patch

> Containers are stuck in closing state in scm
> --------------------------------------------
>
>                 Key: HDDS-343
>                 URL: https://issues.apache.org/jira/browse/HDDS-343
>             Project: Hadoop Distributed Data Store
>          Issue Type: Bug
>          Components: SCM
>            Reporter: Elek, Marton
>            Assignee: Elek, Marton
>            Priority: Blocker
>             Fix For: 0.2.1
>
>         Attachments: HDDS-343.001.patch, HDDS-343.002.patch, HDDS-343.003.patch
>
>
> Containers could not been closed currently.
> The datanode is closing the containers and sending the CLOSED state in the container
report but SCM doesn't register that the state is closed and sending the close command again
and again.
> I think the ContainerMapping.processContainerReport should be improved.
> {code}
> scm_1           | --> RPC message request: SCMHeartbeatRequestProto from 172.25.0.2:33912
> scm_1           | datanodeDetails {
> scm_1           |   uuid: "9c8f80bd-9424-4d74-99ef-a2bd58e66d7f"
> scm_1           |   ipAddress: "172.25.0.2"
> scm_1           |   hostName: "365fd1f44f0b"
> scm_1           |   ports {
> scm_1           |     name: "STANDALONE"
> scm_1           |     value: 9859
> scm_1           |   }
> scm_1           |   ports {
> scm_1           |     name: "RATIS"
> scm_1           |     value: 9858
> scm_1           |   }
> scm_1           |   ports {
> scm_1           |     name: "REST"
> scm_1           |     value: 9880
> scm_1           |   }
> scm_1           | }
> scm_1           | nodeReport {
> scm_1           |   storageReport {
> scm_1           |     storageUuid: "DS-61e76107-85c5-437a-95a7-aeb8b3e7827f"
> scm_1           |     storageLocation: "/tmp/hadoop-hadoop/dfs/data"
> scm_1           |     capacity: 491630870528
> scm_1           |     scmUsed: 2708828160
> scm_1           |     remaining: 24263614464
> scm_1           |     storageType: DISK
> scm_1           |     failed: false
> scm_1           |   }
> scm_1           | }
> scm_1           | containerReport {
> scm_1           |   reports {
> scm_1           |     containerID: 1
> scm_1           |     used: 1061158912
> scm_1           |     readCount: 0
> scm_1           |     writeCount: 64
> scm_1           |     readBytes: 0
> scm_1           |     writeBytes: 1061158912
> scm_1           |     state: CLOSED
> scm_1           |   }
> scm_1           |   reports {
> scm_1           |     containerID: 2
> scm_1           |     used: 1048576000
> scm_1           |     readCount: 0
> scm_1           |     writeCount: 64
> scm_1           |     readBytes: 0
> scm_1           |     writeBytes: 1048576000
> scm_1           |     state: CLOSED
> scm_1           |   }
> scm_1           |   reports {
> scm_1           |     containerID: 3
> scm_1           |     used: 511705088
> scm_1           |     readCount: 0
> scm_1           |     writeCount: 32
> scm_1           |     readBytes: 0
> scm_1           |     writeBytes: 511705088
> scm_1           |     state: OPEN
> scm_1           |   }
> scm_1           | }
> scm_1           | commandStatusReport {
> scm_1           | }
> scm_1           | containerActions {
> scm_1           |   containerActions {
> scm_1           |     containerID: 1
> scm_1           |     action: CLOSE
> scm_1           |     reason: CONTAINER_FULL
> scm_1           |   }
> scm_1           |   containerActions {
> scm_1           |     containerID: 2
> scm_1           |     action: CLOSE
> scm_1           |     reason: CONTAINER_FULL
> scm_1           |   }
> scm_1           | }
> scm_1           | 
> scm_1           | --> RPC message response: SCMHeartbeatRequestProto to 172.25.0.2:33912
> scm_1           | datanodeUUID: "9c8f80bd-9424-4d74-99ef-a2bd58e66d7f"
> scm_1           | 
> scm_1           | 2018-08-08 16:22:51 INFO  CloseContainerEventHandler:56 - Close container
Event triggered for container : 1
> scm_1           | 2018-08-08 16:22:51 INFO  CloseContainerEventHandler:105 - container
with id : 1 is in CLOSING state and need not be closed.
> scm_1           | 2018-08-08 16:22:51 INFO  CloseContainerEventHandler:56 - Close container
Event triggered for container : 2
> scm_1           | 2018-08-08 16:22:51 INFO  CloseContainerEventHandler:105 - container
with id : 2 is in CLOSING state and need not be closed.
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-help@hadoop.apache.org


Mime
View raw message