geode-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bruce Schuchardt (JIRA)" <>
Subject [jira] [Resolved] (GEODE-7038) After auto-reconnect a server's multicat communications aren't working correctly
Date Fri, 02 Aug 2019 18:15:00 GMT


Bruce Schuchardt resolved GEODE-7038.
    Resolution: Fixed

> After auto-reconnect a server's multicat communications aren't working correctly
> --------------------------------------------------------------------------------
>                 Key: GEODE-7038
>                 URL:
>             Project: Geode
>          Issue Type: Bug
>          Components: membership, messaging
>            Reporter: Bruce Schuchardt
>            Assignee: Bruce Schuchardt
>            Priority: Major
>          Time Spent: 50m
>  Remaining Estimate: 0h
> This was observed in an server having multicast enabled on a Region.  The server went
into a GC pause and was kicked out of the cluster.  After auto-reconnecting all of the servers
were requested to shut down and they all hung on destroy-region message responses.  Statistics
showed constant multicast retransmission requests but no retransmissions being sent.
> When a Region is configured to use multicast all of its cache operation messages are
multicast, including a destroy-region message.
> Some time ago we decided to stop sending Join Request Responses during discovery.  These
messages were responsible for carrying the JGroups multicast message digest so that a joining
member could install this digest into its multicast protocol.  Today these messages are only
sent if a UDP Diffie-Hellman algorithm has been specified.  We need to also ensure that we
send these messages if multicast is enabled.

This message was sent by Atlassian JIRA

View raw message