From issues-return-68713-archive-asf-public=cust-asf.ponee.io@ignite.apache.org Wed Jun 27 18:06:05 2018 Return-Path: X-Original-To: archive-asf-public@cust-asf.ponee.io Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx-eu-01.ponee.io (Postfix) with SMTP id 14B2F180625 for ; Wed, 27 Jun 2018 18:06:04 +0200 (CEST) Received: (qmail 13576 invoked by uid 500); 27 Jun 2018 16:06:04 -0000 Mailing-List: contact issues-help@ignite.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@ignite.apache.org Delivered-To: mailing list issues@ignite.apache.org Received: (qmail 13567 invoked by uid 99); 27 Jun 2018 16:06:04 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 27 Jun 2018 16:06:04 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id BAB85C174D for ; Wed, 27 Jun 2018 16:06:03 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -110.301 X-Spam-Level: X-Spam-Status: No, score=-110.301 tagged_above=-999 required=6.31 tests=[ENV_AND_HDR_SPF_MATCH=-0.5, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001, USER_IN_DEF_SPF_WL=-7.5, USER_IN_WHITELIST=-100] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id CA_7BrydAlLR for ; Wed, 27 Jun 2018 16:06:02 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with ESMTP id E34CA5F4E7 for ; Wed, 27 Jun 2018 16:06:01 +0000 (UTC) Received: from jira-lw-us.apache.org (unknown [207.244.88.139]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 0DFE8E103D for ; Wed, 27 Jun 2018 16:06:01 +0000 (UTC) Received: from jira-lw-us.apache.org (localhost [127.0.0.1]) by jira-lw-us.apache.org (ASF Mail Server at jira-lw-us.apache.org) with ESMTP id 6DF4923F9D for ; Wed, 27 Jun 2018 16:06:00 +0000 (UTC) Date: Wed, 27 Jun 2018 16:06:00 +0000 (UTC) From: "Alexei Scherbakov (JIRA)" To: issues@ignite.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (IGNITE-8863) Tx rollback can cause remote tx hang MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/IGNITE-8863?page=3Dcom.atlassia= n.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=3D165= 25229#comment-16525229 ]=20 Alexei Scherbakov commented on IGNITE-8863: ------------------------------------------- It's look like TC history for suite cache5 recently become very bad (see [1= ]) mainly due to CacheSerializableTransactionsTest and PartitionsExchangeOn= DiscoveryHistoryOverflowTest. I've muted PartitionsExchangeOnDiscoveryHistoryOverflowTest because it prod= uces OOM almost each run and someone should push PR from IGNITE-8869 to pre= vent OOM. I think my changes are not related to failures because these failures has s= tarted to happen before this commit. [~agoncharuk], please review. [1] https://ci.ignite.apache.org/viewType.html?buildTypeId=3DIgniteTests24J= ava8_Cache5&branch_IgniteTests24Java8=3D%3Cdefault%3E&tab=3DbuildTypeStatus= Div > Tx rollback can cause remote tx hang > ------------------------------------ > > Key: IGNITE-8863 > URL: https://issues.apache.org/jira/browse/IGNITE-8863 > Project: Ignite > Issue Type: Bug > Reporter: Alexei Scherbakov > Assignee: Alexei Scherbakov > Priority: Major > Fix For: 2.7 > > Attachments: Ignite_Tests_2.4_Java_8_Cache_5_1434.log.zip > > > {noformat} > [16:33:56]W: [org.apache.ignite:ignite-core] [2018-06-08 13:3= 3:56,931][WARN ][sys-#66696%client%][GridNearTxLocal] The transaction was f= orcibly rolled back because a timeout is reached: GridNearTxLocal[xid=3De19= 8a9fd361-00000000-0857-6387-0000-000000000004, xidVersion=3DGridCacheVersio= n [topVer=3D139944839, order=3D1528464836894, nodeOrder=3D4], concurrency= =3DPESSIMISTIC, isolation=3DREPEATABLE_READ, state=3DMARKED_ROLLBACK, inval= idate=3Dfalse, rollbackOnly=3Dtrue, nodeId=3D3c8d85b2-4eb9-46b2-8bd1-6f18f5= 42fc7a, timeout=3D1, duration=3D11] > [16:35:55]W: [org.apache.ignite:ignite-core] [2018-06-08 13:3= 5:55,056][WARN ][grid-timeout-worker-#66394%transactions.TxRollbackOnTimeou= tTest0%][diagnostic] Found long running transaction [startTime=3D13:33:56.9= 31, curTime=3D13:35:55.054, tx=3DGridDhtTxRemote [nearNodeId=3D3c8d85b2-4eb= 9-46b2-8bd1-6f18f542fc7a, rmtFutId=3Daf940d0e361-79c59341-3292-46e4-92ce-5c= 4ef4eddef8, nearXidVer=3DGridCacheVersion [topVer=3D139944839, order=3D1528= 464836894, nodeOrder=3D4], storeWriteThrough=3Dfalse, super=3DGridDistribut= edTxRemoteAdapter [explicitVers=3Dnull, started=3Dtrue, commitAllowed=3D0, = txState=3DIgniteTxRemoteSingleStateImpl [entry=3DIgniteTxEntry [key=3DKeyCa= cheObjectImpl [part=3D1, val=3D1, hasValBytes=3Dtrue], cacheId=3D3556498, t= xKey=3DIgniteTxKey [key=3DKeyCacheObjectImpl [part=3D1, val=3D1, hasValByte= s=3Dtrue], cacheId=3D3556498], val=3D[op=3DCREATE, val=3DCacheObjectImpl [v= al=3Dnull, hasValBytes=3Dtrue]], prevVal=3D[op=3DNOOP, val=3Dnull], oldVal= =3D[op=3DNOOP, val=3Dnull], entryProcessorsCol=3Dnull, ttl=3D-1, conflictEx= pireTime=3D-1, conflictVer=3Dnull, explicitVer=3Dnull, dhtVer=3Dnull, filte= rs=3D[], filtersPassed=3Dfalse, filtersSet=3Dfalse, entry=3DGridDhtCacheEnt= ry [rdrs=3D[], part=3D1, super=3DGridDistributedCacheEntry [super=3DGridCac= heMapEntry [key=3DKeyCacheObjectImpl [part=3D1, val=3D1, hasValBytes=3Dtrue= ], val=3DCacheObjectImpl [val=3Dnull, hasValBytes=3Dtrue], startVer=3D15284= 64836879, ver=3DGridCacheVersion [topVer=3D139944839, order=3D1528464836863= , nodeOrder=3D2], hash=3D1, extras=3DGridCacheMvccEntryExtras [mvcc=3DGridC= acheMvcc [locs=3Dnull, rmts=3D[GridCacheMvccCandidate [nodeId=3D97ee44cd-73= c9-4e79-95df-e1a034800001, ver=3DGridCacheVersion [topVer=3D139944839, orde= r=3D1528464836897, nodeOrder=3D2], threadId=3D75880, id=3D2310313, topVer= =3DAffinityTopologyVersion [topVer=3D-1, minorTopVer=3D0], reentry=3Dnull, = otherNodeId=3D3c8d85b2-4eb9-46b2-8bd1-6f18f542fc7a, otherVer=3Dnull, mapped= DhtNodes=3Dnull, mappedNearNodes=3Dnull, ownerVer=3Dnull, serOrder=3Dnull, = key=3DKeyCacheObjectImpl [part=3D1, val=3D1, hasValBytes=3Dtrue], masks=3Dl= ocal=3D0|owner=3D0|ready=3D0|reentry=3D0|used=3D0|tx=3D1|single_implicit=3D= 0|dht_local=3D0|near_local=3D0|removed=3D0|read=3D0, prevVer=3Dnull, nextVe= r=3Dnull], GridCacheMvccCandidate [nodeId=3D97ee44cd-73c9-4e79-95df-e1a0348= 00001, ver=3DGridCacheVersion [topVer=3D139944839, order=3D1528464836900, n= odeOrder=3D2], threadId=3D75875, id=3D2310317, topVer=3DAffinityTopologyVer= sion [topVer=3D-1, minorTopVer=3D0], reentry=3Dnull, otherNodeId=3D3c8d85b2= -4eb9-46b2-8bd1-6f18f542fc7a, otherVer=3Dnull, mappedDhtNodes=3Dnull, mappe= dNearNodes=3Dnull, ownerVer=3Dnull, serOrder=3Dnull, key=3DKeyCacheObjectIm= pl [part=3D1, val=3D1, hasValBytes=3Dtrue], masks=3Dlocal=3D0|owner=3D1|rea= dy=3D0|reentry=3D0|used=3D1|tx=3D1|single_implicit=3D0|dht_local=3D0|near_l= ocal=3D0|removed=3D0|read=3D0, prevVer=3Dnull, nextVer=3Dnull]]]], flags=3D= 2]]], prepared=3D1, locked=3Dfalse, nodeId=3Dnull, locMapped=3Dfalse, expir= yPlc=3Dnull, transferExpiryPlc=3Dfalse, flags=3D0, partUpdateCntr=3D0, serR= eadVer=3Dnull, xidVer=3Dnull]], skipCompletedVers=3Dfalse, super=3DIgniteTx= Adapter [xidVer=3DGridCacheVersion [topVer=3D139944839, order=3D15284648368= 97, nodeOrder=3D2], writeVer=3DGridCacheVersion [topVer=3D139944839, order= =3D1528464836898, nodeOrder=3D2], implicit=3Dfalse, loc=3Dfalse, threadId= =3D75880, startTime=3D1528464836931, nodeId=3D97ee44cd-73c9-4e79-95df-e1a03= 4800001, startVer=3DGridCacheVersion [topVer=3D139944839, order=3D152846483= 6864, nodeOrder=3D1], endVer=3Dnull, isolation=3DREPEATABLE_READ, concurren= cy=3DPESSIMISTIC, timeout=3D1, sysInvalidate=3Dfalse, sys=3Dfalse, plc=3D2,= commitVer=3Dnull, finalizing=3DNONE, invalidParts=3Dnull, state=3DPREPARED= , timedOut=3Dfalse, topVer=3DAffinityTopologyVersion [topVer=3D4, minorTopV= er=3D0], duration=3D118123ms, onePhaseCommit=3Dfalse]]]] > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)