Date: Tue, 15 Apr 2014 15:54:41 -0500
From: "Alan M. Carroll"
Organization: Network Geographics, Inc.
Message-ID: <558096448.20140415155441@network-geographics.com>
To: Phil Sorber
Subject: Re: [VOTE] Release Apache Traffic Server 4.2.1 (RC0)

-1

I think there's still a bug, demonstrated by some testing Phil has done.
We have a theory on the cause, and Phil is setting up the testing again with a fix. Our view is that the previous patch introduced a race condition involving the ram cache, and that this is the cause of the crash. The fix is to not do the 4.2.0 fixup on objects from the ram cache, as they have already had it done when read from disk. Repeating the fixup is not directly a problem, because the 4.2.0 fixup is idempotent, but it can cause a race. Not doing the fixup on ram cache objects avoids the race condition.
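
To illustrate the idea, here is a minimal sketch, not the actual Traffic Server code. Every name in it (CachedObject, fixup_4_2_0, load_object, the version constants) is hypothetical. It also assumes that the fixup rewrites header fields in place, so even a repeat that produces the same values is still a write to a buffer that other readers of the ram cache may be using.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a cached object header; not a real ATS type.
struct CachedObject {
  uint32_t version;     // format version marker (assumed field)
  bool from_ram_cache;  // true if the buffer was served from the ram cache
};

constexpr uint32_t kOldVersion     = 1;  // assumed pre-4.2.0 marker
constexpr uint32_t kCurrentVersion = 2;  // assumed post-fixup marker

// Idempotent fixup: running it twice yields the same result, but each run
// still writes to the object. On a buffer shared through the ram cache,
// that write is what could race with concurrent readers (assumption).
void fixup_4_2_0(CachedObject &obj) {
  obj.version = kCurrentVersion;
}

// Sketch of the proposed fix: skip the fixup for ram cache hits, since
// those objects were already fixed up when first read from disk.
// Returns true if the fixup was applied.
bool load_object(CachedObject &obj) {
  if (obj.from_ram_cache) {
    // Already upgraded on the disk read path; do not touch shared memory.
    assert(obj.version == kCurrentVersion);
    return false;
  }
  fixup_4_2_0(obj);
  return true;
}
```

With this shape, only the disk read path writes to the object. A ram cache hit becomes read-only, which is what removes the race in this sketch.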