Return-Path: Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: (qmail 86666 invoked from network); 14 Jun 2010 17:53:37 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 14 Jun 2010 17:53:37 -0000 Received: (qmail 783 invoked by uid 500); 14 Jun 2010 17:53:37 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 731 invoked by uid 500); 14 Jun 2010 17:53:36 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 721 invoked by uid 99); 14 Jun 2010 17:53:36 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 14 Jun 2010 17:53:36 +0000 X-ASF-Spam-Status: No, hits=-1518.9 required=10.0 tests=ALL_TRUSTED,AWL X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 14 Jun 2010 17:53:36 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o5EHrF3S028675 for ; Mon, 14 Jun 2010 17:53:15 GMT Message-ID: <15819742.82031276537995390.JavaMail.jira@thor> Date: Mon, 14 Jun 2010 13:53:15 -0400 (EDT) From: "Gary Dusbabek (JIRA)" To: commits@cassandra.apache.org Subject: [jira] Reopened: (CASSANDRA-1169) AES makes Streaming unhappy In-Reply-To: <24890924.209921275882115469.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/CASSANDRA-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gary Dusbabek reopened CASSANDRA-1169: -------------------------------------- > AES makes Streaming unhappy > --------------------------- > > Key: CASSANDRA-1169 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1169 > Project: Cassandra > Issue Type: Bug > Components: Core > Reporter: Jonathan Ellis > Assignee: Gary Dusbabek > Priority: Critical > Fix For: 0.6.3, 0.7 > > Attachments: 1169.txt, aes.txt > > > Streaming service assumes there will only be one stream from S to T at a time for any nodes S and T. For the original purpose of node movement, this was a reasonable assumption (any node T can only perform one move at a time) but AES throws off streaming tasks much more frequently than that given the right conditions, which will de-sync the fragile file ordering that Streaming assumes (that T knows which files S is going to send, in what order). Eventually T is expecting file F1 but S sends a smaller file F2, leading to an infinite loop on T while it waits for F1 to finish, and T waits for S to acknowledge F2, which it never will. > For 0.6 maybe the best solution is for AES to manually wait for one of its streaming tasks to finish, before it allows itself to create another. For 0.7 it would be nice to make Streaming more robust. The whole 4-stage-ack process seems very fragile, and poking around in parent objects via inetaddress keys makes reasoning about small pieces impossible b/c of encapsulation violations. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.