Return-Path: Delivered-To: apmail-cassandra-commits-archive@www.apache.org Received: (qmail 93828 invoked from network); 14 Jun 2010 18:05:42 -0000 Received: from unknown (HELO mail.apache.org) (140.211.11.3) by 140.211.11.9 with SMTP; 14 Jun 2010 18:05:42 -0000 Received: (qmail 12444 invoked by uid 500); 14 Jun 2010 18:05:41 -0000 Delivered-To: apmail-cassandra-commits-archive@cassandra.apache.org Received: (qmail 12428 invoked by uid 500); 14 Jun 2010 18:05:41 -0000 Mailing-List: contact commits-help@cassandra.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@cassandra.apache.org Delivered-To: mailing list commits@cassandra.apache.org Received: (qmail 12419 invoked by uid 99); 14 Jun 2010 18:05:41 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 14 Jun 2010 18:05:41 +0000 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.22] (HELO thor.apache.org) (140.211.11.22) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 14 Jun 2010 18:05:38 +0000 Received: from thor (localhost [127.0.0.1]) by thor.apache.org (8.13.8+Sun/8.13.8) with ESMTP id o5EI5HU9028910 for ; Mon, 14 Jun 2010 18:05:17 GMT Message-ID: <28977574.82361276538717422.JavaMail.jira@thor> Date: Mon, 14 Jun 2010 14:05:17 -0400 (EDT) From: "Jonathan Ellis (JIRA)" To: commits@cassandra.apache.org Subject: [jira] Commented: (CASSANDRA-1169) AES makes Streaming unhappy In-Reply-To: <24890924.209921275882115469.JavaMail.jira@thor> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/CASSANDRA-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12878670#action_12878670 ] Jonathan Ellis commented on CASSANDRA-1169: ------------------------------------------- won't removing the active SOM bork things, if another stream to that target is going on? > AES makes Streaming unhappy > --------------------------- > > Key: CASSANDRA-1169 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1169 > Project: Cassandra > Issue Type: Bug > Components: Core > Reporter: Jonathan Ellis > Assignee: Gary Dusbabek > Priority: Critical > Fix For: 0.6.3, 0.7 > > Attachments: 1169-2.txt, 1169.txt, aes.txt > > > Streaming service assumes there will only be one stream from S to T at a time for any nodes S and T. For the original purpose of node movement, this was a reasonable assumption (any node T can only perform one move at a time) but AES throws off streaming tasks much more frequently than that given the right conditions, which will de-sync the fragile file ordering that Streaming assumes (that T knows which files S is going to send, in what order). Eventually T is expecting file F1 but S sends a smaller file F2, leading to an infinite loop on T while it waits for F1 to finish, and T waits for S to acknowledge F2, which it never will. > For 0.6 maybe the best solution is for AES to manually wait for one of its streaming tasks to finish, before it allows itself to create another. For 0.7 it would be nice to make Streaming more robust. The whole 4-stage-ack process seems very fragile, and poking around in parent objects via inetaddress keys makes reasoning about small pieces impossible b/c of encapsulation violations. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.