Date: Fri, 1 Mar 2013 17:55:14 +0000 (UTC)
From: "John Fung (JIRA)"
To: dev@kafka.apache.org
Subject: [jira] [Commented] (KAFKA-772) System Test Transient Failure on testcase_0122

    [ https://issues.apache.org/jira/browse/KAFKA-772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13590771#comment-13590771 ]

John Fung commented on KAFKA-772:
---------------------------------

A similar failure occurred in testcase_0125 yesterday in our distributed environment. Attached are the log4j messages and data log segment files for reference.

The failure is as follows (similar to testcase_0122):

Unique messages from consumer on [test_1] at simple_consumer_test_1-0_r1.log : 1715
Unique messages from consumer on [test_1] at simple_consumer_test_1-0_r2.log : 1715
Unique messages from consumer on [test_1] at simple_consumer_test_1-0_r3.log : 1715
Unique messages from consumer on [test_1] at simple_consumer_test_1-1_r1.log : 1711
Unique messages from consumer on [test_1] at simple_consumer_test_1-1_r2.log : 1711
Unique messages from consumer on [test_1] at simple_consumer_test_1-1_r3.log : 1711
Unique messages from consumer on [test_1] at simple_consumer_test_1-2_r1.log : 1469
Unique messages from consumer on [test_1] at simple_consumer_test_1-2_r2.log : 1469
Unique messages from consumer on [test_1] at simple_consumer_test_1-2_r3.log : 1469
Unique messages from consumer on [test_2] : 4895
Unique messages from consumer on [test_2] at simple_consumer_test_2-0_r1.log : 1715
Unique messages from consumer on [test_2] at simple_consumer_test_2-0_r2.log : 1715
Unique messages from consumer on [test_2] at simple_consumer_test_2-0_r3.log : 1682
Unique messages from consumer on [test_2] at simple_consumer_test_2-1_r1.log : 1708
Unique messages from consumer on [test_2] at simple_consumer_test_2-1_r2.log : 1708
Unique messages from consumer on [test_2] at simple_consumer_test_2-1_r3.log : 1708
Unique messages from consumer on [test_2] at simple_consumer_test_2-2_r1.log : 1467
Unique messages from consumer on [test_2] at simple_consumer_test_2-2_r2.log : 1467
Unique messages from consumer on [test_2] at simple_consumer_test_2-2_r3.log : 1467
Unique messages from producer on [test_2] : 4900
Validate for data matched on topic [test_1] across replicas : PASSED
Validate for data matched on topic [test_2] : PASSED
Validate for data matched on topic [test_2] across replicas : FAILED
Validate for merged log segment checksum in cluster [source] : FAILED
Validate leader election successful : PASSED

> System Test Transient Failure on testcase_0122
> ----------------------------------------------
>
>                 Key: KAFKA-772
>                 URL: https://issues.apache.org/jira/browse/KAFKA-772
>             Project: Kafka
>          Issue Type: Bug
>    Affects Versions: 0.8
>            Reporter: John Fung
>            Assignee: Sriram Subramanian
>              Labels: kafka-0.8, p1
>         Attachments: testcase_0122.tar.gz, testcase_0125.tar.gz
>
>
> * This test case has been failing intermittently over the past few weeks. Please note that there is a small percentage data-loss allowance for the test case with Ack = 1. But the failure in this case is a mismatch of the log segment checksum across the replicas.
> * Test description:
> 3-broker cluster
> Replication factor = 3
> No. of topics = 2
> No. of partitions = 3
> Controlled failure (kill -15)
> Ack = 1
> * Test case output
> _test_case_name : testcase_0122
> _test_class_name : ReplicaBasicTest
> arg : auto_create_topic : true
> arg : bounce_broker : true
> arg : broker_type : leader
> arg : message_producing_free_time_sec : 15
> arg : num_iteration : 3
> arg : num_partition : 3
> arg : replica_factor : 3
> arg : sleep_seconds_between_producer_calls : 1
> validation_status :
> Leader Election Latency - iter 1 brokerid 3 : 377.00 ms
> Leader Election Latency - iter 2 brokerid 1 : 374.00 ms
> Leader Election Latency - iter 3 brokerid 2 : 384.00 ms
> Leader Election Latency MAX : 384.00
> Leader Election Latency MIN : 374.00
> Unique messages from consumer on [test_1] at simple_consumer_test_1-0_r1.log : 1750
> Unique messages from consumer on [test_1] at simple_consumer_test_1-0_r2.log : 1750
> Unique messages from consumer on [test_1] at simple_consumer_test_1-0_r3.log : 1750
> Unique messages from consumer on [test_1] at simple_consumer_test_1-1_r1.log : 1750
> Unique messages from consumer on [test_1] at simple_consumer_test_1-1_r2.log : 1750
> Unique messages from consumer on [test_1] at simple_consumer_test_1-1_r3.log : 1750
> Unique messages from consumer on [test_1] at simple_consumer_test_1-2_r1.log : 1500
> Unique messages from consumer on [test_1] at simple_consumer_test_1-2_r2.log : 1500
> Unique messages from consumer on [test_1] at simple_consumer_test_1-2_r3.log : 1500
> Unique messages from consumer on [test_2] : 5000
> Unique messages from consumer on [test_2] at simple_consumer_test_2-0_r1.log : 1714
> Unique messages from consumer on [test_2] at simple_consumer_test_2-0_r2.log : 1714
> Unique messages from consumer on [test_2] at simple_consumer_test_2-0_r3.log : 1680
> Unique messages from consumer on [test_2] at simple_consumer_test_2-1_r1.log : 1708
> Unique messages from consumer on [test_2] at simple_consumer_test_2-1_r2.log : 1708
> Unique messages from consumer on [test_2] at simple_consumer_test_2-1_r3.log : 1708
> Unique messages from consumer on [test_2] at simple_consumer_test_2-2_r1.log : 1469
> Unique messages from consumer on [test_2] at simple_consumer_test_2-2_r2.log : 1469
> Unique messages from consumer on [test_2] at simple_consumer_test_2-2_r3.log : 1469
> Unique messages from producer on [test_2] : 4900
> Validate for data matched on topic [test_1] across replicas : PASSED
> Validate for data matched on topic [test_2] : FAILED
> Validate for data matched on topic [test_2] across replicas : FAILED
> Validate for merged log segment checksum in cluster [source] : FAILED
> Validate leader election successful : PASSED
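
For anyone scanning these reports by hand, the per-replica lines above can be checked mechanically. The following is only a rough sketch, not the system test framework's own validation code: the function name find_replica_mismatches and the regular expression are assumptions based on the line format shown in this report. It groups the "Unique messages from consumer" counts by topic and partition and reports the partitions whose replicas disagree. Equal counts do not prove that the log segment checksums match; the sketch only points at where the counts diverge.

```python
import re
from collections import defaultdict

# Assumed line format, taken from the report text in this message:
#   Unique messages from consumer on [<topic>] at
#   simple_consumer_<topic>-<partition>_r<replica>.log : <count>
LINE_RE = re.compile(
    r"Unique messages from consumer on \[(?P<topic>[^\]]+)\] at "
    r"simple_consumer_(?P=topic)-(?P<partition>\d+)_r(?P<replica>\d+)\.log"
    r"\s*:\s*(?P<count>\d+)"
)


def find_replica_mismatches(report_text):
    """Return {(topic, partition): {replica: count}} for every partition
    whose replicas report different unique-message counts."""
    counts = defaultdict(dict)
    for match in LINE_RE.finditer(report_text):
        key = (match.group("topic"), int(match.group("partition")))
        counts[key][int(match.group("replica"))] = int(match.group("count"))
    # Keep only partitions where at least two replicas disagree.
    return {
        key: by_replica
        for key, by_replica in counts.items()
        if len(set(by_replica.values())) > 1
    }


# Excerpt of the testcase_0125 output quoted in this message.
report = """
Unique messages from consumer on [test_2] at simple_consumer_test_2-0_r1.log : 1715
Unique messages from consumer on [test_2] at simple_consumer_test_2-0_r2.log : 1715
Unique messages from consumer on [test_2] at simple_consumer_test_2-0_r3.log : 1682
Unique messages from consumer on [test_2] at simple_consumer_test_2-1_r1.log : 1708
Unique messages from consumer on [test_2] at simple_consumer_test_2-1_r2.log : 1708
Unique messages from consumer on [test_2] at simple_consumer_test_2-1_r3.log : 1708
"""

for (topic, partition), by_replica in find_replica_mismatches(report).items():
    print(topic, partition, by_replica)
```

Run against the excerpt, this flags only partition 0 of test_2, where replica 3 reports fewer messages than replicas 1 and 2. The same pattern shows up in the testcase_0122 output (1714, 1714, 1680).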