Return-Path: X-Original-To: apmail-samza-dev-archive@minotaur.apache.org Delivered-To: apmail-samza-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7009A18B5F for ; Wed, 18 Nov 2015 20:43:00 +0000 (UTC) Received: (qmail 55460 invoked by uid 500); 18 Nov 2015 20:43:00 -0000 Delivered-To: apmail-samza-dev-archive@samza.apache.org Received: (qmail 55398 invoked by uid 500); 18 Nov 2015 20:43:00 -0000 Mailing-List: contact dev-help@samza.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@samza.apache.org Delivered-To: mailing list dev@samza.apache.org Received: (qmail 55383 invoked by uid 99); 18 Nov 2015 20:43:00 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 18 Nov 2015 20:43:00 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 929901A29C8 for ; Wed, 18 Nov 2015 20:42:59 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -0.099 X-Spam-Level: X-Spam-Status: No, score=-0.099 tagged_above=-999 required=6.31 tests=[DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, URIBL_BLOCKED=0.001] autolearn=disabled Authentication-Results: spamd2-us-west.apache.org (amavisd-new); dkim=pass (1024-bit key) header.d=chartbeat.com Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id kqeXyMwec2Mz for ; Wed, 18 Nov 2015 20:42:48 +0000 (UTC) Received: from mail-qg0-f45.google.com (mail-qg0-f45.google.com [209.85.192.45]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id DB70B20EBA for ; Wed, 18 Nov 2015 20:42:47 +0000 (UTC) Received: by qgad10 with SMTP id d10so37797599qga.3 for ; Wed, 18 Nov 2015 12:42:40 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=chartbeat.com; s=google; h=content-type:mime-version:subject:from:in-reply-to:date:message-id :references:to; bh=+dD8onSy+w7judqXlhM8qYG6nOLnFOL1TpsOim2438A=; b=mEoOo7HP61cT+EjAKftrqfjlSn2+P4yqFVWfEKn9Jggi5ZksiG/FRAtXDP82iW/u52 BNsj8nZ8P8/AgoGSYJXHVOK4zGcwsvNzdKSlSiKTnsFmP/n+0ym2s6lqevoRa1nDWATM K5ka+2pjAlwChYCtjAg+8nqta6NA+xOikDb6Y= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:content-type:mime-version:subject:from :in-reply-to:date:message-id:references:to; bh=+dD8onSy+w7judqXlhM8qYG6nOLnFOL1TpsOim2438A=; b=f41zM+tdhexqIT8R0iNabFJkssUwtfzQ9QxFH6/5adeQxRi/Fp+KwVC/8rEDQnrvzh 1yRkX0x2c8qVQfixGihae474IlarxhQ5uhz9XmfiYx3ciGdJ5OnNOkJjTo8YSoYQM7rd l1febJj+G39l++pX8ujfqL1r7mt6BdLZf2Jbjf84Dj8jpt9U0CFlXUg/gBnjXREwUZzb AaqNtJ8PWW1ff+kBAQFHHY3XXc8/Uh3QGnFFnd4BSk2BNz0fDgcFDrcj6oxMNW77PeDi pz5oNvhZPxELx2e0KZF77OO27q4FocFlwrHFyAScA0591yhoH6kLidKzfrdVGzaJDlnF 2zvQ== X-Gm-Message-State: ALoCoQnj+6uMBjK6lYSvQcnMwsp6JGTBPuPlcC4k5pNWUVOkieWIgWWU5+w2fXdSe4SY+UdM2dkW X-Received: by 10.140.23.83 with SMTP id 77mr3683237qgo.58.1447879360845; Wed, 18 Nov 2015 12:42:40 -0800 (PST) Received: from ip-192-168-152-149.ec2.internal (static-100-38-5-130.nycmny.fios.verizon.net. [100.38.5.130]) by smtp.gmail.com with ESMTPSA id x129sm1414782qhc.33.2015.11.18.12.42.40 for (version=TLS1 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Wed, 18 Nov 2015 12:42:40 -0800 (PST) Content-Type: multipart/signed; boundary="Apple-Mail=_20075DF1-26C2-4F21-A719-E2464EA826FF"; protocol="application/pgp-signature"; micalg=pgp-sha256 Mime-Version: 1.0 (Mac OS X Mail 8.2 \(2104\)) Subject: Re: Sporadic errors in JobRunner X-Pgp-Agent: GPGMail 2.5.1 From: Rick Mangi In-Reply-To: <3D3FB3B9-020B-4AA7-99B4-C632FF2CC40E@chartbeat.com> Date: Wed, 18 Nov 2015 15:42:39 -0500 Message-Id: <49500F32-6668-4DC6-B9FC-7213795ADDF7@chartbeat.com> References: <24AD0C26-0448-4B13-8486-CBCFBA442AE2@chartbeat.com> <3D3FB3B9-020B-4AA7-99B4-C632FF2CC40E@chartbeat.com> To: dev@samza.apache.org X-Mailer: Apple Mail (2.2104) --Apple-Mail=_20075DF1-26C2-4F21-A719-E2464EA826FF Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=utf-8 I take that back, it happened again. Will try your patch. > On Nov 18, 2015, at 3:36 PM, Rick Mangi wrote: >=20 > I seem to have solved it by only specifying a single zookeeper node in = my job config. Maybe a race condition of some sort? >=20 >=20 >> On Nov 18, 2015, at 2:37 PM, Yi Pan wrote: >>=20 >> Hi, Rick, >>=20 >> I think that you are running into SAMZA-754. I have a RB available = for it >> already. I will upload the patch and it would be good if you can try = the >> patch to see whether that solves your problem. >>=20 >> -Yi >>=20 >> On Tue, Nov 17, 2015 at 12:01 PM, Rick Mangi = wrote: >>=20 >>> Hi, getting things working on samza 0.10.0 finally :) >>>=20 >>> I=E2=80=99m seeing the following error about 1/4 of the time from = run-job.sh when >>> starting jobs: >>>=20 >>> [yarnmaster01] out: 2015-11-17 14:56:00 KafkaSystemAdmin$ [INFO] Got >>> metadata: Map(__samza_coordinator_t-key-grouper_dev -> = SystemStreamMetadata >>> [streamName=3D__samza_coordinator_t-key-grouper_dev, >>> partitionMetadata=3D{Partition = [partition=3D0]=3DSystemStreamPartitionMetadata >>> [oldestOffset=3Dnull, newestOffset=3Dnull, upcomingOffset=3D0]}]) >>> [yarnmaster01] out: Exception in thread "main" >>> java.lang.NullPointerException >>> [yarnmaster01] out: at >>> = java.util.concurrent.ConcurrentHashMap.put(ConcurrentHashMap.java:1124) >>> [yarnmaster01] out: at >>> = scala.collection.convert.Wrappers$JMapWrapperLike$class.update(Wrappers.sc= ala:257) >>> [yarnmaster01] out: at >>> = scala.collection.convert.Wrappers$JConcurrentMapWrapper.update(Wrappers.sc= ala:348) >>> [yarnmaster01] out: at >>> = scala.collection.mutable.MapLike$class.getOrElseUpdate(MapLike.scala:189) >>> [yarnmaster01] out: at >>> scala.collection.mutable.AbstractMap.getOrElseUpdate(Map.scala:91) >>> [yarnmaster01] out: at >>> = org.apache.samza.system.kafka.KafkaSystemConsumer.register(KafkaSystemCons= umer.scala:108) >>> [yarnmaster01] out: at >>> = org.apache.samza.coordinator.stream.CoordinatorStreamSystemConsumer.regist= er(CoordinatorStreamSystemConsumer.java:112) >>> [yarnmaster01] out: at >>> org.apache.samza.job.JobRunner.run(JobRunner.scala:88) >>> [yarnmaster01] out: at >>> org.apache.samza.job.JobRunner$.main(JobRunner.scala:43) >>> [yarnmaster01] out: at >>> org.apache.samza.job.JobRunner.main(JobRunner.scala) >>> [yarnmaster01] out: >>>=20 >>>=20 >>> The same job will startup fine a minute later. >>>=20 >=20 --Apple-Mail=_20075DF1-26C2-4F21-A719-E2464EA826FF Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename=signature.asc Content-Type: application/pgp-signature; name=signature.asc Content-Description: Message signed with OpenPGP using GPGMail -----BEGIN PGP SIGNATURE----- iQEcBAEBCAAGBQJWTOK/AAoJENgofZAtxzWQZoIH/jxauqCXnCD1mfmgXvnoPjyI q6qoLV6RGnHFavA4mcGzTcpXNrDv2M/RmXhiUseijQWI8G1CO6/zZjOzA6sVC6OY hPtEjSGd0EkLtk63waD3ZWMe62txuF4NUraLCpZLRWi53THE11YchmuzEN7WidS5 5ds9xcZ/Z0Tj4sV6yiIGVAA41oRn4AcIvHL82iw0QlVmVOQmDyWBLr5s1LN0QO4h EpIHAMOm1DsqCK2jX3TY/VvLbR4TxgnVWdK4qd8PgINxZTT4tjXh+AQG9hgVRVk/ 6Cac8vltN5wZlEJVabjjUq5Fjw668heEQ1HLyCsrLeZEdAH+NtQiUohgyfTrQqY= =XEiZ -----END PGP SIGNATURE----- --Apple-Mail=_20075DF1-26C2-4F21-A719-E2464EA826FF--