Return-Path: X-Original-To: apmail-incubator-kafka-dev-archive@minotaur.apache.org Delivered-To: apmail-incubator-kafka-dev-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 898BFD292 for ; Wed, 1 Aug 2012 16:08:14 +0000 (UTC) Received: (qmail 61576 invoked by uid 500); 1 Aug 2012 16:08:14 -0000 Delivered-To: apmail-incubator-kafka-dev-archive@incubator.apache.org Received: (qmail 61545 invoked by uid 500); 1 Aug 2012 16:08:14 -0000 Mailing-List: contact kafka-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: kafka-dev@incubator.apache.org Delivered-To: mailing list kafka-dev@incubator.apache.org Received: (qmail 61537 invoked by uid 99); 1 Aug 2012 16:08:14 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Aug 2012 16:08:14 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of neha.narkhede@gmail.com designates 209.85.213.47 as permitted sender) Received: from [209.85.213.47] (HELO mail-yw0-f47.google.com) (209.85.213.47) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Aug 2012 16:08:09 +0000 Received: by yhjj56 with SMTP id j56so6764236yhj.6 for ; Wed, 01 Aug 2012 09:07:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; bh=yrJ/2JMzg0kt90s4ip5GtFMrmYjnxBtLHwrzRd+E5yg=; b=m33Fmn6zODdAAtBWQbFoFeRbNMUVW2aUpafur6VjE50B2M9GVHXoMikKCRzZTzXCI7 WErOlsz2S0djq+JwAyEk32XHYvCBP1OTgw74yie4GmyITCTi1gSecloqd+WRDIwAbpbn giI4icjE5reL7EUO67NotDPv+WYyc9mPKmIyT4okZc+4ZrY/yEj2/6zBakEclV6nsh96 6XpWQtwxQfoK65wXNTeGmuvZJdZhDN/BvoYyON6Xn2Q1hIDvKZkXnR+uUHI2m3V8VPqa cUsBNBgsEmb1ZKlr7HQ6mPKgEKvhPTq8uyY1h/+vPQYkiVvcBD6VBeTi4fHDzGLgt6Vr fdKg== MIME-Version: 1.0 Received: by 10.50.213.39 with SMTP id np7mr5914229igc.51.1343837268344; Wed, 01 Aug 2012 09:07:48 -0700 (PDT) Received: by 10.64.18.175 with HTTP; Wed, 1 Aug 2012 09:07:48 -0700 (PDT) In-Reply-To: <611B2319-62A3-4512-A045-6CBF39B2371E@gmail.com> References: <133DF7E2-A2EA-4975-81D7-2199968849CA@gmail.com> <611B2319-62A3-4512-A045-6CBF39B2371E@gmail.com> Date: Wed, 1 Aug 2012 09:07:48 -0700 Message-ID: Subject: Re: Log4jAppender backoff if server is down From: Neha Narkhede To: kafka-dev@incubator.apache.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org David, >From the logs, seems like the box you ran a Kafka server on, ran out of disk space. That's why the Kafka server starts up but shuts itself down immediately. Please can you try starting Kafka on a box with enough disk space ? Thanks, Neha On Tue, Jul 31, 2012 at 5:08 PM, David Arthur wrote: > Here are some log snippets: > > Kafka server logs: https://gist.github.com/c440ada8daa629e337e2 > Solr logs: https://gist.github.com/42624c901fc7967fd137 > > In this case, I am sending all the "org.apache.solr" logs to Kafka, so ea= ch document update in Solr produces a log message. Each update to Solr prod= uced an exception like this which caused things to slow way down. > > My earlier statement about Kafka being up but unable to write logs was in= correct. During this, it seems Kafka was simply down (our supervisor that r= estarts Kafka gave up after a few tries). So in the case that Kafka is down= , what should the client behavior be like? > > Ideally, to me, the client could know about what brokers are available th= rough ZK watches and just refuse to attempt a send/produce if no one is ava= ilable. > > What do you guys think? I'm not saying this is necessarily a Kafka issue,= I'm just not sure what's the best thing to do here. > > Cheers > -David > > On Jul 31, 2012, at 5:48 PM, Neha Narkhede wrote: > >> David, >> >> Would you mind sending around the error stack traces ? That will help >> determine the right fix. >> >> Thanks, >> Neha >> >> On Tue, Jul 31, 2012 at 1:09 PM, David Arthur wrote: >>> Greetings all, >>> >>> I'm using the KafkaLog4jAppender with Solr and ran into an interesting = issue recently. The disk filled up on my Kafka broker (just a single broker= , this is a dev environment) and Solr slowed down to a near halt. My best e= stimation is that each log4j log message created was incurring quite a bit = of overhead dealing with exceptions coming back from the Kafka broker. >>> >>> So I'm wondering, would it make sense to implement some back off strate= gy for this client if it starts getting exceptions from the server? Alterna= tively, could maybe the Kafka broker mark it self as "down" in ZooKeeper if= it gets into certain situations (like disk full). I guess this really coul= d apply to any client, not just the log4j appender. >>> >>> Thanks! >>> -David >