Return-Path: X-Original-To: apmail-flume-user-archive@www.apache.org Delivered-To: apmail-flume-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 005DBE457 for ; Tue, 12 Mar 2013 21:12:54 +0000 (UTC) Received: (qmail 47065 invoked by uid 500); 12 Mar 2013 21:12:53 -0000 Delivered-To: apmail-flume-user-archive@flume.apache.org Received: (qmail 47014 invoked by uid 500); 12 Mar 2013 21:12:53 -0000 Mailing-List: contact user-help@flume.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@flume.apache.org Delivered-To: mailing list user@flume.apache.org Received: (qmail 47004 invoked by uid 99); 12 Mar 2013 21:12:53 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Mar 2013 21:12:53 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of roshan@hortonworks.com designates 209.85.214.169 as permitted sender) Received: from [209.85.214.169] (HELO mail-ob0-f169.google.com) (209.85.214.169) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 12 Mar 2013 21:12:47 +0000 Received: by mail-ob0-f169.google.com with SMTP id ta14so335982obb.14 for ; Tue, 12 Mar 2013 14:12:26 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20120113; h=mime-version:x-received:in-reply-to:references:date:message-id :subject:from:to:content-type:x-gm-message-state; bh=RPbPaJna68AYYkwjYv5iHi6Js/TG6YcOrOdj78hnnss=; b=gr7MA/E2AXtXfl+c7ASbOtktAr7ibYjBE4RxPPt64jRGcayatgqn7mkCXOyQ9dFbAB ncKlnEZxVKNvKbQeZJu37/shbz4ytmQ5iSPINgpi70QAmEL3Dl5MKgdDNGqz2P2vEuZY u2CkXexw5IeScTPtZsO5zkk7mYE2lyypQT1xC9mH8nQIaee66mNlak1k2MqhsIDvimdh 1y9wwwrlH7VCL8rP2mkaH2OHvFA9ZoMEm318dYLXJb9pe/kkjzBUKFzQJaFVVC26u8Sf TVitHXbDuFWb6rmQ6iioO79bpQ5MXhnp+tUAAOCG1IAdeiNvOaxU9XQtI5AfYp3705GT u9VA== MIME-Version: 1.0 X-Received: by 10.182.146.42 with SMTP id sz10mr13475413obb.83.1363122746823; Tue, 12 Mar 2013 14:12:26 -0700 (PDT) Received: by 10.182.11.102 with HTTP; Tue, 12 Mar 2013 14:12:26 -0700 (PDT) In-Reply-To: References: Date: Tue, 12 Mar 2013 14:12:26 -0700 Message-ID: Subject: Re: Best way to increase throughput of Exec->Memory->Avro agent. From: Roshan Naik To: user@flume.apache.org Content-Type: text/plain; charset=ISO-8859-1 X-Gm-Message-State: ALoCoQnmPyeqArO/d6g5nLWPmeoCV2dfuoWIByUxW5QUKcqKWOvC+XBIwEd/PM6QV0jgLyCkc1tR X-Virus-Checked: Checked by ClamAV on apache.org i meant 640,000 not 64,000 On Tue, Mar 12, 2013 at 2:10 PM, Roshan Naik wrote: > beyond a certain # of sinks it wont help adding more. my suspicion is > you may have gone way overboard. > > if your sink-side batch size is that large and you have 64 sinks in > the round-robin.. it will take a lot of events (64,000) to be pumped > in by the source order before the first event can start trickling out > of any sink. Also memory consumption will be quite high.. each sink > will open a transaction and hold on to 10000 events. This the cause > for the Memory channel filling up. Until the sink side transaction is > committed (i.e 10k events are pulled), the memory reservation on the > channel is not relinquished. So your memory channel size will have to > really high to support so manch sinks each with such a big batch size. > > My gut feel is that your source-side batch size is not much of an > issue and can be smaller. Increasing the number of sinks will only > help if the sink is indeed the bott > > On Tue, Mar 12, 2013 at 1:43 PM, Chris Neal wrote: >> Hi all. >> >> I've been working on this for quite some time, and need some advice from the >> experts. I have a two tiered Flume architecture: >> >> App Tier (all on one server): >> 124 ExecSources -> MemoryChannel -> AvroSinks >> >> HDFS Tier (on two servers): >> AvroSource -> FileChannel -> HDFSSinks >> >> When I run the agents, the HDFS tier is keeping up fine with the App Tier. >> queue sizes stay between 0-10000 (I have a batch size of 10000). All is >> good. >> >> On the App Tier, when I view the JMX data through jconsole, I watch the size >> of the MemoryChannel grow steadily until it reaches the max, then it starts >> throwing exceptions about not being able to put the batch on the channel as >> expected. >> >> There seems to be two basic ways to increase the throughput of the App Tier: >> 1. Increase the MemoryChannel's transactionCapacity and the corresponding >> AvroSink's batch-size. Both are set to 10000 for me. >> 2. Increase the number of AvroSinks to drain the MemoryChannel. I'm up to >> 64 Sinks now which round-robin between the two Flume Agents on the HDFS >> tier. >> >> Both of those values seem quite high to me (batch size and number of sinks). >> >> Am I missing something as far as tuning? >> Which would allow for greater increase to throughput, more Sinks or larger >> batch size? >> >> I'm stumped here. I still think I can get this to work. :) >> >> Any suggestions are most welcome. >> Thanks for your time. >> Chris >>