From: Asim Zafir
To: user@flume.apache.org
Subject: Re: Performance of Flume in production systems
Date: Thu, 25 Sep 2014 02:40:21 -0700

It really depends, but a few questions need answering before a proper suggestion can be made:

What kind of agent are you using in the pipeline that sinks to HDFS?
Does your pipeline involve a collector?
What kind of channel are you using across the data pipeline?
How frequently do you want to roll the Flume events?

It would be helpful to see your data pipeline architecture before making a suggestion.

Asim Zafir

On Wed, Sep 24, 2014 at 10:53 PM, Blade Liu wrote:

> Hi,
>
> I'm going to deploy Flume in production systems, but I'm a little worried
> about its performance in a real-world environment. Could anyone tell me about
> Flume's actual performance in a production environment? Say, can Flume
> deal with 20,000 events per second from a single source (and what about
> 100-200 sources with one final HDFS sink)?
>
> In addition, to reach good performance of tens of thousands of events per
> second, how many servers (agents) should be used? Do more agents (and more
> tiers) mean better performance?
>
> Thanks very much for your suggestions.
>
> Cheers,
> Blade
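For a sense of scale, the figures in the quoted question can be worked through with some rough arithmetic. The sketch below is only a back-of-envelope estimate, not a statement about what Flume can sustain. It assumes that each of the 100-200 sources emits at the same 20,000 events per second (the thread does not say this), and the 300-second roll interval is a purely hypothetical value, since the roll frequency is one of the open questions above. The function names are illustrative.

```python
# Back-of-envelope sizing for the numbers in the quoted question.
# ASSUMPTION: every source emits at the same rate; the thread does not state this.
EVENTS_PER_SECOND_PER_SOURCE = 20_000


def aggregate_events_per_second(source_count, per_source_rate=EVENTS_PER_SECOND_PER_SOURCE):
    """Total events per second arriving at the final tier from all sources."""
    return source_count * per_source_rate


def events_per_roll(source_count, roll_interval_seconds,
                    per_source_rate=EVENTS_PER_SECOND_PER_SOURCE):
    """Events written between two rolls if files are rolled on a fixed time interval."""
    return aggregate_events_per_second(source_count, per_source_rate) * roll_interval_seconds


if __name__ == "__main__":
    # ASSUMPTION: hypothetical roll interval, chosen only for illustration.
    roll_interval_seconds = 300
    for sources in (1, 100, 200):
        total = aggregate_events_per_second(sources)
        per_roll = events_per_roll(sources, roll_interval_seconds)
        print(f"{sources} source(s): {total:,} events/s, "
              f"{per_roll:,} events per {roll_interval_seconds} s roll")
```

Under that assumption, 100 sources add up to 2,000,000 events per second and 200 sources to 4,000,000 at the final HDFS sink. That is why the agent layout, channel type, and roll frequency asked about above matter before any sizing advice can be given.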