Date: Tue, 22 Jul 2014 03:41:19 -0400
Subject: Storm Nub questions.
From: Sai SaiGraph
To: user@storm.incubator.apache.org

Hi,

I'm trying to understand parallelism in Storm and have a few questions about it:

1. Let's say I have only a single file to test with, containing a single line, L1. If I use 2 spouts, will 2 spouts be created, or will only 1 spout be created because there is only one line?

2. Does "parallelism hint" mean that it is not guaranteed that multiple objects are created, and that it is up to Storm to create them based on the size of the data?

3. On a single node, this will create 1 bolt, that is, 1 object in 1 thread:

    builder.setBolt(SPLIT_BOLT_ID, splitBolt, 1)
        .shuffleGrouping(SENTENCE_SPOUT_ID);

and on a single node, this will create 2 bolts, that is, 2 objects in 1 thread:

    builder.setBolt(SPLIT_BOLT_ID, splitBolt, 2)
        .shuffleGrouping(SENTENCE_SPOUT_ID);

My question is: is this really parallelism? Is it correct to call this parallelism at the thread level, and will it really help achieve parallelism on a single node?

The reason I ask is that in Hadoop we achieve parallelism at the process level, which makes it easy to see that we are achieving parallelism, but in Storm I am confused. Can someone please shed some light?

Thanks,
Sai