Return-Path: X-Original-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-hdfs-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 7E5D99C25 for ; Sun, 19 Aug 2012 21:50:31 +0000 (UTC) Received: (qmail 87803 invoked by uid 500); 19 Aug 2012 21:50:26 -0000 Delivered-To: apmail-hadoop-hdfs-user-archive@hadoop.apache.org Received: (qmail 87571 invoked by uid 500); 19 Aug 2012 21:50:26 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 87564 invoked by uid 99); 19 Aug 2012 21:50:26 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 19 Aug 2012 21:50:26 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of michael_segel@hotmail.com designates 65.55.111.97 as permitted sender) Received: from [65.55.111.97] (HELO blu0-omc2-s22.blu0.hotmail.com) (65.55.111.97) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 19 Aug 2012 21:50:16 +0000 Received: from BLU0-SMTP153 ([65.55.111.73]) by blu0-omc2-s22.blu0.hotmail.com with Microsoft SMTPSVC(6.0.3790.4675); Sun, 19 Aug 2012 14:49:55 -0700 X-Originating-IP: [173.15.87.38] X-EIP: [enlQFY7NiL/4QVQXrI1QmicypZQQ/qTM] X-Originating-Email: [michael_segel@hotmail.com] Message-ID: Received: from [10.1.10.10] ([173.15.87.38]) by BLU0-SMTP153.blu0.hotmail.com over TLS secured channel with Microsoft SMTPSVC(6.0.3790.4675); Sun, 19 Aug 2012 14:49:54 -0700 Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 (Mac OS X Mail 6.0 \(1485\)) Subject: Re: Can Hadoop replace the use of MQ b/w processes? From: Michael Segel In-Reply-To: <48ECEBDF-1210-4E0C-B87D-1DC6493A6547@gmail.com> Date: Sun, 19 Aug 2012 16:49:52 -0500 Content-Transfer-Encoding: quoted-printable References: <48ECEBDF-1210-4E0C-B87D-1DC6493A6547@gmail.com> To: user@hadoop.apache.org X-Mailer: Apple Mail (2.1485) X-OriginalArrivalTime: 19 Aug 2012 21:49:54.0385 (UTC) FILETIME=[914F2C10:01CD7E54] There has been some work to replace the use of queues with HBase.=20 This would be used to feed processes off the queue to help balance out = the load on the cluster.=20 In one specific use case, this was effective because the time spent = processing each mapper.map() iteration is a couple of orders of = magnitude as the time it takes to pull the data from the 'queue' and to = each node for processing.=20 Again, YMMV, it is an interesting hack though.... On Aug 19, 2012, at 11:46 AM, Robert Nicholson = wrote: > We have an application or a series of applications that listen to = incoming feeds they then distribute this data in XML form to a number of = queues. Another set of processes listen to these queues and process the = messages. Order of processing is important in so far as related messages = need to be processed in sequence hence today all related messages go to = the same queue and are processed by the same queue consumer. >=20 > The idea would be replace the use of MQ with some kind of reliable = distributed dispatch. Does Hadoop provide that? >=20 >=20 >=20 >=20