Return-Path: X-Original-To: apmail-hadoop-common-dev-archive@www.apache.org Delivered-To: apmail-hadoop-common-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id DF0376E50 for ; Thu, 7 Jul 2011 00:49:41 +0000 (UTC) Received: (qmail 28971 invoked by uid 500); 7 Jul 2011 00:49:40 -0000 Delivered-To: apmail-hadoop-common-dev-archive@hadoop.apache.org Received: (qmail 28867 invoked by uid 500); 7 Jul 2011 00:49:39 -0000 Mailing-List: contact common-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: common-dev@hadoop.apache.org Delivered-To: mailing list common-dev@hadoop.apache.org Received: (qmail 28857 invoked by uid 99); 7 Jul 2011 00:49:39 -0000 Received: from minotaur.apache.org (HELO minotaur.apache.org) (140.211.11.9) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2011 00:49:39 +0000 Received: from localhost (HELO awittena-md.linkedin.biz) (127.0.0.1) (smtp-auth username aw, mechanism plain) by minotaur.apache.org (qpsmtpd/0.29) with ESMTP; Thu, 07 Jul 2011 00:49:39 +0000 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Apple Message framework v1082) Subject: Re: Hadoop Master and Slave Discovery From: Allen Wittenauer In-Reply-To: Date: Wed, 6 Jul 2011 17:49:37 -0700 Content-Transfer-Encoding: quoted-printable Message-Id: References: <31981952.post@talk.nabble.com> <4E119987.2050501@apache.org> <4E12DC2B.9040903@apache.org> To: X-Mailer: Apple Mail (2.1082) On Jul 6, 2011, at 5:05 PM, Eric Yang wrote: > Did you know that almost all linux desktop system comes with avahi > pre-installed and turn on by default? ... which is why most admins turn those services off by default. = :) > What is more interesting is > that there are thousands of those machines broadcast in large > cooperation without anyone noticing them?=20 That's because many network teams turn off multicast past the = subnet boundary and many corporate desktops are in class C subnets. = This automatically limits the host count down to 200-ish per network. = Usually just the unicast traffic is bad enough. Throwing multicast into = the mix just makes it worse. > I have recently built a > multicast dns browser and look into the number of machines running in > a large company environment. The number of desktop, laptop and > printer machines running multicast dns is far exceeding 1000 machines > in the local subnet. =46rom my understanding of Y!'s network, the few /22's they have = (which would get you 1022 potential hosts on a subnet) have multicast = traffic dropped at the router and switch levels. Additionally, DNS-SD = (the service discovery portion of mDNS) offers unicast support as well. = So there is a very good chance that the traffic you are seeing is from = unicast, not multicast. The 1000 number, BTW, comes from Apple. I'm sure they'd be = interested in your findings given their role in ZC. =20 BTW, I'd much rather hear that you set up a /22 with many many = machines running VMs trying to actually use mDNS for something useful. = A service browser really isn't that interesting. > They are all happily working fine without causing any issues. ... that you know of. Again, I'm 99% certain that Y! is = dropping multicast packets into the bit bucket at the switch boundaries. = [I remember having this conversation with them when we setup the new = data centers.] > Printer works fine, Most admins turn SLP and other broadcast services on printers = off. For large networks, one usually sees print services enabled via = AD or master print servers broadcasting the information on the local = subnet. This allows a central point of control rather than randomness. = Snow Leopard (I don't think Leopard did this) actually tells you where = the printer is coming from now, so that's handy to see if they are ZC or = AD or whatever. > itune sharing from someone > else works fine. iTunes specifically limits its reach so that it can't extend = beyond the local subnet and definitely does unicast in addition to ZC, = so that doesn't really say much of anything, other than potentially = invalidating your results. > For some reason, things tend to work better on my > side of universe. :) =20 I'm sure it does, but not for the reasons you think they do. > Allen, if you want to get stuck on stone age > tools, I won't stop you. >=20 =09 Multicast has a time and place (mainly for small, non-busy = networks). Using it without understanding the network impact is never a = good idea. FWIW, I've seen multicast traffic bring down an entire campus of = tens of thousands of machines due to routers and switches having bugs = where they didn't subtract from the packet's TTL. I'm not the only one = with these types of experiences. Anything multicast is going to have a = very large uphill battle for adoption because of these widespread = problems. Many network vendors really don't get this one right, for = some reason.=