Date: Wed, 21 Sep 2011 11:01:59 -0700 (PDT)
From: Bill Habermaas
To: common-user@hadoop.apache.org
Subject: RE: risks of using Hadoop

Amen to that. I haven't heard a good rant in a long time; I am definitely
amused and entertained.

As a veteran of 3 years with Hadoop I will say that the SPOF issue is
whatever you want to make it. But it has not, nor will it ever, deter me
from using this great system.
Every system has its risks, and they can be minimized by careful
architectural crafting and intelligent usage.

Bill

-----Original Message-----
From: Michael Segel [mailto:michael_segel@hotmail.com]
Sent: Wednesday, September 21, 2011 1:48 PM
To: common-user@hadoop.apache.org
Subject: RE: risks of using Hadoop

Kobina

The points 1 and 2 are definitely real risks. SPOF is not.

As I pointed out in my mini-rant to Tom, your end users / developers who
use the cluster can do more harm to your cluster than a SPOF machine
failure.

I don't know what one would consider a 'long learning curve'. With the
adoption of any new technology, you're talking at least 3-6 months based
on the individual and the overall complexity of the environment.

Take anyone who is a strong developer, put them through Cloudera's
training, plus some play time, and you've shortened the learning curve.
The better the Java developer, the easier it is for them to pick up Hadoop.

I would also suggest taking the approach of hiring a senior person who can
cross train and mentor your staff. This too will shorten the runway.

HTH

-Mike

> Date: Wed, 21 Sep 2011 17:02:45 +0100
> Subject: Re: risks of using Hadoop
> From: kobina.kwarko@gmail.com
> To: common-user@hadoop.apache.org
>
> Jignesh,
>
> Will your point 2 still be valid if we hire very experienced Java
> programmers?
>
> Kobina.
>
> On 20 September 2011 21:07, Jignesh Patel wrote:
>
> >
> > @Kobina
> > 1. Lack of skill set
> > 2. Longer learning curve
> > 3. Single point of failure
> >
> >
> > @Uma
> > I am curious to know about .20.2: is that stable? Is it the same as the
> > one you mention in your email (Federation changes)? If I need a scaled
> > NameNode and append support, which version should I choose?
> >
> > Regarding single point of failure, I believe Hortonworks (a.k.a. Yahoo)
> > is updating the Hadoop API. When will that be integrated with Hadoop?
> >
> > If I need
> >
> >
> > -Jignesh
> >
> > On Sep 17, 2011, at 12:08 AM, Uma Maheswara Rao G 72686 wrote:
> >
> > > Hi Kobina,
> > >
> > > Some experiences which may be helpful for you with respect to DFS.
> > >
> > > 1. Selecting the correct version.
> > >    I recommend using a 0.20.x version. It is pretty stable and all
> > >    other organizations prefer it. It is well tested as well.
> > >    Don't go for the 0.21 version. It is not a stable version, so it
> > >    is a risk.
> > >
> > > 2. You should perform thorough tests with your customer operations.
> > >    (Of course you will do this :-))
> > >
> > > 3. The 0.20.x version has the problem of SPOF.
> > >    If the NameNode goes down you will lose the data. One way of
> > >    recovering is by using the SecondaryNameNode. You can recover the
> > >    data up to the last checkpoint, but manual intervention is
> > >    required.
> > >    In the latest trunk, SPOF will be addressed by HDFS-1623.
> > >
> > > 4. 0.20.x NameNodes cannot scale. Federation changes are included in
> > >    the latest versions (I think in 0.22). This may not be a problem
> > >    for your cluster, but please consider this aspect as well.
> > >
> > > 5. Please select the Hadoop version depending on your security
> > >    requirements. There are versions available with security as well
> > >    in 0.20.x.
> > >
> > > 6. If you plan to use HBase, it requires append support. 0.20-append
> > >    has support for append. The 0.20.205 release will also have
> > >    append support but is not yet released. Choose the correct
> > >    version to avoid sudden surprises.
> > >
> > >
> > >
> > > Regards,
> > > Uma
> > > ----- Original Message -----
> > > From: Kobina Kwarko
> > > Date: Saturday, September 17, 2011 3:42 am
> > > Subject: Re: risks of using Hadoop
> > > To: common-user@hadoop.apache.org
> > >
> > >> We are planning to use Hadoop in my organisation for quality of
> > >> service analysis on CDR records from mobile operators.
> > >> We are thinking of having a small cluster of maybe 10 - 15 nodes
> > >> and I'm preparing the proposal. My office requires that I provide
> > >> some risk analysis in the proposal.
> > >>
> > >> Thank you.
> > >>
> > >> On 16 September 2011 20:34, Uma Maheswara Rao G 72686 wrote:
> > >>
> > >>> Hello,
> > >>>
> > >>> First of all, where are you planning to use Hadoop?
> > >>>
> > >>> Regards,
> > >>> Uma
> > >>> ----- Original Message -----
> > >>> From: Kobina Kwarko
> > >>> Date: Saturday, September 17, 2011 0:41 am
> > >>> Subject: risks of using Hadoop
> > >>> To: common-user
> > >>>
> > >>>> Hello,
> > >>>>
> > >>>> Please can someone point out some of the risks we may incur if we
> > >>>> decide to implement Hadoop?
> > >>>>
> > >>>> BR,
> > >>>>
> > >>>> Isaac.