Return-Path: X-Original-To: apmail-hadoop-user-archive@minotaur.apache.org Delivered-To: apmail-hadoop-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C73B0D2B5 for ; Fri, 18 Jan 2013 06:52:26 +0000 (UTC) Received: (qmail 58875 invoked by uid 500); 18 Jan 2013 06:52:22 -0000 Delivered-To: apmail-hadoop-user-archive@hadoop.apache.org Received: (qmail 58048 invoked by uid 500); 18 Jan 2013 06:52:20 -0000 Mailing-List: contact user-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hadoop.apache.org Delivered-To: mailing list user@hadoop.apache.org Received: (qmail 58012 invoked by uid 99); 18 Jan 2013 06:52:19 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Jan 2013 06:52:19 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of dontariq@gmail.com designates 209.85.220.174 as permitted sender) Received: from [209.85.220.174] (HELO mail-vc0-f174.google.com) (209.85.220.174) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 18 Jan 2013 06:52:15 +0000 Received: by mail-vc0-f174.google.com with SMTP id n11so1377011vch.19 for ; Thu, 17 Jan 2013 22:51:54 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-received:mime-version:in-reply-to:references:from:date:message-id :subject:to:content-type; bh=IE+0SK+5WaYK89hy+lJPY+XpK/VNjgFxXQ2kNWLarhU=; b=GE8G4PfZ3wgJQAY8oIggXuq9TP52+iQmn3f/JOu6vI2ljXksl+qdjH6DlwhRMlcmmK c8o0dP1i9xETHl2rvsqdU66LoZBOzqQAXbRfDOWCv5B7wjlblbBI9+39QdzvKtadO/dL JqBQ7/80vMaZaHHnx3k6Sd5dVRontbaNGgf2EpEAwExz5OD0e6Kp+JsWZTAuChkenzPA yCge3aJFkZ85NX3cl/lG2L5EyWO6qhL9YvoJ78M7y3kOwlpbOHkLPwaVyF2QPCvaDjBu DJFPeuJo9mGUjI0Erhiizd2QY16XaeJ6rbdCqXOW3D87cJilXC/v60HqgeyxBDlXSWiY B1AA== X-Received: by 10.220.239.71 with SMTP id kv7mr8339030vcb.53.1358491914220; Thu, 17 Jan 2013 22:51:54 -0800 (PST) MIME-Version: 1.0 Received: by 10.58.34.16 with HTTP; Thu, 17 Jan 2013 22:51:14 -0800 (PST) In-Reply-To: References: From: Mohammad Tariq Date: Fri, 18 Jan 2013 12:21:14 +0530 Message-ID: Subject: Re: Query: Hadoop's threat to Informatica To: "user@hadoop.apache.org" Content-Type: multipart/alternative; boundary=14dae9d24a1cfa359504d38a8b28 X-Virus-Checked: Checked by ClamAV on apache.org --14dae9d24a1cfa359504d38a8b28 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Hello Sameer, Pl find my comments embedded below : Warm Regards, Tariq https://mtariq.jux.com/ cloudfront.blogspot.com On Fri, Jan 18, 2013 at 11:21 AM, Sameer Jain wrote: > Hi, > > > > I am trying to understand the different data analysis algorithms availabl= e > in the market. Analyst opinion suggests that Informatica and Hadoop have > the best offerings in this space. > > > > However, I am not very clear as to how the two are different and how they > compete, because Hadoop is being used by IBM etc. Since you appear to be = a > fairly seasoned expert in this domain, I would like to get your perspecti= ve > on the following: > > > > I would hugely appreciate any thoughts/insights around > > =B7 The workings of Hadoop/Mapreduce > >>Hadoop is an open source platform that allows us to store and process huge, really huge, amount of data over a network of machines(need not be very sophisticated). It has 2 layers viz : HDFS & MapReduce for storage & processing respectively. > =B7 Informatica=92s product offering > >>They can tell you better. This list is specific to Hadoop ecosystem. > =B7 A comparison of which one of these is better > >>Depends upon the particular use case. One size doesn't fit all. > =B7 A view of can and/or is Hadoop in competition with Informati= ca. > >>I don't think so. Informatica is basically an ETL thing(if I am not wrong), while we leverage Hadoop's power to create ETL tools with the Help of different Hadoop sub projects. Though it is possible to use them together. > > > Regards, > > Sameer > > > > *Sameer Jain* > ------------------------------ > > Research Lead > > Evalueserve > > Office: + 91 124 4621615 > > Mob: + 91 7827256066 > > Fax: + 91 124 406 3430 > > www.evalueserve.com > > > > > > . > > ------------------------------ > > The information in this e-mail is the property of Evalueserve and is > confidential and privileged. It is intended solely for the addressee. > Access to this email by anyone else is unauthorized. If you are not the > intended recipient, any disclosure, copying, distribution or any action > taken in reliance on it is prohibited and will be unlawful. If you receiv= e > this message in error, please notify the sender immediately and delete al= l > copies of this message. > --14dae9d24a1cfa359504d38a8b28 Content-Type: text/html; charset=windows-1252 Content-Transfer-Encoding: quoted-printable
Hello Sameer,

=A0 =A0 =A0Pl find my com= ments embedded below :


On Fri, Jan 18, 2013 at 11:21 AM, Sameer= Jain <Sameer.Jain@evalueserve.com> wrote:

Hi,<= /p>

=A0<= /p>

I am trying= to understand the different data analysis algorithms available in the mark= et. Analyst opinion suggests that Informatica and Hadoop have the best offe= rings in this space.

=A0<= /p>

However, I = am not very clear as to how the two are different and how they compete, bec= ause Hadoop is being used by IBM etc. Since you appear to be a fairly seaso= ned expert in this domain, I would like to get your perspective on the following:

=A0<= /p>

I would hug= ely appreciate any thoughts/insights around

=B7=A0=A0=A0=A0= =A0=A0=A0=A0 The wor= kings of Hadoop/Mapreduce

>= ;>Hadoop is an open source platform that allows=A0
us to store and process huge, really huge, amount
of data over a = network of machines(need not be=A0
very sophisticated). It has 2 = layers viz : HDFS=A0&=A0
MapReduce for storage & processi= ng respectively.

=B7=A0=A0=A0=A0= =A0=A0=A0=A0 Informa= tica=92s product offering

>= ;>They can tell you better. This list is specific to
Hadoop ecosystem.=A0

=B7=A0=A0=A0=A0= =A0=A0=A0=A0 A compa= rison of which one of these is better

>>Depends upon the particular use case. One size
doesn't fit all.=A0

=B7=A0=A0=A0=A0= =A0=A0=A0=A0 A view = of can and/or is Hadoop in competition with Informatica.

>>I don't think so. Informatica is ba= sically an ETL thing(if I=A0
am not wrong),=A0while we leverage Hadoop's power to create=A0
ETL tools with the Help of different Hadoop sub projects.
= Though it is possible to use them together.

=A0<= /p>

Regards,

Sameer

=A0<= /p>

Sameer Jain


Research Lead

Evalueserve

Office: + 91 124 4621615

Mob: + 91 7827256066

Fax: + 91 124 406 3430

www.eva= lueserve.com

=A0<= /p>

=A0

= .




The information in this e-mail is the property of Evalueserve and is confid= ential and privileged. It is intended solely for the addressee. Access to t= his email by anyone else is unauthorized. If you are not the intended recip= ient, any disclosure, copying, distribution or any action taken in reliance on it is prohibited and will be unlawful. = If you receive this message in error, please notify the sender immediately = and delete all copies of this message.

--14dae9d24a1cfa359504d38a8b28--