Subject: Re: Difference between clustering and classification in hadoop
From: unmesha sreeveni <unmeshabiju@gmail.com>
To: User Hadoop
Reply-To: user@hadoop.apache.org
Date: Sat, 23 Nov 2013 08:35:45 +0530

Thanks Devin :) That was a nice explanation.

On Fri, Nov 22, 2013 at 6:20 PM, Devin Suiter RDX <dsuiter@rdx.com> wrote:

> They are both machine learning techniques. Classification is known as
> "supervised learning": you feed the engine data with known patterns and
> tell it what the key nodes are. Clustering is "unsupervised learning":
> you let the algorithm "guess" at what is significant in the
> correlations it picks up. Spam filtering is a popular example of
> classification, and image indexing is a popular example of clustering.
> They are mainly used on Hadoop because, when it comes to machine
> learning, the more data that passes through the algorithm, the more
> accurate it should be, and Hadoop can handle large data better than
> anything else around at the moment.
>
> *Devin Suiter*
> Jr. Data Solutions Software Engineer
> 100 Sandusky Street | 2nd Floor | Pittsburgh, PA 15212
> Google Voice: 412-256-8556 | www.rdx.com
>
>
> On Fri, Nov 22, 2013 at 2:54 AM, unmesha sreeveni <unmeshabiju@gmail.com> wrote:
>
>> What are the differences between classification algorithms and
>> clustering algorithms in Hadoop?
>>
>> --
>> *Thanks & Regards*
>>
>> Unmesha Sreeveni U.B
>> *Junior Developer*
>

--
*Thanks & Regards*

Unmesha Sreeveni U.B
*Junior Developer*
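
The supervised/unsupervised distinction described in the reply can be sketched in a few lines of Python. Everything below is an assumption made for illustration and is not from the thread: the toy 2-D points, the "ham"/"spam" labels, the choice of a nearest-centroid classifier, and a small k-means seeded with the first k points. It is plain single-machine Python, not Hadoop code.

```python
from math import dist  # Euclidean distance, Python 3.8+


def centroid(points):
    """Mean of a non-empty list of equal-length tuples."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def train_classifier(labeled_points):
    """Supervised: every training point comes with a known label.

    Learns one centroid per label (nearest-centroid classifier).
    """
    by_label = {}
    for point, label in labeled_points:
        by_label.setdefault(label, []).append(point)
    return {label: centroid(pts) for label, pts in by_label.items()}


def classify(model, point):
    """Return the label whose centroid is closest to the point."""
    return min(model, key=lambda label: dist(model[label], point))


def cluster(points, k, iterations=10):
    """Unsupervised: no labels are given; the groups are discovered.

    Plain k-means, seeded with the first k points (an illustrative choice).
    """
    centers = list(points[:k])
    groups = []
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: dist(centers[i], p))
            groups[nearest].append(p)
        # Keep the old center if a group ends up empty.
        centers = [centroid(g) if g else centers[i] for i, g in enumerate(groups)]
    return groups


# Hypothetical toy data: two well-separated blobs.
points = [(1.0, 1.0), (1.5, 2.0), (8.0, 8.0), (9.0, 9.5)]
labels = ["ham", "ham", "spam", "spam"]

# Classification needs the labels up front.
model = train_classifier(list(zip(points, labels)))
predicted = classify(model, (7.5, 9.0))

# Clustering sees only the points and finds the two groups itself.
groups = cluster(points, k=2)
```

The classifier can name its answer ("spam") because the names were supplied during training. The clustering call returns only anonymous groups; deciding what each group means is left to whoever inspects them.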