mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Chameera Wijebandara <chameerawijeband...@gmail.com>
Subject Re: Classify Handwritten Digits
Date Tue, 28 Jan 2014 09:37:02 GMT
Thanks Ted.

Thanks,
    Chameera


On Fri, Jan 24, 2014 at 11:12 PM, Ted Dunning <ted.dunning@gmail.com> wrote:

> You can also put out lots of clusters and use cluster membership as the
> features for a classifier.
>
> There was a discussion here (or possibly on the dev@mahout list) on this
> topic several weeks ago.  Search the archives for "iris" and my name.
>
>
>
> On Fri, Jan 24, 2014 at 8:46 AM, Angus Macnab <angus.macnab@gmail.com
> >wrote:
>
> > You can do supervised learning by outputing the clusters and labeling
> them
> > 0-9.
> >
> > > On Jan 23, 2014, at 10:34 PM, Tharindu Rusira <
> tharindurusira@gmail.com>
> > wrote:
> > >
> > > On Fri, Jan 24, 2014 at 9:50 AM, Angus Macnab <angus.macnab@gmail.com
> > >wrote:
> > >
> > >> This is a pretty classic machine learning problem and can be handled
> > with
> > >> several different algorithms.  Logistic regression is the obvious
> > choice,
> > >> but clustering algorithms will work fine also.  Just decompose the
> > pixels
> > >> into a really long vector and train your algorithm with the
> input-output
> > >> pairs.  You can get 100% accuracy on this pretty easily if you are
> > careful
> > >> with your bias-variance decomposition.  This is a fun one for neural
> > >> networks too!
> > >>
> > >> Essentially any machine learning book will delve into greater detail
> on
> > >> this as the US postal digit data has been around for a long time.  I
> > think
> > >> Kaggle even had this as a training exercise for a while, so there's
> > >> probably a ton of discussion of various methods and algorithms on
> their
> > >> message boards.
> > >>
> > >> For kicks why don't you compare k-means clustering to logistic
> > regression
> > >> using Mahout?
> > > Hi Angus, Chameera's requirement is to classify handwritten digits, so
> > > could you please explain how could K-means clustering be helpful in
> this
> > > scenario? Of course it would find different clusters but this is still
> a
> > > classification problem. Please correct me if I'm wrong.
> > >
> > > Thanks,
> > >
> > >
> > >>
> > >> -Angus
> > >>
> > >>
> > >>
> > >>
> > >> On Thu, Jan 23, 2014 at 8:00 PM, Chameera Wijebandara <
> > >> chameerawijebandara@gmail.com> wrote:
> > >>
> > >>> Hi,
> > >>>
> > >>> I am trying to classify handwritten digits using mahout
> classification.
> > >> Any
> > >>> suggestion to come up with good solution?
> > >>>
> > >>> --
> > >>> Thanks,
> > >>>    Chameera
> > >
> > >
> > >
> > > --
> > > M.P. Tharindu Rusira Kumara
> > >
> > > Department of Computer Science and Engineering,
> > > University of Moratuwa,
> > > Sri Lanka.
> > > +94757033733
> > > www.tharindu-rusira.blogspot.com
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message