mahout-user mailing list archives

From David Rahman <drahman1...@googlemail.com>
Subject Re: Mahout and multi-label classification
Date Tue, 08 Nov 2011 14:21:02 GMT
Sorry, I mean classifier, my bad...
Unfortunately I can't release the raw data. And I don't really know how to
blind the data correctly. I could give the categories numbers and hash the
text (title and abstract) if that would be enough...

Thanks and regards,
David
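
The blinding Ted suggests in the quoted message below (replace all terms and
categories by numbers, permute them, and sort within each document) could be
sketched roughly as follows. This is only an illustration in plain Python,
not anything Mahout provides: the function name blind_corpus, the tokenizer
regex, and the seed parameter are all assumptions made for this example.
Note that hashing the whole title or abstract as a single string would leave
no term-level signal to train on, so the sketch maps each term separately.
Permuted IDs alone are not a strong anonymity guarantee, since term
frequencies are preserved.

```python
import random
import re


def blind_corpus(docs, seed=0):
    """Blind a corpus of (text, categories) pairs.

    Hypothetical helper: every distinct term and category is replaced by an
    integer ID drawn from a shuffled range, and the IDs inside each document
    are sorted so the original word order is lost.
    """
    rng = random.Random(seed)

    # Very simple tokenizer (an assumption): lowercase alphanumeric runs.
    tokenized = [
        (re.findall(r"[a-z0-9]+", text.lower()), list(cats))
        for text, cats in docs
    ]

    terms = sorted({t for toks, _ in tokenized for t in toks})
    cats = sorted({c for _, cs in tokenized for c in cs})

    # Shuffle the ID ranges so IDs do not reveal alphabetical order.
    term_ids = list(range(len(terms)))
    cat_ids = list(range(len(cats)))
    rng.shuffle(term_ids)
    rng.shuffle(cat_ids)

    term_map = dict(zip(terms, term_ids))
    cat_map = dict(zip(cats, cat_ids))

    blinded = [
        (sorted(term_map[t] for t in toks), sorted(cat_map[c] for c in cs))
        for toks, cs in tokenized
    ]
    # Keep term_map and cat_map private; only `blinded` would be released.
    return blinded, term_map, cat_map
```

Only the blinded list would be shared; the two maps stay with the data owner
so that results can be translated back to real terms and categories later.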

2011/11/8 Ted Dunning <ted.dunning@gmail.com>

> Recommender?  Recommenders are not normally used for adding categories to
> documents.
>
> Is it possible for you to release blinded data in which all terms and
> categories are replaced by numbers and permuted?  Or even just stemmed and
> sorted as with the RCV1 corpus?
>
> Having such a test corpus would make it much easier to get folks to help.
>
> On Tue, Nov 8, 2011 at 7:56 AM, David Rahman <drahman1985@googlemail.com> wrote:
>
> > I have a lot of data from where I work. The data are documents (title +
> > abstract), and each document can have one or more categories (e.g. social
> > sciences + politics). We want to build a recommender and analyze the
> > output for further testing.
> >
> > Thanks and regards,
> > David
> >
> > 2011/11/8 Ted Dunning <ted.dunning@gmail.com>
> >
> > > The practical techniques for such problems are pretty diverse.
> > >
> > > One method is to simply define multiple binary classifiers.  If you can
> > > stratify your labels, then you can have some labels depend on others.
> > > > Another option is to find commonly occurring sets of labels and build
> > > classifiers for those sets directly.
> > >
> > > > Do you have an application in mind?  Do you have data available?
> > >
> > > > On Tue, Nov 8, 2011 at 6:37 AM, David Rahman <drahman1985@googlemail.com> wrote:
> > >
> > > > > Yes, I was asking for an example where multiple labels might be applied
> > > > > to a single example.
> > > >
> > > > Thanks and regards,
> > > > David
> > > >
> > > > 2011/11/8 Ted Dunning <ted.dunning@gmail.com>
> > > >
> > > > > What exactly do you mean by multi-label classification?
> > > > >
> > > > > The 20 newsgroup example has many possible label values.
> > > > >
> > > > > > Are you asking for an example where multiple labels might be applied
> > > > > > to a single example?  If so, no, we don't have a nice example of that.
> > > > >
> > > > > > On Tue, Nov 8, 2011 at 5:36 AM, David Rahman <drahman1985@googlemail.com> wrote:
> > > > >
> > > > > > Hi,
> > > > > >
> > > > > > > I have a general question about multi-label classification. Binary- or
> > > > > > > single-label classification is working, as shown in several examples
> > > > > > > (Wikipedia and 20Newsgroup, Mahout In Action book...).
> > > > > > >
> > > > > > > Are there some working examples of multi-label classification for
> > > > > > > trying out?
> > > > > > > Or is there some data available on how Mahout performs on multi-label
> > > > > > > classification problems?
> > > > > >
> > > > > > Thanks and regards,
> > > > > > David
> > > > > >
> > > > >
> > > >
> > >
> >
>
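
The two approaches Ted mentions in the quoted thread above (one binary
classifier per label, and classifiers built directly on commonly occurring
sets of labels) both start from a reshaping of the label data. A minimal
sketch of that reshaping in plain Python follows; it does not use any Mahout
API, and the function names and the min_count threshold are assumptions
chosen for illustration. Training the actual classifiers on these targets is
left out.

```python
from collections import Counter


def binary_relevance_targets(label_sets):
    """One 0/1 target vector per label, for training one binary classifier each."""
    labels = sorted({label for s in label_sets for label in s})
    return {
        label: [1 if label in s else 0 for s in label_sets]
        for label in labels
    }


def label_set_targets(label_sets, min_count=2):
    """Treat each frequently occurring label combination as a single class.

    Combinations seen fewer than min_count times (an assumed cutoff) get
    None and would need to be handled another way, for example by the
    per-label binary classifiers.
    """
    counts = Counter(frozenset(s) for s in label_sets)
    return [
        frozenset(s) if counts[frozenset(s)] >= min_count else None
        for s in label_sets
    ]
```

With documents labelled {social sciences, politics}, {politics},
{social sciences, politics} and {economics}, the first function yields one
target vector per label, while the second keeps only the combination
{social sciences, politics} as a class of its own because it occurs twice.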
