mahout-dev mailing list archives

From Benson Margulies <bimargul...@gmail.com>
Subject Re: Possible contribution at somewhat of a tangent to Mahout
Date Sat, 03 Oct 2009 22:51:32 GMT
If I succeed in getting this thing published one way or another, I will
be amused to find out if you can think of a way to make it better.

On Sat, Oct 3, 2009 at 6:47 PM, Ted Dunning <ted.dunning@gmail.com> wrote:

> This is, indeed, a difficult choice.
>
> I would tend to say no.  Rationale is that the code is C++, standalone, has
> no points of integration or use and there isn't a volunteer to make it
> better.
>
> On Sat, Oct 3, 2009 at 9:45 AM, Sean Owen <srowen@gmail.com> wrote:
>
> > I personally am caught between a desire on one hand to be inclusive of
> > everything, and a desire on the other hand to not make the project a
> > collection of bits and bobs from all over, with some algorithms
> > existing in C++, others in Java, some distributed, some not, some
> > supported, some a one-time dump, etc. It really harms end users'
> > ability to place what Mahout 'is' and how much to expect of it. Either
> > people will be surprised that some new scratch code isn't bug-free,
> > or, will assume that the mature bits of the code are probably just
> > very rough too when they may not be.
> >
> > The latter wins out in my mind, in this case --  it 'feels' like a
> > different project at this point.
> >
> > Let me however revive my suggestion that Mahout include a 'sandbox'
> > module of sorts to host anything at all. This neatly allows for
> > incorporation of anything, in any state, without confusing users as to
> > what should be expected of Mahout 'proper', which should be a
> > reasonably high bar come version 1.0.
> >
> > On Sat, Oct 3, 2009 at 5:17 PM, Benson Margulies <bimargulies@gmail.com>
> > wrote:
> > > Folks,
> > >
> > > I may be in a position to contribute a very slick implementation of the
> > > Brown, dePietro, etc. bigram mutual information word clustering scheme
> > > sometime soon. It is written in C++, and if there's any map-reduce, it's
> > > via OpenMP, not hadoop :-).
> > >
> > > As an ASF member, if I'm facilitating getting something useful out as
> > > open source, I'd rather push it out at Apache.
> > >
> > > Any interest in stretching the Mahout tent out to accommodate it?
> > >
> > > I'm asking now because I'm starting a negotiation with the academic
> > > owner thereof, and it would be useful to know in advance if I have a
> > > tentative home for it at Apache as opposed to having to just dump it
> > > into SourceForge.
> > >
> > > You could take the attitude that it's part of Mahout as a challenge:
> > > can anyone out there come up with a practical variation in Java/Hadoop?
> > >
> > > --benson
> > >
> >
>
>
>
> --
> Ted Dunning, CTO
> DeepDyve
>
