mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <>
Subject Re: Collocations in Mahout?
Date Fri, 08 Jan 2010 00:49:19 GMT
It definitely belongs.

And besides lots and lots of the data in large scale machine learning looks
like text.  Friends on linkedIn, history of traffic violations for
insurance, list of users who have clicked on an ad, the list goes on

Basically "text" is an ordered sequence of symbols and you encounter that
all over the place.  Cooccurrence at the window and the document level is
very widely applicable.

On Thu, Jan 7, 2010 at 4:03 PM, Otis Gospodnetic <
> wrote:

> NLP does fall under the Mahout umbrella, I'd say.  Future subproject
> perhaps?

Ted Dunning, CTO

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message