mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Ted Dunning <ted.dunn...@gmail.com>
Subject Re: Collocations in Mahout?
Date Fri, 08 Jan 2010 00:49:19 GMT
It definitely belongs.

And besides lots and lots of the data in large scale machine learning looks
like text.  Friends on linkedIn, history of traffic violations for
insurance, list of users who have clicked on an ad, the list goes on
forever.

Basically "text" is an ordered sequence of symbols and you encounter that
all over the place.  Cooccurrence at the window and the document level is
very widely applicable.



On Thu, Jan 7, 2010 at 4:03 PM, Otis Gospodnetic <otis_gospodnetic@yahoo.com
> wrote:

> NLP does fall under the Mahout umbrella, I'd say.  Future subproject
> perhaps?




-- 
Ted Dunning, CTO
DeepDyve

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message