mahout-user mailing list archives

From zaki rahaman <>
Subject Collocations in Mahout?
Date Tue, 05 Jan 2010 17:02:00 GMT
Pardon my ignorance, as this is probably best handled by an NLP package like
GATE or LingPipe, but does Mahout provide anything for collocations? Or does
anyone know of a MapReducible way to calculate something like t-values for
tokens in N-grams? I have quite a large collection that I need to prune,
filter, and preprocess, and I expect it to still be of significant size
after that.
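
To make the question concrete, here is a minimal sketch of the kind of computation meant, for bigrams only. It assumes the common t-score approximation t = (c12 - c1*c2/N) / sqrt(c12), where c12 is the bigram count, c1 and c2 are the unigram counts, and N is the total token count. The function names and the toy documents are hypothetical and not part of Mahout; the map and reduce steps are plain Python stand-ins for what a MapReduce job would do.

```python
import math
from collections import Counter
from functools import reduce

def map_counts(tokens):
    """Map step: partial unigram/bigram counts for one document."""
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))  # bigrams stay within a document
    return unigrams, bigrams, len(tokens)

def reduce_counts(a, b):
    """Reduce step: merge partial counts; addition is associative,
    so partial results can be combined in any order."""
    return a[0] + b[0], a[1] + b[1], a[2] + b[2]

def t_scores(unigrams, bigrams, n):
    """t-score per bigram, using the sample variance approximation s^2 ~ c12/N."""
    scores = {}
    for (w1, w2), c12 in bigrams.items():
        expected = unigrams[w1] * unigrams[w2] / n  # count expected under independence
        scores[(w1, w2)] = (c12 - expected) / math.sqrt(c12)
    return scores

# Toy input (hypothetical): each list stands in for one tokenized document.
docs = [
    "new york is a big city".split(),
    "i moved to new york".split(),
    "a new plan for a big move".split(),
]
unigrams, bigrams, n = reduce(reduce_counts, map(map_counts, docs))
scores = t_scores(unigrams, bigrams, n)
```

Because only counts are needed, the counting could run as one pass over the collection and the scoring as a second pass over the merged counts; whether that scales well enough for longer N-grams is part of what I am asking.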

Zaki Rahaman
