mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
Subject Re: Create vector from text
Date Thu, 11 Oct 2012 08:52:14 GMT
On Thu, Oct 11, 2012 at 12:29 PM, Ted Dunning <> wrote:

> You have to tokenize your text and then use some form of vector encoding.
> If you have a known dictionary of all interesting words, you can simply
> make a vector as long as the number of words in your dictionary and put a 1
> in the right place.
> If you don't want to do that either because you don't know all the words in
> advance or because the number of words is too large, you can use
> a TextValueEncoder to do the deed.  There is sample code in the Mahout in
> Action code for this and Chapter 14 in Mahout in Action talks about the
> code.  You can get the code from

Hi Ted

Thanks for the pointer.
It works.
Sorry to shoot another question.
Is there any way get lable for classifier result as of 0.7 API

Best regards


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message