mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From JAGANADH G <jagana...@gmail.com>
Subject Re: Create vector from text
Date Thu, 11 Oct 2012 08:52:14 GMT
On Thu, Oct 11, 2012 at 12:29 PM, Ted Dunning <ted.dunning@gmail.com> wrote:

> You have to tokenize your text and then use some form of vector encoding.
>
> If you have a known dictionary of all interesting words, you can simply
> make a vector as long as the number of words in your dictionary and put a 1
> in the right place.
>
> If you don't want to do that either because you don't know all the words in
> advance or because the number of words is too large, you can use
> a TextValueEncoder to do the deed.  There is sample code in the Mahout in
> Action code for this and Chapter 14 in Mahout in Action talks about the
> code.  You can get the code from http://github.com/tdunning/MiA
>
>

Hi Ted

Thanks for the pointer.
It works.
Sorry to shoot another question.
Is there any way get lable for classifier result as of 0.7 API

Best regards

-- 
**********************************
JAGANADH G
http://jaganadhg.in
*ILUGCBE*
http://ilugcbe.org.in

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message