mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Nick Pentreath <nick.pentre...@gmail.com>
Subject Re: Adding dimensions to an existing TF-IDF vector
Date Sat, 25 Jun 2011 11:02:40 GMT
If you want some technical papers etc that cover how (and also why) it
works, check out http://hunch.net/~jl/projects/hash_reps/index.html


On Sat, Jun 25, 2011 at 1:51 AM, Ted Dunning <ted.dunning@gmail.com> wrote:

> Look at the class FeatureValueEncoder.  The test cases show most of the
> ways
> that is used.
>
> Also the class TrainNewsGroups in examples.
>
> See chapters 14 and 16 of Mahout in Action.  The sample server for chapter
> 16 does encoding like you need.
>
> On Fri, Jun 24, 2011 at 5:04 PM, Mark <static.void.dev@gmail.com> wrote:
>
> > Where can I find out more about this hashed encoding you mentioned?
> >
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message