mahout-user mailing list archives

From Lance Norskog <goks...@gmail.com>
Subject Re: Cooccurrence job
Date Sat, 12 Nov 2011 23:02:00 GMT
Thanks.
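
To illustrate the point settled in the quoted thread below, here is a minimal sketch of the two merge semantics that were in question. It does not use Mahout's classes: sparse vectors are modeled as plain `Map<Integer, Double>` (index to value), and `SparseMergeSketch`, `mergeOverwrite`, and `mergeSum` are hypothetical names chosen for this example, not the actual `Vectors.merge` implementation. The sketch only shows that overwriting and summing agree whenever the inputs have no overlapping dimensions, and differ otherwise.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for the merge under discussion; NOT Mahout code.
public class SparseMergeSketch {

    // Overwrite semantics: a later vector's value replaces an earlier one
    // at the same index (the behavior described in the original question).
    public static Map<Integer, Double> mergeOverwrite(List<Map<Integer, Double>> vectors) {
        Map<Integer, Double> merged = new TreeMap<>();
        for (Map<Integer, Double> v : vectors) {
            merged.putAll(v);
        }
        return merged;
    }

    // Sum semantics: values that share an index are added together.
    public static Map<Integer, Double> mergeSum(List<Map<Integer, Double>> vectors) {
        Map<Integer, Double> merged = new TreeMap<>();
        for (Map<Integer, Double> v : vectors) {
            for (Map.Entry<Integer, Double> e : v.entrySet()) {
                merged.merge(e.getKey(), e.getValue(), Double::sum);
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        // Non-overlapping inputs: both strategies give the same result.
        List<Map<Integer, Double>> disjoint = List.of(Map.of(0, 1.0), Map.of(3, 2.0));
        System.out.println(mergeOverwrite(disjoint).equals(mergeSum(disjoint))); // true

        // Overlapping inputs: the strategies diverge at index 0.
        List<Map<Integer, Double>> overlapping = List.of(Map.of(0, 1.0), Map.of(0, 2.0));
        System.out.println(mergeOverwrite(overlapping)); // {0=2.0}
        System.out.println(mergeSum(overlapping));       // {0=3.0}
    }
}
```

Under the assumption stated in the thread (the inputs never share a dimension), the cheaper overwrite is safe; it would silently lose data only if that assumption were violated.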

On Sat, Nov 12, 2011 at 2:54 AM, Sebastian Schelter <ssc@apache.org> wrote:

> On 12.11.2011 11:26, Sean Owen wrote:
> > Looking at the uses of it, I think it receives as input vectors that
> > are "non overlapping" and is just stitching them together, so yes it's
> > correct.
> > But Sebastian can double-check.
>
> It is used exactly as Sean says. It is applied in the first pass over
> the data, which transposes the input matrix, and we are sure that there
> are no overlapping dimensions.
>
>
> --sebastian
>
> >
> > On Sat, Nov 12, 2011 at 2:48 AM, Lance Norskog <goksron@gmail.com> wrote:
> >>
> >> org.apache.mahout.math.hadoop.similarity.cooccurrence.Vectors.merge(Iterable<VectorWritable>)
> >>
> >> This ORs together several (sparse) VectorWritables. It does not sum
> >> overlapping dimensions; it just overwrites them with the value from
> >> the last vector in the list. Is this OK? Should they be summed?
> >>
> >> --
> >> Lance Norskog
> >> goksron@gmail.com
> >>
>
>


-- 
Lance Norskog
goksron@gmail.com
