mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Paul Jones <>
Subject LSI, cosine and others which use vectors
Date Wed, 24 Jun 2009 00:01:48 GMT
Yes another question, am going through a rapid learning curve...

All these vector based systems, which require you to build a term-doc etc, are they of any
use in a system where the data is changing, i.e lets assume the docs are webpages, which are
being crawled, and hence updated. Surely if there is a vector diagram being formed, then the
position of these vectors changes based on the changes (size, content) of the entire matrix,
or am I missing something here.

If the above is correct, then is a actual live project how is this done, are distances worked
out on a per-day type of basis, and the indexes then updated ?


  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message