Return-Path: Delivered-To: apmail-lucene-mahout-user-archive@minotaur.apache.org Received: (qmail 519 invoked from network); 24 Jun 2009 00:02:09 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 24 Jun 2009 00:02:09 -0000 Received: (qmail 762 invoked by uid 500); 24 Jun 2009 00:02:20 -0000 Delivered-To: apmail-lucene-mahout-user-archive@lucene.apache.org Received: (qmail 702 invoked by uid 500); 24 Jun 2009 00:02:20 -0000 Mailing-List: contact mahout-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: mahout-user@lucene.apache.org Delivered-To: mailing list mahout-user@lucene.apache.org Received: (qmail 692 invoked by uid 99); 24 Jun 2009 00:02:20 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 24 Jun 2009 00:02:20 +0000 X-ASF-Spam-Status: No, hits=3.4 required=10.0 tests=HTML_MESSAGE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [212.82.104.162] (HELO web24605.mail.ird.yahoo.com) (212.82.104.162) by apache.org (qpsmtpd/0.29) with SMTP; Wed, 24 Jun 2009 00:02:09 +0000 Received: (qmail 98448 invoked by uid 60001); 24 Jun 2009 00:01:48 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.co.uk; s=s1024; t=1245801708; bh=rCA4VY4yeqzjzK0C0q4fsAnjQrKr5UoslCV+h0k7wMo=; h=Message-ID:X-YMail-OSG:Received:X-Mailer:References:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=2hUzOGAtbOHfhAAfYyc2cNWURBGMH8rrppSD4HdF2XEr+ZSIzpuryxh67AgldXdnBLx1SmxiExUBx3Y22Cd3rB3XeNW7TprU+vxujbwL2mID+Sndz1kGDM2JbVNPyty1S0mKTkbuygK2cmA51Oqv0NA60zap+Ra1lvXwSiYVp5c= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.co.uk; h=Message-ID:X-YMail-OSG:Received:X-Mailer:References:Date:From:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=ji9S3YJZycMGW1u2c2o9KJjrUDnxnuyMjNnU3l37FnJkm4BEmL1xeiAQGj+DQk+ijWb3a9B4cq0k9mjCnrHEIGTbZLpOomSbzKjDQQqa1LOX1e4nh9ddAH8ZmLSKQEvBFA4cJwjQJPgZEFPfZNFBVOUV+bwoE9WFReFis2z8Qvg=; Message-ID: <347836.95459.qm@web24605.mail.ird.yahoo.com> X-YMail-OSG: RQNpceAVM1m9LQmd2EuBvSoFR61FEFHrERAQiMgrweQ1VKkl7Q9DVZbv8ZN86xDlkK5YEreGJnPYrInBrmoGMgJnbL1XUdruYduyg3DFtR6q75HmMrhICIHYABdrUtRsPSi5Tw9b9sw46MMoiiwHjNRnh0VgnWQyPBJ9gPJwa4CXYaRJjWA0kzkdFjGnfv1fJkeQxwg_e4dPLELMMpCitAJ5YEqx08ZJsdrBcfqQX4BuCiBGG4M05THEDJAmm5NXKB52vFpqYUsqG9Nxr2E- Received: from [79.76.203.213] by web24605.mail.ird.yahoo.com via HTTP; Tue, 23 Jun 2009 17:01:48 PDT X-Mailer: YahooMailRC/1277.43 YahooMailWebService/0.7.289.15 References: <24175732.post@talk.nabble.com> <24175800.post@talk.nabble.com> Date: Tue, 23 Jun 2009 17:01:48 -0700 (PDT) From: Paul Jones Subject: LSI, cosine and others which use vectors To: mahout-user@lucene.apache.org In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="0-755688156-1245801708=:95459" X-Virus-Checked: Checked by ClamAV on apache.org --0-755688156-1245801708=:95459 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Yes another question, am going through a rapid learning curve...=0A=0AAll t= hese vector based systems, which require you to build a term-doc etc, are t= hey of any use in a system where the data is changing, i.e lets assume the = docs are webpages, which are being crawled, and hence updated. Surely if th= ere is a vector diagram being formed, then the position of these vectors ch= anges based on the changes (size, content) of the entire matrix, or am I mi= ssing something here.=0A=0AIf the above is correct, then is a actual live p= roject how is this done, are distances worked out on a per-day type of basi= s, and the indexes then updated ?=0A=0APaul=0A=0A=0A=0A --0-755688156-1245801708=:95459--