From java-user-return-17280-apmail-lucene-java-user-archive=lucene.apache.org@lucene.apache.org Wed Nov 02 14:28:53 2005 Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 13127 invoked from network); 2 Nov 2005 14:27:48 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur.apache.org with SMTP; 2 Nov 2005 14:27:48 -0000 Received: (qmail 31532 invoked by uid 500); 2 Nov 2005 14:27:22 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 31510 invoked by uid 500); 2 Nov 2005 14:27:22 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 31494 invoked by uid 99); 2 Nov 2005 14:27:22 -0000 Received: from asf.osuosl.org (HELO asf.osuosl.org) (140.211.166.49) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Nov 2005 06:27:22 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received-SPF: pass (asf.osuosl.org: local policy) Received: from [129.6.101.41] (HELO rogue.ncsl.nist.gov) (129.6.101.41) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 02 Nov 2005 06:27:17 -0800 Received: from rogue.ncsl.nist.gov.nist.gov (localhost.localdomain [127.0.0.1]) by rogue.ncsl.nist.gov (8.13.1/8.13.1) with ESMTP id jA2EQxTC021660 for ; Wed, 2 Nov 2005 09:27:00 -0500 From: Ian Soboroff To: java-user@lucene.apache.org Subject: Re: Vector Model and Relevance Feedback References: <200511021310.18686.rj@last.fm> <4368C513.3010600@curtin.com> <4368C8E4.8080303@kbse.net> Date: Wed, 02 Nov 2005 09:26:59 -0500 In-Reply-To: <4368C8E4.8080303@kbse.net> (Stefan Gusenbauer's message of "Wed, 02 Nov 2005 15:10:44 +0100") Message-ID: <9cfvezb9lcc.fsf@rogue.ncsl.nist.gov> User-Agent: Gnus/5.110004 (No Gnus v0.4) Emacs/21.3 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Virus-Checked: Checked by ClamAV on apache.org X-Spam-Rating: minotaur.apache.org 1.6.2 0/1000/N Stefan Gusenbauer writes: > Is there an add on for lucene to get a real vector representation? > Does anyone has experiences with this issue? No code, but some small thinking. You can do hacks with boosts and whatnot, but I think in the end you really want a new Query subclass (and the other machinery to make that work). I've seen some of the boost hacks, and, ok, Rocchio is something of a hack (at least the way it's usually done), but adding a hack to a hack is just too much. Next, some joker is going to come along and want to to probabilistic LSI or language models and then you're up the creek ;-) I have a code infrastructure to handle nearly any VSM-style retrieval function (including Okapi BM25), but it's waiting for the low-level VSM query structure to make it worthwhile. If someone starts work in this area, I'd love to help out to the extent I can. Ian --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org