Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 11317 invoked from network); 7 Nov 2007 17:50:26 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 7 Nov 2007 17:50:26 -0000 Received: (qmail 58086 invoked by uid 500); 7 Nov 2007 17:50:08 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 58050 invoked by uid 500); 7 Nov 2007 17:50:07 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 58039 invoked by uid 99); 7 Nov 2007 17:50:07 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Nov 2007 09:50:07 -0800 X-ASF-Spam-Status: No, hits=-0.0 required=10.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of isaacrc82@gmail.com designates 209.85.198.187 as permitted sender) Received: from [209.85.198.187] (HELO rv-out-0910.google.com) (209.85.198.187) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 07 Nov 2007 17:50:44 +0000 Received: by rv-out-0910.google.com with SMTP id k20so1755254rvb for ; Wed, 07 Nov 2007 09:49:49 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:mime-version:content-type:content-transfer-encoding:content-disposition; bh=jtT8FgGJVyfPL9eFsDG5Sc23LYfGE7S33JeQttlkhI0=; b=ivB9jb8H2FqXxj8eW8EsGZCwIDG/+vNQcOWhct9PjxjIxnAhXiXTQr8+bJ4xoz1H3NENY8irEWAyJZp1Pjn1C+q2y0j7pHtquZZDp5wlLYql9COE0F6BlNvOLS9olQEhj4IogL9yd9NXtZCFDrwmdmO2SMffHB7Kp8XGqcii0UM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:mime-version:content-type:content-transfer-encoding:content-disposition; b=m8QUGrJ1N7+JWjH1xj3gH1xOrRJFEjUE1deFmSuxpTCNCIOTzwUh+Y4CJ+JZb7ap6uOI5Zy9MoOIOg1QU6KIs+P72A42T+p7MUxe/yUJGeUtIBxWBBfIpr5WutgoghqhZz/Y2pS/YHfHFp8YV7A+SbPy69Zmyd+pnq8n74aWMC8= Received: by 10.141.15.19 with SMTP id s19mr3675225rvi.1194457788676; Wed, 07 Nov 2007 09:49:48 -0800 (PST) Received: by 10.140.133.6 with HTTP; Wed, 7 Nov 2007 09:49:48 -0800 (PST) Message-ID: Date: Wed, 7 Nov 2007 12:49:48 -0500 From: Ariel To: java-user@lucene.apache.org Subject: Re: How to build your custom termfreq vector an add it to the field ? MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline X-Virus-Checked: Checked by ClamAV on apache.org Then if I want to use another scoring formula I must to implement my own Query/Weigh/Scorer ? For example instead of cousine distance leiderbage distance or .. another. I'm studying Query/Weigh/Scorer classes to find out how to do that but there is not much documentation about that. I have seen I could change similarity factors extending the simlarity class, but I have not seen any example about changing scoring formula and changing the weight by term in the term vector. Do you know any tutorial about this ? What I want to do changing frecuency in the terms vector is this: for example instead of take the tf term frecuency of the term and stored in the vector I want to consider the correlation of the term with the other terms of the documents and store that measure by term in the vector so later with my custom similarity formula calculate the ranking of a document against a query considering the correlation between terms. Dou you think is a big mistake try to do this with lucene ??? Is there any way ? --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org