From: Ryan McKinley
Date: Wed, 30 May 2007 11:34:00 -0700
To: java-user@lucene.apache.org
Subject: Re: MoreLikeThis API changes?
Message-ID: <465DC398.3020101@gmail.com>
In-Reply-To: <324465.77161.qm@web26001.mail.ukl.yahoo.com>

mark harwood wrote:
>>> I want to return the "interesting" terms used for MLT
>
> Could you do this using Query.extractTerms() on the rewritten version of the MoreLikeThis query (a BooleanQuery)?
>

thanks! that works and avoids the PriorityQueue traversal problems. I can even get the boost (normalized tf/idf) using BooleanQuery.clauses().

I will post a patch that:
1. lets you set the Similarity
2. makes isNoiseWord() protected

thanks
ryan
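
For readers following the thread, the approach discussed above might look roughly like the sketch below. It assumes a Lucene 2.x-era classpath with the contrib MoreLikeThis class in org.apache.lucene.search.similar (the package differs in later releases, and Query.extractTerms() was later moved off Query). The class name MltTerms and both method names are hypothetical helpers, not part of Lucene or of the patch mentioned in the message. It uses getClauses(), the array-returning counterpart of the clauses() call named above.

```java
import java.io.IOException;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.Term;
import org.apache.lucene.search.BooleanClause;
import org.apache.lucene.search.BooleanQuery;
import org.apache.lucene.search.Query;
import org.apache.lucene.search.TermQuery;
import org.apache.lucene.search.similar.MoreLikeThis;

/** Hypothetical helper class for illustration; not part of Lucene. */
final class MltTerms {

    /**
     * Collects the terms of the rewritten MoreLikeThis query for one document,
     * as suggested in the thread.
     */
    static Set<Term> interestingTerms(IndexReader reader, int docNum) throws IOException {
        MoreLikeThis mlt = new MoreLikeThis(reader);
        Query query = mlt.like(docNum);           // built from the document's top terms
        Query rewritten = query.rewrite(reader);  // reduce to primitive queries
        Set<Term> terms = new HashSet<Term>();
        rewritten.extractTerms(terms);            // fills the set with the query's terms
        return terms;
    }

    /**
     * Reads the per-term boost from each TermQuery clause of a BooleanQuery.
     * Keys are term text only, so the same text in two fields would collide;
     * non-TermQuery clauses are skipped.
     */
    static Map<String, Float> termBoosts(Query query) {
        Map<String, Float> boosts = new LinkedHashMap<String, Float>();
        if (query instanceof BooleanQuery) {
            BooleanClause[] clauses = ((BooleanQuery) query).getClauses();
            for (int i = 0; i < clauses.length; i++) {
                Query sub = clauses[i].getQuery();
                if (sub instanceof TermQuery) {
                    Term term = ((TermQuery) sub).getTerm();
                    boosts.put(term.text(), Float.valueOf(sub.getBoost()));
                }
            }
        }
        return boosts;
    }
}
```

Passing the query returned by MoreLikeThis.like() to termBoosts() directly, rather than its rewritten form, may be the safer choice: a BooleanQuery with a single clause can be rewritten into that clause alone, in which case there would be no clauses to walk.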