Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 49109 invoked from network); 16 Dec 2006 17:08:27 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 16 Dec 2006 17:08:27 -0000 Received: (qmail 11137 invoked by uid 500); 16 Dec 2006 17:08:30 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 10171 invoked by uid 500); 16 Dec 2006 17:08:27 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 10160 invoked by uid 99); 16 Dec 2006 17:08:27 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 16 Dec 2006 09:08:27 -0800 X-ASF-Spam-Status: No, hits=2.6 required=10.0 tests=HTML_00_10,HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (herse.apache.org: domain of markrmiller@gmail.com designates 64.233.182.190 as permitted sender) Received: from [64.233.182.190] (HELO nf-out-0910.google.com) (64.233.182.190) by apache.org (qpsmtpd/0.29) with ESMTP; Sat, 16 Dec 2006 09:08:17 -0800 Received: by nf-out-0910.google.com with SMTP id n28so165879nfc for ; Sat, 16 Dec 2006 09:07:56 -0800 (PST) DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:to:subject:mime-version:content-type; b=E0c74S9f4O7DCyrYyOaGqJh7epsVPod0v7008uoVWK5WzzFCwrW5XTX7FaBmUqx0N44Ux+4h/bAT4BqQEwvlcRq7TYUyugWBt4QZEPAOunKfUH6tQIeFL6Ic6Oam9BX1GzRKtIF0Y/PWuFlT0Mfu/aqaJdcev7lF1HmkDFwfaXc= Received: by 10.48.230.2 with SMTP id c2mr704149nfh.1166288876156; Sat, 16 Dec 2006 09:07:56 -0800 (PST) Received: by 10.48.48.20 with HTTP; Sat, 16 Dec 2006 09:07:55 -0800 (PST) Message-ID: Date: Sat, 16 Dec 2006 12:07:55 -0500 From: "Mark Miller" To: java-user@lucene.apache.org Subject: sorting by per doc hit count MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_157814_6731114.1166288875793" X-Virus-Checked: Checked by ClamAV on apache.org ------=_Part_157814_6731114.1166288875793 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Content-Disposition: inline I have not really looked into this yet, but maybe you can save me some time -- Is it feasible/simple to sort by the number of hits found per document? Would this require changing the scoring system (remove idf etc etc) and doing a normal relevancy search? Could it be done with functionquery? Any Hints? If it is a lot of work I am not interested in doing it, but if it is somewhat simple it would make a few customers feel fuzzy. Thanks, Mark ------=_Part_157814_6731114.1166288875793--