Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 70C6C10FC2 for ; Thu, 6 Mar 2014 18:34:51 +0000 (UTC) Received: (qmail 25995 invoked by uid 500); 6 Mar 2014 18:34:48 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 25608 invoked by uid 500); 6 Mar 2014 18:34:39 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 25600 invoked by uid 99); 6 Mar 2014 18:34:38 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 Mar 2014 18:34:38 +0000 X-ASF-Spam-Status: No, hits=-1.3 required=5.0 tests=RCVD_IN_DNSWL_MED,SPF_SOFTFAIL X-Spam-Check-By: apache.org Received-SPF: softfail (nike.apache.org: transitioning domain of christian.reuschling@gmail.com does not designate 131.246.120.220 as permitted sender) Received: from [131.246.120.220] (HELO mailgw1.uni-kl.de) (131.246.120.220) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 06 Mar 2014 18:34:32 +0000 Received: from dfki.uni-kl.de (dfki-1002.dfki.uni-kl.de [131.246.195.2]) by mailgw1.uni-kl.de (8.14.3/8.14.3/Debian-9.4) with ESMTP id s26IY8fM017179 for ; Thu, 6 Mar 2014 19:34:08 +0100 Received: from serv-4100.kl.dfki.de (serv-4100.kl.dfki.de [192.168.41.180]) by dfki.uni-kl.de (8.13.8+Sun/8.11.4) with ESMTP id s26IY87f028199 for ; Thu, 6 Mar 2014 19:34:08 +0100 (CET) Received: from pc-4176.kl.dfki.de (pc-4176.kl.dfki.de [192.168.41.166]) by serv-4100.kl.dfki.de (8.14.4+Sun/8.14.4) with ESMTP id s26IY8YO001138 for ; Thu, 6 Mar 2014 19:34:08 +0100 (CET) Message-ID: <5318BFA0.2090807@gmail.com> Date: Thu, 06 Mar 2014 19:34:08 +0100 From: Christian Reuschling User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.1.0 MIME-Version: 1.0 To: java-user@lucene.apache.org Subject: tf/idf similarity with modified document similarity X-Enigmail-Version: 1.6 Content-Type: text/plain; charset=ISO-8859-15 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hello, what is the best method to score documents similar to default similarity, but the document frequency should be calculated per query against the matching result document set, not statically against the whole corpus. Didn't found a good and performant solution yet. Thank you! Christian -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.19 (GNU/Linux) Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/ iEYEARECAAYFAlMYv6AACgkQ6EqMXq+WZg+cjQCbBCwxnGyn18kEEbJ2aHbiyTNv xpcAnRho4H/YGKzsmoOXN91+06nruhHa =g3Ka -----END PGP SIGNATURE----- --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org