Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id A2156104C5 for ; Tue, 8 Apr 2014 09:37:19 +0000 (UTC) Received: (qmail 58045 invoked by uid 500); 8 Apr 2014 09:37:14 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 57999 invoked by uid 500); 8 Apr 2014 09:37:13 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 57991 invoked by uid 99); 8 Apr 2014 09:37:12 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Apr 2014 09:37:12 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of iorixxx@yahoo.com designates 98.138.91.159 as permitted sender) Received: from [98.138.91.159] (HELO nm29-vm3.bullet.mail.ne1.yahoo.com) (98.138.91.159) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 08 Apr 2014 09:37:04 +0000 Received: from [98.138.101.130] by nm29.bullet.mail.ne1.yahoo.com with NNFMP; 08 Apr 2014 09:36:41 -0000 Received: from [98.138.87.4] by tm18.bullet.mail.ne1.yahoo.com with NNFMP; 08 Apr 2014 09:36:41 -0000 Received: from [127.0.0.1] by omp1004.mail.ne1.yahoo.com with NNFMP; 08 Apr 2014 09:36:41 -0000 X-Yahoo-Newman-Property: ymail-3 X-Yahoo-Newman-Id: 283207.44520.bm@omp1004.mail.ne1.yahoo.com Received: (qmail 93200 invoked by uid 60001); 8 Apr 2014 09:36:41 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1396949801; bh=iPs+6WaJvhSav7MWAuhyDKHwVo0Vx+rA5CC8u5LStyA=; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=qqeSr+DG4RZ+Rch7TekvxXvT3Vvn+gHRFnT5KNYAQBlfNjKxgyB4BT8I0xUmWYlijF45hCtBTi0wvAjoENti+8LYB8nFhv0iKVq5jFn4OHMckQljleXEAkyuQurxhJWFsOt7r1ftxjJq+klTZnTwRaOTM3rEmV7Tfz4K3LUbzck= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type:Content-Transfer-Encoding; b=TpoCpSbDyOqaHYSi65YxOdZzYVZQMnZq5ZyNbMnL/1XhNMrzjCYWwUeDaY312EwWmZs1H6qGEXTK3L81Uttst0XSobt7AlFJzJhSrtjyK6aGWfTOVXKdlsuZADvWsXKR66pKk7jMLAxBT3moi4K98PD7/PEJXXr6DHcJVxJnB/w=; X-YMail-OSG: MUiKahEVM1mngNBz.RbntGhhg.CfzLlSrkS9ZDuC1n3VExz 96kQ4PujpVg9OXs47QgNZ0igBneyzgfSbjjoD03y0zIoolBxr4RHuiWaSZ41 shwFRaouHnjSvGeVMNYQIsvaFsgfawjsdiI__2Buy_upjjmmnWGkEmZhdMfe SrYwT8FWLbtidY78CRNG5gkBQ4256sI0gYE1loyYWE8uHH.b2bxO6qkALE9G NGWKc4o6c0kce2cO8.cN9PGm.Ekg2h._1i6tvof9cHqWFjOVNb1EWIKmZKXZ QmFPh5wxfijKKHHTV5nDDDeXHQaDYKczi0vKPmrl6a21cyYCTIZNcEe_p7Kg Szuw_fQ.Tc5cNX07xIjTfgs9puhHLDctzY3GmxAtr5L2IV8G3po2amAdL2RH LlMAIM9teJYwY4JqnV9uqPfeSJdx8DIXKsXM5gv19DrBryEqj7mnrfWy12eN FENVEvX.BWezQo7ljQfDZuNRHd_xrOsyd5eQ.tcksgg4o_PYnxPDGMD5wYvn 7qSSTbjE2x.9WBwMb_aAoQU5rljh9TsD1F5W5erRhub.iu85Jig-- Received: from [193.140.184.100] by web124701.mail.ne1.yahoo.com via HTTP; Tue, 08 Apr 2014 02:36:41 PDT X-Rocket-MIMEInfo: 002.001,SGnCoE5pZWxzZW4sCgpUaGVyZSBpcyBubyBzcGVjaWFsIGF0dGVudGlvbiBwYWlkIHRvIGZpcnN0IHdvcmQuIFlvdSBhcmUgcHJvYmFibHkgaGl0dGluZyBsZW5ndGggbm9ybWFsaXNhdGlvbi7CoApMdWNlbmUvU29sciBwdW5pc2hlcyBsb25nIGRvY3VtZW50cywgZmF2b3VycyBzaG9ydCBkb2N1bWVudHMuwqAKKDUgdGltZXMgYXBwZWFyaW5nIG9uZSkgbG9uZ2VyPwoKCgpPbiBUdWVzZGF5LCBBcHJpbCA4LCAyMDE0IDEyOjAzIFBNLCBKb2huIE5pZWxzZW4gPGpuQG1jYi5kaz4gd3JvdGU6CkhpLAoKV2UgYXIBMAEBAQE- X-Mailer: YahooMailWebService/0.8.182.648 References: Message-ID: <1396949801.85086.YahooMailNeo@web124701.mail.ne1.yahoo.com> Date: Tue, 8 Apr 2014 02:36:41 -0700 (PDT) From: Ahmet Arslan Reply-To: Ahmet Arslan Subject: Re: Strange relevance scoring To: "solr-user@lucene.apache.org" In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Checked: Checked by ClamAV on apache.org Hi=A0Nielsen,=0A=0AThere is no special attention paid to first word. You ar= e probably hitting length normalisation.=A0=0ALucene/Solr punishes long doc= uments, favours short documents.=A0=0A(5 times appearing one) longer?=0A=0A= =0A=0AOn Tuesday, April 8, 2014 12:03 PM, John Nielsen wrote:= =0AHi,=0A=0AWe are seeing a strange phenomenon with our Solr setup which I = have been=0Aunable to answer.=0A=0AMy Google-fu is clearly not up to the ta= sk, so I am trying here.=0A=0AIt appears that if i do a freetext search for= a single word, say "modellering"=0Aon a text field, the scoring is massive= ly boosted if the first word of the=0Atext field is a hit.=0A=0AFor instanc= e if there is only one occurrence of the word "modellering" in=0Athe text f= ield and that occurrence is the first word of the text, then that=0Adocumen= t gets a higher relevancy than if the word "modelling" occurs 5=0Atimes in = the text and the first word of the text is any other word.=0A=0AIs this nor= mal behavior? Is special attention paid to the first word in a=0Atext field= ? I would think that the latter case would get the highest score.=0A=0A=0A-= - =0AMed venlig hilsen / Best regards=0A=0A*John Nielsen*=0AProgrammer=0A= =0A=0A=0A*MCB A/S*=0AEnghaven 15=0ADK-7500 Holstebro=0A=0AKundeservice: +45= 9610 2824=0Apost@mcb.dk=0Awww.mcb.dk=0A