Return-Path: X-Original-To: apmail-lucene-solr-user-archive@minotaur.apache.org Delivered-To: apmail-lucene-solr-user-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id C7309106BD for ; Fri, 6 Dec 2013 07:49:18 +0000 (UTC) Received: (qmail 42570 invoked by uid 500); 6 Dec 2013 07:48:58 -0000 Delivered-To: apmail-lucene-solr-user-archive@lucene.apache.org Received: (qmail 42505 invoked by uid 500); 6 Dec 2013 07:48:55 -0000 Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: solr-user@lucene.apache.org Delivered-To: mailing list solr-user@lucene.apache.org Received: (qmail 42497 invoked by uid 99); 6 Dec 2013 07:48:53 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 06 Dec 2013 07:48:53 +0000 X-ASF-Spam-Status: No, hits=2.2 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [98.138.91.37] (HELO nm18-vm0.bullet.mail.ne1.yahoo.com) (98.138.91.37) by apache.org (qpsmtpd/0.29) with SMTP; Fri, 06 Dec 2013 07:48:44 +0000 Received: from [98.138.226.178] by nm18.bullet.mail.ne1.yahoo.com with NNFMP; 06 Dec 2013 07:48:21 -0000 Received: from [98.138.226.166] by tm13.bullet.mail.ne1.yahoo.com with NNFMP; 06 Dec 2013 07:48:21 -0000 Received: from [127.0.0.1] by omp1067.mail.ne1.yahoo.com with NNFMP; 06 Dec 2013 07:48:21 -0000 X-Yahoo-Newman-Property: ymail-3 X-Yahoo-Newman-Id: 204452.59024.bm@omp1067.mail.ne1.yahoo.com Received: (qmail 33006 invoked by uid 60001); 6 Dec 2013 07:48:21 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s1024; t=1386316101; bh=gOUFfs5Zaski1BZ2YB1171PPSXTM4/ERE/lIwXG++Gs=; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=ROc4aeZq8+NyFKtMKszoRPB2I2jsQVNcU5FPknqXQdo2KQy2EJRddens7McxoiT0u6KdUVsrgwH+ENNWSxASxVh8gg//UAtBopTeYo38rG0E9+HViL41QTMA+DcsjLExc6kFHZP6wCZt1iwSxF0L4+kzbSuY6HMnYzmX1/XThAg= DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=s1024; d=yahoo.com; h=X-YMail-OSG:Received:X-Rocket-MIMEInfo:X-Mailer:References:Message-ID:Date:From:Reply-To:Subject:To:In-Reply-To:MIME-Version:Content-Type; b=GvF0ALboozp7dTfG5wYe72RRx6PHlX0dNLIK6xmhwDaGbyT5qHaDXNFSYHKKZXbOIMMa4jzqh5WDJEQ+5P8Zh1k0a+OiwJ/lx1+zr6b/2Pb0GMZMEPN0IvZntPJdP0QEgCeP7MzaonjYw0MjfPHnUe42dfTOA20MD6xF8Df+oNk=; X-YMail-OSG: xa3Jf_gVM1mGI2sWO2GAsR1iri2EDiZTExjWuFSiUj.Lwzi Q4BLBA90tfZ1W0Naq.P.pJmsOX86KXEwQdY65.hI6ACDm.5o1FDT3nG_TNsB DrOIbGfG6ZoOtg.Ov9sswehq.Ru.P6JocH7.62qfsz2ze91FgysOqf7B2Tsu GAjq00aa1F6ong6bR227CNwMRPzouWIEAnWBYh8OfnXGRbAEdfE3JmSUQ81g FPBYLkIIP296kVyhMvJtNGLdeaDCsFGmBwCWa_TX9B_pbh8yLDDqV.17M56d BzvcXj4wCaGyzeV.o9LvKB86oR_ncZnjEIM84F0YDs0r.OQQL7sTrD5SKZQS kMR0ApUKpUzt03BTPhhHXc_VT8TB.zaPmFiEKIHE10R1d3xCO8PuJh1uKdA0 qwUIKGDTyv5ORbiZXkgF_DqDXRARRnnkFyBGoz2DJ9crP0yaxUao5S5pnjn8 dnk9FWDTW8GmU4EYSjcm.AF5pTs_Gd0y8VtvDOLaEjqRX7iGU.ulTp8tdPeM uSvt7Gyy7wh0GrMYVmEFpJhs2MyuOY5CWpi54vDkKnN49q_nMrG6n2CUNOe0 hUfLWk9U5Fuqvma8- Received: from [193.140.16.102] by web125306.mail.ne1.yahoo.com via HTTP; Thu, 05 Dec 2013 23:48:21 PST X-Rocket-MIMEInfo: 002.001,SGkgSXNhYWMsCgpEaWQgeW91IGNvbnNpZGVyIG9taXR0aW5nIG5vcm1zIGNvbXBsZXRlbHkgZm9yIHRoYXQgZmllbGQ_IG9taXROb3Jtcz0idHJ1ZSIKQXJlIHlvdSB1c2luZ8Kgc29sci5SZW1vdmVEdXBsaWNhdGVzVG9rZW5GaWx0ZXJGYWN0b3J5PwoKCgpPbiBUaHVyc2RheSwgRGVjZW1iZXIgNSwgMjAxMyA4OjU1IFBNLCBJc2FhYyBIZWJzaCA8aXNhYWMuaGVic2hAZ21haWwuY29tPiB3cm90ZToKIApIaSwKd2UgaW1wbGVtZW50ZWQgYSBtb3JwaG9sb2dpYyBhbmFseXplciwgd2hpY2ggc3RlbXMgd29yZHMBMAEBAQE- X-Mailer: YahooMailWebService/0.8.169.609 References: Message-ID: <1386316101.30429.YahooMailNeo@web125306.mail.ne1.yahoo.com> Date: Thu, 5 Dec 2013 23:48:21 -0800 (PST) From: Ahmet Arslan Reply-To: Ahmet Arslan Subject: Re: Bad fieldNorm when using morphologic synonyms To: "solr-user@lucene.apache.org" In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="-2016055526-1055035004-1386316101=:30429" X-Virus-Checked: Checked by ClamAV on apache.org ---2016055526-1055035004-1386316101=:30429 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable Hi Isaac,=0A=0ADid you consider omitting norms completely for that field? o= mitNorms=3D"true"=0AAre you using=A0solr.RemoveDuplicatesTokenFilterFactory= ?=0A=0A=0A=0AOn Thursday, December 5, 2013 8:55 PM, Isaac Hebsh wrote:=0A =0AHi,=0Awe implemented a morphologic analyzer, whic= h stems words on index time.=0AFor some reasons, we index both the original= word and the stem (on the same=0Aposition, of course).=0AThe stemming is d= one on a specific language, so other languages are not=0Astemmed at all.=0A= =0ABecause of that, two documents with the same amount of terms, may have= =0Adifferent termVector size. document which contains many words that being= =0Astemmed, will have a double sized termVector. This behaviour affects the= =0Arelevance score in a BAD way. the fieldNorm of these documents reduces= =0Athier score. This is NOT the wanted behaviour in our case.=0A=0AWe are l= ooking for a way to "mark" the stemmed words (on index time, of=0Acourse) s= o they won't affect the fieldNorm. Do such a way exist?=0A=0ADo you have an= other idea? ---2016055526-1055035004-1386316101=:30429--