From: Zaharije Pasalic
To: java-user@lucene.apache.org
Date: Fri, 15 Oct 2010 15:19:23 +0200
Subject: Re: Overriding DefaultScore

Can anybody explain payloads, or point me to a couple of links where I can
find more info about them?

Thx

On Fri, Oct 15, 2010 at 11:09 AM, Danil ŢORIN wrote:
> You could encode the term score as a payload while indexing, and use those
> payloads at search time.
>
> On Fri, Oct 15, 2010 at 11:30, Zaharije Pasalic wrote:
>> Hi
>>
>> my original problem is to index a large number of documents, each of which
>> contains 360 integers in the range 0-90K. Searching is a little bit
>> complicated: I need to find the most similar documents, where the query data
>> is also 360 numbers in the range 0-90K. But (there is always a 'but') I need
>> to compute the score with some predefined weight table. Here is an example:
>>
>> Index contains:
>>
>> DOC1 : 1, 3, 5
>> DOC2 : 1, 100
>> DOC3 : 1, 5
>>
>> I need to find all documents which are 'like' this:
>>
>> SEARCH: 1,5,100
>>
>> And suppose that I have a table which says: "if the value is larger than
>> 10, weight the hit as 0.5, else as 1" (in the real application the weight
>> table is more complicated).
>>
>> So for query 1,5,100 I will have:
>>
>> DOC1: SCORE=2    [1,5]
>> DOC3: SCORE=2    [1,5]
>> DOC2: SCORE=1.5  [1,100 (100>10, weight 0.5)]
>>
>> Searching is just: if a hit occurs on the field, increment the score by
>> 1*weight(value)
>>
>> My first step was to create an index with one field which contains all
>> 360 values, and to remove norms from it.
>>
>> Now when I do a search like:
>>
>> "F:1 F:5 F:100"
>>
>> the results are OK but the score is not correct. Of course it gives me
>> results sorted by 'number of hits' (am I right?), but the score value is
>> not calculated in increments of 1, nor am I using weights at all.
>>
>> So, my question is: is this even possible with Lucene, and if it is, can
>> you point me in some direction (I already looked a little bit at
>> overriding DefaultSimilarity).
>>
>> Thanks
>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
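
The scoring rule described in the quoted message (add 1*weight(value) for every query value that hits a document) can be sketched in plain Java, without any Lucene classes. This is only an illustration under stated assumptions: the class name WeightedHitScore, the method names, and the two-level weight table are hypothetical and taken from the simplified example in the thread, not from the real application. The encodeWeight/decodeWeight pair shows one way a per-term weight could be packed into 4 bytes, which is roughly the kind of data a payload would carry; it is not Lucene's own API.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Lucene: reproduces the scores from the thread.
class WeightedHitScore {

    // Assumed weight table from the example: values above 10 count 0.5, others 1.
    static float weight(int value) {
        return value > 10 ? 0.5f : 1.0f;
    }

    // Adds weight(v) once for each distinct query value found in the document.
    static float score(int[] docValues, int[] queryValues) {
        float total = 0f;
        for (int q : Arrays.stream(queryValues).distinct().toArray()) {
            for (int d : docValues) {
                if (d == q) {
                    total += weight(q);
                    break; // count each query value at most once per document
                }
            }
        }
        return total;
    }

    // Packs a weight into 4 big-endian bytes, the sort of blob a payload could hold.
    static byte[] encodeWeight(float w) {
        return ByteBuffer.allocate(4).putFloat(w).array();
    }

    // Reads the weight back from the 4-byte blob.
    static float decodeWeight(byte[] payload) {
        return ByteBuffer.wrap(payload).getFloat();
    }

    public static void main(String[] args) {
        Map<String, int[]> index = new LinkedHashMap<>();
        index.put("DOC1", new int[] {1, 3, 5});
        index.put("DOC2", new int[] {1, 100});
        index.put("DOC3", new int[] {1, 5});

        int[] query = {1, 5, 100};
        for (Map.Entry<String, int[]> e : index.entrySet()) {
            System.out.println(e.getKey() + ": SCORE=" + score(e.getValue(), query));
        }
    }
}
```

With the three example documents and the query 1,5,100 this gives 2 for DOC1 and DOC3 and 1.5 for DOC2, matching the scores listed in the quoted message. In an index, the weight would be computed once per term at indexing time and stored next to it, so the search side only has to decode and sum.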