Return-Path: X-Original-To: apmail-lucene-java-user-archive@www.apache.org Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 2783CDCE7 for ; Sun, 4 Nov 2012 00:54:12 +0000 (UTC) Received: (qmail 51151 invoked by uid 500); 4 Nov 2012 00:54:10 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 51111 invoked by uid 500); 4 Nov 2012 00:54:10 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 51102 invoked by uid 99); 4 Nov 2012 00:54:10 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 04 Nov 2012 00:54:10 +0000 X-ASF-Spam-Status: No, hits=-0.7 required=5.0 tests=RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of rcmuir@gmail.com designates 209.85.216.176 as permitted sender) Received: from [209.85.216.176] (HELO mail-qc0-f176.google.com) (209.85.216.176) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 04 Nov 2012 00:54:03 +0000 Received: by mail-qc0-f176.google.com with SMTP id n41so945684qco.35 for ; Sat, 03 Nov 2012 17:53:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type; bh=CoNAPHPeJhpPhPiEc/xk2u4AjpcX10eo5mZkhgtz3b4=; b=CdqjUQtMmg8ZClFXNDcd2OHJyOQVTs1WXjWPh1IhKMADuBkrYvm+IO+UAX4MyGl6JW 0lPeGiGZhVUJu0uNaPFScqLAkMo8wjqsgCbKVoCGfG17Fyv2OD21O45eM43ws9qug2be gNoTe+HfOvBUJr3wbXzZgaJYx7yTqn27G8dZZKt4I6wIUp2jFcgZMY/v3YHowtZYDJQ2 X9sBC/U41zq538HPenScHtWdre6zo2UXCTQIwDniGC9stsiFq5NuaP4MFw1Ph3goS5iU 8lseQ5kCcQKhdSNN/R3Ba9PrJ5kaUBYrQfbLqpK+cEI3L52CGUpqf7orT7E7FjS0a/F2 T5gQ== Received: by 10.224.70.140 with SMTP id d12mr8607070qaj.53.1351990422813; Sat, 03 Nov 2012 17:53:42 -0700 (PDT) MIME-Version: 1.0 Received: by 10.49.105.168 with HTTP; Sat, 3 Nov 2012 17:53:22 -0700 (PDT) In-Reply-To: References: From: Robert Muir Date: Sat, 3 Nov 2012 20:53:22 -0400 Message-ID: Subject: Re: Using new similarities in Lucene 4.0 To: java-user@lucene.apache.org Content-Type: text/plain; charset=UTF-8 X-Virus-Checked: Checked by ClamAV on apache.org On Tue, Oct 30, 2012 at 10:20 AM, parnab kumar wrote: > Hi all, > > Lucene 4 has introduced several state of the art ranking functions. I > was wondering how could i make use of those similarities . IndexSearcher.setSimilarity(new XYZSimilarity()); > These models > obviously uses some more term and collection statistics as opposed to > the traditional Vector Space Model . I guess these statistics > are precomputed during indexing .Should there be any changes to the > indexing mechanism ,like specifying the indexing options to make use of > these models . >From the CHANGES.txt: "All models default to the same index-time norm encoding as DefaultSimilarity, so you can easily try these out/switch back and forth/run experiments and comparisons without reindexing. Note: most of the models do rely upon index statistics that are new in Lucene 4.0, so for existing 3.x indexes its a good idea to upgrade your index to the new format with IndexUpgrader first. " --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org For additional commands, e-mail: java-user-help@lucene.apache.org