Return-Path: X-Original-To: apmail-jackrabbit-users-archive@minotaur.apache.org Delivered-To: apmail-jackrabbit-users-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 8D60094DA for ; Tue, 6 Dec 2011 13:13:03 +0000 (UTC) Received: (qmail 87532 invoked by uid 500); 6 Dec 2011 13:13:02 -0000 Delivered-To: apmail-jackrabbit-users-archive@jackrabbit.apache.org Received: (qmail 87501 invoked by uid 500); 6 Dec 2011 13:13:02 -0000 Mailing-List: contact users-help@jackrabbit.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: users@jackrabbit.apache.org Delivered-To: mailing list users@jackrabbit.apache.org Received: (qmail 87483 invoked by uid 99); 6 Dec 2011 13:13:02 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Dec 2011 13:13:02 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [188.40.37.16] (HELO hq1.backendmedia.com) (188.40.37.16) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 06 Dec 2011 13:12:56 +0000 Received: from localhost (unknown [127.0.0.1]) by hq1.backendmedia.com (Postfix) with ESMTP id 102822E30001; Tue, 6 Dec 2011 13:12:37 +0000 (UTC) X-Virus-Scanned: amavisd-new at backendmedia.com Received: from hq1.backendmedia.com ([127.0.0.1]) by localhost (hq1.backendmedia.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id nmUSZRwdAeCg; Tue, 6 Dec 2011 14:12:36 +0100 (CET) Received: from [192.168.1.48] (17-103.79-83.cust.bluewin.ch [83.79.103.17]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) (Authenticated sender: mls@pooteeweet.org) by hq1.backendmedia.com (Postfix) with ESMTPSA id 65CAD200C18A; Tue, 6 Dec 2011 14:12:36 +0100 (CET) Subject: Re: Boosting newer documents Mime-Version: 1.0 (Apple Message framework v1251.1) Content-Type: text/plain; charset=iso-8859-1 From: Lukas Kahwe Smith In-Reply-To: Date: Tue, 6 Dec 2011 14:12:04 +0100 Cc: dev@jackrabbit.apache.org Content-Transfer-Encoding: quoted-printable Message-Id: References: <4EDDDFF3.8090601@liip.ch> To: users@jackrabbit.apache.org X-Mailer: Apple Mail (2.1251.1) X-Virus-Checked: Checked by ClamAV on apache.org On Dec 6, 2011, at 13:58 , Jukka Zitting wrote: > Hi, >=20 >=20 > On Tue, Dec 6, 2011 at 10:27 AM, Christian Stocker > wrote: >> Before we dig more into this: Would this be the correct way? Is this >> even possible in Jackrabbit without having to change too much? Or is >> there an easier way to give "newer" Documents more weight than >> older once? >=20 > I guess you could achieve the same effect by having a custom indexing > configuration file [1] that you modify once per day or per week to > increase the boost for specific node types. >=20 > However, it seems counterintuitive to have to keep increasing the > boost either with configuration changes or with a boost function like > the one you proposed. yes .. this needs to be a boost that is determined at query time and not = at index time > As already suggested by Alex, I'd rather use sorting for this. To > allow the full text match score to affect the sort order you could use > just the year, month or week number as the first sort term and let the > matches within that time period be sorted according to the match > score. well sorting by date isnt a good solution at all if you actually also = care about the score. this is basically reducing lucene to an RDBMS LIKE = engine. > Solr has a more complex mechanism for such date-based scoring (see > [2]), but making something like that work with Jackrabbit probably > needs quite a bit of work on the search index layer. >=20 > [1] http://wiki.apache.org/jackrabbit/IndexingConfiguration > [2] = http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_boost_the_score_of_= newer_documents yes .. maybe its not feasible "just" for this. but imho this is = currently the biggest gap with Jackrabbit that it doesnt expose more of = Lucene (for example facetting etc). it would be kind of a pitty to have = to duplicate the entire data inside another Solr/ElasticSearch instance = that would be then totally unaware of ACL's and path relations regards, Lukas Kahwe Smith mls@pooteeweet.org