Return-Path: Delivered-To: apmail-lucene-java-user-archive@www.apache.org Received: (qmail 16380 invoked from network); 23 Apr 2009 20:31:13 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 23 Apr 2009 20:31:12 -0000 Received: (qmail 79844 invoked by uid 500); 23 Apr 2009 20:31:08 -0000 Delivered-To: apmail-lucene-java-user-archive@lucene.apache.org Received: (qmail 79745 invoked by uid 500); 23 Apr 2009 20:31:08 -0000 Mailing-List: contact java-user-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: java-user@lucene.apache.org Delivered-To: mailing list java-user@lucene.apache.org Received: (qmail 79725 invoked by uid 99); 23 Apr 2009 20:31:08 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 Apr 2009 20:31:08 +0000 X-ASF-Spam-Status: No, hits=3.4 required=10.0 tests=FUZZY_CPILL,HTML_MESSAGE,SPF_NEUTRAL X-Spam-Check-By: apache.org Received-SPF: neutral (nike.apache.org: local policy) Received: from [209.85.218.179] (HELO mail-bw0-f179.google.com) (209.85.218.179) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 23 Apr 2009 20:30:57 +0000 Received: by bwz27 with SMTP id 27so824984bwz.5 for ; Thu, 23 Apr 2009 13:30:36 -0700 (PDT) MIME-Version: 1.0 Received: by 10.223.103.133 with SMTP id k5mr499949fao.23.1240518636544; Thu, 23 Apr 2009 13:30:36 -0700 (PDT) In-Reply-To: <7e536b1f0904231301u3e251b6ayb266086f09cff4ad@mail.gmail.com> References: <7e536b1f0904231301u3e251b6ayb266086f09cff4ad@mail.gmail.com> Date: Thu, 23 Apr 2009 22:30:36 +0200 Message-ID: <7e536b1f0904231330k38bd7269h25634ee405219058@mail.gmail.com> Subject: Re: Change boost of documents / single fields / external scoring ? From: Marcus Herou To: java-user@lucene.apache.org, solr-user@lucene.apache.org Content-Type: multipart/alternative; boundary=001636c5a758ab27ab04683ec08f X-Virus-Checked: Checked by ClamAV on apache.org --001636c5a758ab27ab04683ec08f Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Could an ExternalFileField help me ? http://lucene.apache.org/solr/api/org/apache/solr/schema/ExternalFileField.html On Thu, Apr 23, 2009 at 10:01 PM, Marcus Herou wrote: > Hi. > > Confusing subject eh ? Trying to become a little clearer in a few > sentences. > > We have a Solr/Lucene index where each document is a Blog Entry. We have > just implemented the PageRank algorithm for Blogs and are about to add a > column to the index called score and perhaps adjust the document boost. > > We have as well decided that it is the blog itself and not the individual > pages that are to be ranked so all entries belonging to one blog will > receive the same score. > > I have not found a way to apply a document score without actually > re-indexing all fields in the affected entries (could very well be 100% at > every PageRank recalculation) and this will of course take hell of a long > time to reindex which effectively will render the process useless since it > would take a week or of reindexing as of current and will take more and more > time. (100M blog entries as of current and rapidly increasing). > > Guess we have run into the issue where we have some "static" data which we > do not want to touch at all but we want to update certain "dynamic" fields. > > Lucene is not a database I know but is there a way to implement external > search-time scoring or update individual fields ? Would there be a > possibilty to do some kind of join (parallell searches separate index types) > ? or send the result to a separate sorting algorithm ? Hmmm.... Perhaps a > subclass of Sort ? Grasping at straws here folks... > > Hope anyone of the core experts can help us. > > Cheers > > //Marcus Herou > > > > -- > Marcus Herou CTO and co-founder Tailsweep AB > +46702561312 > marcus.herou@tailsweep.com > http://www.tailsweep.com/ > http://blogg.tailsweep.com/ > -- Marcus Herou CTO and co-founder Tailsweep AB +46702561312 marcus.herou@tailsweep.com http://www.tailsweep.com/ http://blogg.tailsweep.com/ --001636c5a758ab27ab04683ec08f--