From: Nicolas Lalevée
To: java-user@lucene.apache.org
Subject: Re: Lucene Internals question
Date: Mon, 22 Jan 2007 20:33:29 +0100

On Monday 22 January 2007 19:33, EDMOND KEMOKAI wrote:
> Hi All
> This is a question for those
> familiar with Lucene document scoring. How
> does it compare with Google's PageRank or HITS, or are they very different?
> I have been looking at the PageRank algorithm but I'll need to brush up on
> my math skills before delving into it :)

Lucene itself is just a search engine. You can use it to search web
pages, as Nutch does. Google is more like Nutch: a web crawler plus a
web search engine. So page ranking has nothing to do with Lucene
scoring. Lucene scoring measures how well a result entry matches your
query. Page ranking is more about how relevant the web page itself is.
So for a given document, the Lucene score depends on the query, whereas
the page rank is essentially absolute: it does not depend on the query.

Nicolas
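
To make the distinction above concrete, here is a minimal sketch in plain Java. It does not use Lucene or reproduce its actual scoring formula: the class `ScoringSketch` and the methods `termFrequency` and `pageRank` are hypothetical names, raw term frequency is only a stand-in for a query-dependent score, and the PageRank loop is a simplified power iteration that assumes every page has at least one outgoing link.

```java
import java.util.Arrays;

public class ScoringSketch {

    // Query-dependent score (stand-in for a real scoring formula):
    // counts how often the query term occurs in the document text.
    static int termFrequency(String text, String term) {
        String wanted = term.toLowerCase();
        int count = 0;
        for (String token : text.toLowerCase().split("\\W+")) {
            if (token.equals(wanted)) {
                count++;
            }
        }
        return count;
    }

    // Query-independent score: simplified PageRank by power iteration.
    // links[i] lists the pages that page i links to.
    // Assumption: every page has at least one outgoing link.
    static double[] pageRank(int[][] links, double damping, int iterations) {
        int n = links.length;
        double[] rank = new double[n];
        Arrays.fill(rank, 1.0 / n);
        for (int it = 0; it < iterations; it++) {
            double[] next = new double[n];
            Arrays.fill(next, (1.0 - damping) / n);
            for (int i = 0; i < n; i++) {
                for (int target : links[i]) {
                    // Page i shares its current rank equally among its targets.
                    next[target] += damping * rank[i] / links[i].length;
                }
            }
            rank = next;
        }
        return rank;
    }

    public static void main(String[] args) {
        String doc = "Lucene is a search library; Lucene scores matches";

        // The same document gets a different score for each query.
        System.out.println("tf(lucene)  = " + termFrequency(doc, "lucene"));
        System.out.println("tf(crawler) = " + termFrequency(doc, "crawler"));

        // The link graph alone determines the rank; no query is involved.
        int[][] links = { {2}, {2}, {0} };
        System.out.println("ranks = " + Arrays.toString(pageRank(links, 0.85, 50)));
    }
}
```

Note that `pageRank` never sees a query, while `termFrequency` cannot be computed without one. A web search engine would typically combine both kinds of signal.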