From: "Karsten Konrad"
To: "Lucene Developers List"
Subject: AW: Page ranking and the keyword meta tag
Date: Thu, 29 Jul 2004 09:32:21 +0200

Hi,

>> it gives higher ranking to html pages that contain the searched query
>> inside the keyword meta tag, but Lucene seems to ignore everything html
>> when searching. Anybody has any idea how to do this?

Put the content of the meta tags into a separate field and search over both the content field and the meta tag field. Lucene will rank matches in the meta tag field higher, since that field is much shorter than the complete text. Works quite well.

You will have to parse the HTML yourself to get the meta tags, and construct a query that searches the content and the meta data simultaneously.

Regards,

Karsten

--
Dr.-Ing. Karsten Konrad
Head of Artificial Intelligence Lab
Xtramind Technologies GmbH
Stuhlsatzenhausweg 3
D-66123 Saarbrücken
Phone +49 (681) 3 02-51 13
Fax +49 (681) 3 02-51 09
karsten.konrad@xtramind.com
www.xtramind.com

Visit us!
DMS | Hall 2, Booth 2705 | 7-9 September 2004 | Messe Essen | www.dmsexpo.de

-----Original Message-----
From: Monique Platzer [mailto:mplatzer@evertecinc.com]
Sent: Wednesday, 28 July 2004 21:05
To: lucene-dev@jakarta.apache.org
Subject: FW: Page ranking and the keyword meta tag

Hi! I'm interested in customizing Lucene so that it gives higher ranking to html pages that contain the searched query inside the keyword meta tag, but Lucene seems to ignore everything html when searching. Anybody has any idea how to do this? Thanks!

Monique Platzer
mplatzer@evertecinc.com
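
The approach suggested in the reply above (pull the keyword meta tag into its own field, then match the query against both that field and the page content) can be illustrated with a small, self-contained sketch. This is not Lucene code and does not use any Lucene API. The class name MetaFieldSketch, the regular expression, and the scoring formula are all assumptions made for illustration. The formula sqrt(freq) / sqrt(numTerms) is only loosely modeled on the length normalization in Lucene's classic default similarity, and it leaves out idf, boosts, and everything else a real index would apply. A real application would also use a proper HTML parser rather than a regular expression.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Toy illustration only: none of this is Lucene API.
final class MetaFieldSketch {

    // Simplifying assumption: name="keywords" comes before content="..." in the tag.
    private static final Pattern KEYWORDS_META = Pattern.compile(
        "<meta\\s+name\\s*=\\s*[\"']keywords[\"']\\s+content\\s*=\\s*[\"']([^\"']*)[\"']",
        Pattern.CASE_INSENSITIVE);

    // Returns the content of the keywords meta tag, or "" if there is none.
    static String extractKeywords(String html) {
        Matcher m = KEYWORDS_META.matcher(html);
        return m.find() ? m.group(1) : "";
    }

    // Crude tag removal; attribute values (including meta content) are dropped.
    static String stripTags(String html) {
        return html.replaceAll("<[^>]*>", " ");
    }

    // Lowercases and splits on anything that is not a letter or digit.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    // Assumed toy score: sqrt(term frequency) divided by sqrt(field length),
    // so the same match counts for more in a shorter field.
    static double fieldScore(String term, String fieldText) {
        List<String> tokens = tokenize(fieldText);
        if (tokens.isEmpty()) {
            return 0.0;
        }
        String wanted = term.toLowerCase(Locale.ROOT);
        int freq = 0;
        for (String t : tokens) {
            if (t.equals(wanted)) {
                freq++;
            }
        }
        return Math.sqrt(freq) / Math.sqrt(tokens.size());
    }

    // Searches both "fields" at once by summing their scores.
    static double score(String term, String html) {
        return fieldScore(term, extractKeywords(html))
             + fieldScore(term, stripTags(html));
    }

    public static void main(String[] args) {
        String body = "<body>Lucene is a search library written in Java</body>";
        String withMeta = "<html><head><meta name=\"keywords\" "
            + "content=\"lucene, search\"></head>" + body + "</html>";
        String withoutMeta = "<html><head></head>" + body + "</html>";

        System.out.println("with keyword meta tag:    " + score("lucene", withMeta));
        System.out.println("without keyword meta tag: " + score("lucene", withoutMeta));
    }
}
```

In this sketch the page whose keyword meta tag contains the query term scores higher than an otherwise identical page without the tag, because the two-term keyword field contributes more per match than the longer body text. With Lucene itself, the same idea would mean adding the extracted keywords as a second field on each document and building a query over both fields.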