Return-Path: Delivered-To: apmail-jakarta-lucene-user-archive@www.apache.org Received: (qmail 79697 invoked from network); 14 Oct 2004 03:48:49 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (209.237.227.199) by minotaur-2.apache.org with SMTP; 14 Oct 2004 03:48:49 -0000 Received: (qmail 11313 invoked by uid 500); 14 Oct 2004 03:48:42 -0000 Delivered-To: apmail-jakarta-lucene-user-archive@jakarta.apache.org Received: (qmail 11233 invoked by uid 500); 14 Oct 2004 03:48:41 -0000 Mailing-List: contact lucene-user-help@jakarta.apache.org; run by ezmlm Precedence: bulk List-Unsubscribe: List-Subscribe: List-Help: List-Post: List-Id: "Lucene Users List" Reply-To: "Lucene Users List" Delivered-To: mailing list lucene-user@jakarta.apache.org Received: (qmail 11217 invoked by uid 99); 14 Oct 2004 03:48:41 -0000 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests=FORGED_RCVD_HELO,SPF_HELO_PASS X-Spam-Check-By: apache.org Received-SPF: neutral (hermes.apache.org: local policy) Received: from [205.152.59.70] (HELO imf22aec.mail.bellsouth.net) (205.152.59.70) by apache.org (qpsmtpd/0.28) with ESMTP; Wed, 13 Oct 2004 20:48:40 -0700 Received: from w2ks.synernet.com ([65.12.150.225]) by imf22aec.mail.bellsouth.net (InterMail vM.5.01.06.11 201-253-122-130-111-20040605) with ESMTP id <20041014034835.PMKT2420.imf22aec.mail.bellsouth.net@w2ks.synernet.com> for ; Wed, 13 Oct 2004 23:48:35 -0400 Message-Id: <6.0.1.1.2.20041013234526.03cbca50@fast.synernet.com> X-Sender: ftoth@fast.synernet.com X-Mailer: QUALCOMM Windows Eudora Version 6.0.1.1 Date: Wed, 13 Oct 2004 23:48:16 -0400 To: "Lucene Users List" From: Fred Toth Subject: Re: Index + Searching In-Reply-To: <416D7720.6060608@sun.com> References: <416D7720.6060608@sun.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed X-Virus-Checked: Checked X-Spam-Rating: minotaur-2.apache.org 1.6.2 0/1000/N Hi, Could be your best bet is to use HTML tags. Create tags for name, description, etc. (title is already parsed). The HTML parser that ships with Lucene will parse these tags into java Properties. You will need to hack a bit, but you can easily pick these up and add them as specific fields to your index. Fred At 02:42 PM 10/13/2004, you wrote: >Hello, > >I am using the IndexHTML class to index around 30,000 files and it is >working fine. Question that I have is, is there a way to add multiple >fields to index so that when the actual search is performed I can extract >the exact match. >E.g. >the fields can be >1) title - abc >2) name - foo inc, >3) description - Lorem ipsum dolor sit >4) URL - www.lorem.ipsum > >and so on, > > From search when the match for title 'abc' is found then searching for > doc.get("name") can return foo inc and so on. > >Is this already happening in any other indexing class if not what do I >need to add to IndexHTML class to accomplish this? > >thanks for all the help gang. >-H > > >--------------------------------------------------------------------- >To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org >For additional commands, e-mail: lucene-user-help@jakarta.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: lucene-user-help@jakarta.apache.org