lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Donna L Gresh <gr...@us.ibm.com>
Subject Re: Adding attribute to index
Date Wed, 02 Apr 2008 18:52:23 GMT
This is "fast and loose" code (from my head; check the syntax). I *highly* 
recommend you get a copy of the book Lucene in Action; it will really 
help.

To create the index, add a document with two fields; one for the sender 
and one for the email text.

IndexWriter indexWriter = new IndexWriter(...... 

Document emailDoc = new Document();
Field senderField = new Field("sender", senderEmailAddress, 
Field.Store.YES, Field.Index.UN_TOKENIZED);
emailDoc.add(senderField);
Field textField = new Field("emailText", textOfEmail, Field.Store.YES, 
Field.Index.TOKENIZED);
emailDoc.add(textField);
indexWriter.addDocument(emailDoc);


Then when you are searching, search in the email text field:

Query query = new TermQuery(new Term("emailText","searchTerm"));
Hits hits = searcher.search(query);
Document doc = hits.doc(0); //best fit document
String emailSender = doc.get("sender");
String emailText = doc.get("emailText");


Donna L. Gresh
Services Research, Mathematical Sciences Department
IBM T.J. Watson Research Center
(914) 945-2472
http://www.research.ibm.com/people/g/donnagresh
gresh@us.ibm.com


"Nitasha Walia (niwalia)" <niwalia@cisco.com> wrote on 04/02/2008 02:26:45 
PM:

> Hi, 
> 
> I am a new user of Java Lucene and need to learn how to add a new 
> attribute, such that, given a database of emails, containing sender 
> information, searching for a keyword, results in 
> 1. The sender of the email
> 2. The email. 
> 
> I am using Lucene-2.3.1, and don't know where to start in the huge code 
base. 
> 
> Can someone please advise on the same?
> 
> Thanks,
> 
> [image removed] 
> 
> Nitasha Walia
> Software Engineer
> Product Development
> 
> niwalia@cisco.com
> Mobile: 412-736 4507
> 
> 
> 
> United States
> Cisco home page

> 
> 
> 
> [image removed] 
> 
> [image removed] Think before you print.
> 
> This e-mail may contain confidential and privileged material for the
> sole use of the intended recipient. Any review, use, distribution or
> disclosure by others is strictly prohibited. If you are not the 
> intended recipient (or authorized to receive for the recipient), 
> please contact the sender by reply e-mail and delete all copies of 
> this message.
> 
> [image removed] 
> 
> 
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message