lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Vijay <vijay.stanf...@gmail.com>
Subject Question on Efficient field updates in the Lucene index in Nutch
Date Mon, 01 Jun 2009 22:32:23 GMT
Hi all,

      I have a question regarding field updates to the lucene index in
nutch.

       Suppose I am indexing webpages along with tags as an extra field. I
want to add an extra tag to a webpage. Is there a clean way for me to do
this without having to re-index the page with the updated tags field and
deleting duplicates?

      For example, can I create a new document with the same doc id as the
object made from the relevant URL in the past, with the new tag alone as a
field, with no content section. Could this enable adding the correct doc id
to the new tag postings list, thereby creating the same effect as indexing
the webpage afresh with all the given tags?
     Alternatively is there any other efficient way to do this?


Thanks a ton,
Vijay

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message