lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From aslam bari <>
Subject Re: indexing rss feeds in multiple languages
Date Thu, 22 Mar 2007 06:56:48 GMT
My last message has come here by mistake. It was for someone else, It is just a silly mistake.

sorry People.

----- Original Message ----
From: aslam bari <>
Sent: Thursday, 22 March, 2007 12:12:57 PM
Subject: Re: indexing rss feeds in multiple languages

Have a  look to my resume attached with the mail. if it suits you, let me know.

----- Original Message ----
From: Melanie Langlois <>
Sent: Thursday, 22 March, 2007 11:33:03 AM
Subject: indexing rss feeds in multiple languages


I saw that there are many post on the mailing list about indexing in multiple language, so
I will try to not post duplicate question. In my case, I want to index rss feeds, so one feed
contains several items in different languages, and some common data for all the items (date,
source..).  After reading the different posts, I think I will create a document per item,
index them in the same index using each time a language specific analyzer, and store lang
field for specific search. But I'm wondering how I should handle the common fields, it seems
I have two options:

1 : store the common data in each item. What happen if duplicate information are entered,
are they duplicate in the index ?

2 : create a separate document for the common data. In this case I will need to link these
data to all underlying items storing some ids. The issue is that I would need to search the
index twice if the search is done only per date, because I would need to retrieve the items

Thank in advance for your help.


Here’s a new way to find what you're looking for - Yahoo! Answers 
To unsubscribe, e-mail:
For additional commands, e-mail:

Yahoo! India Answers: Share what you know. Learn something new
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message