lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From aslam bari <iamasla...@yahoo.co.in>
Subject Re: indexing rss feeds in multiple languages
Date Thu, 22 Mar 2007 06:56:48 GMT
OOPs!!!
Sorry, 
My last message has come here by mistake. It was for someone else, It is just a silly mistake.

sorry People.


----- Original Message ----
From: aslam bari <iamaslamok@yahoo.co.in>
To: java-user@lucene.apache.org
Sent: Thursday, 22 March, 2007 12:12:57 PM
Subject: Re: indexing rss feeds in multiple languages


Hi,
Have a  look to my resume attached with the mail. if it suits you, let me know.
Thanks...


----- Original Message ----
From: Melanie Langlois <Melanie.Langlois@tradingscreen.com>
To: java-user@lucene.apache.org
Sent: Thursday, 22 March, 2007 11:33:03 AM
Subject: indexing rss feeds in multiple languages


Hi,



I saw that there are many post on the mailing list about indexing in multiple language, so
I will try to not post duplicate question. In my case, I want to index rss feeds, so one feed
contains several items in different languages, and some common data for all the items (date,
source..).  After reading the different posts, I think I will create a document per item,
index them in the same index using each time a language specific analyzer, and store lang
field for specific search. But I'm wondering how I should handle the common fields, it seems
I have two options:

1 : store the common data in each item. What happen if duplicate information are entered,
are they duplicate in the index ?



2 : create a separate document for the common data. In this case I will need to link these
data to all underlying items storing some ids. The issue is that I would need to search the
index twice if the search is done only per date, because I would need to retrieve the items
contents. 



Thank in advance for your help.



Mélanie





Here’s a new way to find what you're looking for - Yahoo! Answers 
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


		
__________________________________________________________
Yahoo! India Answers: Share what you know. Learn something new
http://in.answers.yahoo.com/
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message