lucene-java-user mailing list archives

From Steven A Rowe <>
Subject RE: Query and language conversion
Date Tue, 01 Sep 2009 16:46:27 GMT
Hi Alex,

What you want to do is commonly referred to as "Cross Language Information Retrieval".  Doug
Oard at the University of Maryland has a page of CLIR resources here:

Grant Ingersoll responded to a similar question a couple of years ago on this list:


Here's another recent thread with lots of good info, from the solr-user mailing list, on the
same topic:


Here's a paper written by a group that put together a Greek-English cross-language retrieval
system using Lucene:

And here's another paper written by a group that made a Hindi and Telugu to English cross-language
retrieval system using Lucene, from the CLEF 2006 conference proceedings:


> -----Original Message-----
> From: Alex []
> Sent: Tuesday, September 01, 2009 10:30 AM
> To:
> Subject: Query and language conversion
> Hi,
> I am new to Lucene, so excuse me if this is a trivial question.
> I have data that I index in a given language (English). My users will
> come from different countries and my search screen will be
> internationalized. My users will then probably query things in their
> own language. Is it possible to look up items that were indexed
> in a different language?
> To make things a bit clearer:
> My "Business" object has a "type" attribute. In Lucene, a "type" field
> is created. The Business object for "Doctor Smuck" will be indexed with
> the "type" field as "medical doctor" or anything similar. My German
> users will query in German. One of them tries to find a doctor
> using "Arzt" or maybe "Mediziner" as a query. Is Lucene able to match
> the query to the value that was indexed in another language?
> Is there an analyzer for that?
> By the way: I can provide the probable input language, based on the
> client's search page language, as a parameter if that helps (it
> probably will).
> Many thanks for your thoughts !
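
One common CLIR approach is dictionary-based query translation: translate the user's terms into the index language before the query reaches Lucene. The sketch below is only an illustration of that idea for the "Arzt" example above and is not taken from any of the resources mentioned in this thread. The class QueryTranslator, its methods, and the hand-built dictionary are all hypothetical. The only assumption about Lucene is that the resulting string is later parsed with the classic query parser syntax (field:"phrase", AND, OR).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical helper: rewrites a user query into English terms before searching. */
final class QueryTranslator {

    // Hypothetical dictionary: language code -> (lower-cased term -> English equivalents).
    private final Map<String, Map<String, List<String>>> dictionaries = new HashMap<>();

    /** Registers the English equivalents of one term in the given source language. */
    void addTranslation(String language, String term, String... english) {
        dictionaries
            .computeIfAbsent(language, k -> new HashMap<>())
            .put(term.toLowerCase(Locale.ROOT), Arrays.asList(english));
    }

    /**
     * Builds a query string for one field. Each user term becomes an OR group of
     * quoted English phrases; terms without a dictionary entry pass through unchanged.
     * Escaping of special query characters is omitted to keep the sketch short.
     */
    String translate(String language, String field, String userQuery) {
        Map<String, List<String>> dict =
            dictionaries.getOrDefault(language, new HashMap<>());
        List<String> clauses = new ArrayList<>();
        for (String term : userQuery.trim().split("\\s+")) {
            if (term.isEmpty()) {
                continue; // an empty query yields an empty result string
            }
            List<String> english = dict.get(term.toLowerCase(Locale.ROOT));
            if (english == null) {
                clauses.add(field + ":\"" + term + "\"");
            } else {
                List<String> alternatives = new ArrayList<>();
                for (String e : english) {
                    alternatives.add(field + ":\"" + e + "\"");
                }
                clauses.add("(" + String.join(" OR ", alternatives) + ")");
            }
        }
        return String.join(" AND ", clauses);
    }
}
```

With "Arzt" registered for language "de" as "medical doctor" and "physician", calling translate("de", "type", "Arzt") gives (type:"medical doctor" OR type:"physician"). The language code would come from the search page's language, as suggested in the question. A real system would likely need a proper bilingual lexicon or a machine translation step plus handling of ambiguous terms, which this sketch does not attempt.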
