lucene-java-user mailing list archives

From VIGNESH S <vigneshkln...@gmail.com>
Subject Re: Synonym Search in Lucene..
Date Wed, 09 Oct 2013 11:46:13 GMT
Hi Koji,

Thanks for your reply and guidance.

I have read the article below, and it is really helpful for getting
relevant synonyms.

But how are you getting the synonyms from Wikipedia? Does Wikipedia expose an
API, or does it provide a ready-made dictionary file for all
languages?

Please kindly help.




On Mon, Oct 7, 2013 at 8:06 PM, Koji Sekiguchi <koji@r.email.ne.jp> wrote:

> (13/10/07 18:33), VIGNESH S wrote:
>
>> Hi,
>>
>> How can I implement synonym search for all languages?
>>
>> As far as I know, WordNet has only English support. Is there anything else
>> we can use to get support for all languages?
>>
>
> I think most people make synonym data manually...
> I've never explored WordNet, but I suspect it is too general to adopt for
> your business field.
>
> I've developed a program that extracts synonym knowledge from Wikipedia
> (see my signature below). The outcome is useful for general purposes.
> But I think that by using a subset of Wikipedia instead of the whole of
> it, the program could extract more useful synonym knowledge for
> a specific business field.
>
> To extract such a subset of Wikipedia, the existing Lucene index
> (which includes the interesting words of the specific field) can be used.
>
> koji
> --
> http://soleami.com/blog/automatically-acquiring-synonym-knowledge-from-wikipedia.html


-- 
Thanks and Regards
Vignesh Srinivasan
9739135640
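
The subset idea in the quoted reply (use the words of an existing index to pick
out the relevant part of Wikipedia) can be sketched roughly as below. This is
only an illustration under stated assumptions, not the program described in the
linked article: the class WikipediaSubsetSketch, the Article holder, and the
minDistinctTerms threshold are all hypothetical names, the domain terms are
passed in as a plain set rather than read from a real Lucene index, and the
tokenization is deliberately naive.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical sketch: choose the Wikipedia articles that mention enough
// domain terms. Nothing here is a Lucene or Wikipedia API.
final class WikipediaSubsetSketch {

    /** Hypothetical holder for one article (title plus plain text). */
    static final class Article {
        final String title;
        final String text;

        Article(String title, String text) {
            this.title = title;
            this.text = text;
        }
    }

    /**
     * Keeps the articles whose text contains at least minDistinctTerms
     * different entries of domainTerms. The terms are assumed to be
     * lower-cased, for example after being exported from an existing index.
     */
    static List<Article> selectSubset(List<Article> articles,
                                      Set<String> domainTerms,
                                      int minDistinctTerms) {
        List<Article> subset = new ArrayList<>();
        for (Article article : articles) {
            Set<String> matched = new HashSet<>();
            // Naive tokenization: lower-case the text and split on anything
            // that is not a letter or a digit.
            String[] tokens = article.text.toLowerCase(Locale.ROOT)
                                          .split("[^\\p{L}\\p{N}]+");
            for (String token : tokens) {
                if (domainTerms.contains(token)) {
                    matched.add(token);
                }
            }
            if (matched.size() >= minDistinctTerms) {
                subset.add(article);
            }
        }
        return subset;
    }
}
```

Calling WikipediaSubsetSketch.selectSubset(articles, terms, 2) returns only the
articles that mention at least two distinct domain terms. Requiring more than
one term is a design choice of this sketch, meant to avoid keeping articles
that merely share a single common word with the index.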
