lucene-dev mailing list archives

From "Bill Bell (JIRA)" <j...@apache.org>
Subject [jira] Created: (LUCENE-2347) Dump WordNet to SOLR Synonym format
Date Thu, 25 Mar 2010 22:56:27 GMT
Dump WordNet to SOLR Synonym format
-----------------------------------

                 Key: LUCENE-2347
                 URL: https://issues.apache.org/jira/browse/LUCENE-2347
             Project: Lucene - Java
          Issue Type: New Feature
          Components: contrib/*
    Affects Versions: 3.0.1
            Reporter: Bill Bell


This enhancement dumps WordNet v2 to the SOLR synonym format, so that all WordNet synonyms
can be loaded into SOLR easily.

1. You can load all synonyms from WordNet V2 (http://wordnetcode.princeton.edu/2.0/) into SOLR
by starting from the Syns2Index program:
http://lucene.apache.org/java/2_2_0/api/org/apache/lucene/wordnet/Syns2Index.html

Get WNprolog from http://wordnetcode.princeton.edu/2.0/

2. We modified this program to work with SOLR (see attached). On amidev.kaango.com the source
is in /vol/src/lucene/contrib/wordnet:
vi /vol/src/lucene/contrib/wordnet/src/java/org/apache/lucene/wordnet/Syns2Solr.java

3. Run ant

4. java -classpath /vol/src/lucene/build/contrib/wordnet/lucene-wordnet-3.1-dev.jar org.apache.lucene.wordnet.Syns2Solr prolog/wn_s.pl solr > index_synonyms.txt
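
The attached Syns2Solr.java is not reproduced in this message. As a rough illustration of the
conversion only, here is an independent minimal sketch, not the attached code. It assumes
wn_s.pl lines of the form s(synset_id,w_num,'word',ss_type,sense_number,tag_count). and
assumes the output is one comma-separated line per synset. The class name SynsToSolrSketch,
the method toSolrLines and the sample lines (including their synset IDs) are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch: NOT the Syns2Solr class attached to the issue.
final class SynsToSolrSketch {

    /**
     * Groups words by synset id and renders one "a, b, c" line per synset.
     * Synsets with a single word are skipped, since they define no synonym.
     */
    static List<String> toSolrLines(List<String> prologLines) {
        // LinkedHashMap/LinkedHashSet keep first-seen order, so output is stable.
        Map<String, Set<String>> synsets = new LinkedHashMap<>();
        for (String raw : prologLines) {
            String line = raw.trim();
            if (!line.startsWith("s(")) {
                continue; // ignore anything that is not an s(...) fact
            }
            int firstComma = line.indexOf(',');
            int openQuote = line.indexOf('\'');
            int closeQuote = line.lastIndexOf('\'');
            if (firstComma < 0 || openQuote < 0 || closeQuote <= openQuote) {
                continue; // malformed line
            }
            String synsetId = line.substring(2, firstComma);
            // Prolog escapes a single quote inside an atom by doubling it.
            String word = line.substring(openQuote + 1, closeQuote).replace("''", "'");
            synsets.computeIfAbsent(synsetId, k -> new LinkedHashSet<>()).add(word);
        }
        List<String> out = new ArrayList<>();
        for (Set<String> words : synsets.values()) {
            if (words.size() > 1) {
                out.add(String.join(", ", words));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample lines in the assumed wn_s.pl layout.
        List<String> sample = Arrays.asList(
            "s(100000001,1,'car',n,1,0).",
            "s(100000001,2,'automobile',n,1,0).",
            "s(100000002,1,'entity',n,1,0).");
        for (String line : toSolrLines(sample)) {
            System.out.println(line);
        }
    }
}
```

Each emitted line lists the words of one synset, which is the comma-separated form a SOLR
synonyms file uses for equivalent terms. A real run would read prolog/wn_s.pl line by line
instead of using the hard-coded sample.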


