lucene-dev mailing list archives

From "Veena Channagouda (JIRA)" <j...@apache.org>
Subject [jira] Commented: (LUCENE-2347) Dump WordNet to SOLR Synonym format
Date Fri, 22 Oct 2010 10:56:15 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-2347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12923817#action_12923817 ]

Veena Channagouda commented on LUCENE-2347:
-------------------------------------------

These are the steps I followed to run the attached file Syns2Solr.java from the command prompt,
assuming the Java bin directory is on the PATH.

Downloaded Syns2Solr.java to C:\downloads and modified the file to remove the package declaration
at the top.
Downloaded the Lucene 3.0.2 jar (lucene-core-3.0.2.jar) from http://archive.apache.org/dist/lucene/java/
Downloaded WNprolog from http://wordnetcode.princeton.edu/2.0/

C:\downloads>javac -classpath lucene-core-3.0.2.jar Syns2Solr.java

C:\downloads>java Syns2Solr wn_s.pl c:\downloads\index > index.txt

You will then have index.txt, containing all the synonyms.
(wn_s.pl is extracted from the downloaded WNprolog archive.)
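
For readers who want to see roughly what such a conversion involves, here is a minimal sketch. It is not the attached Syns2Solr.java, and it does not use the Lucene jar. It assumes each fact in wn_s.pl has the form s(synset_id,w_num,'word',ss_type,sense_number,tag_count). and that one comma-separated line per synset is the desired Solr synonym output. The class name WordNetSynonymSketch, the lowercasing, and the choice to skip single-word synsets are illustrative assumptions. Java 8 or later is assumed.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Lucene or of the attachment.
class WordNetSynonymSketch {
    // Assumed fact layout: s(synset_id,w_num,'word',ss_type,sense_number[,tag_count]).
    private static final Pattern FACT =
        Pattern.compile("^s\\((\\d+),\\d+,'(.*)',[a-z],\\d+(?:,\\d+)?\\)\\.$");

    // Groups words by synset id and returns one comma-separated line per synset.
    static List<String> toSynonymLines(List<String> prologLines) {
        Map<String, List<String>> groups = new LinkedHashMap<>();
        for (String line : prologLines) {
            Matcher m = FACT.matcher(line.trim());
            if (!m.matches()) {
                continue; // ignore anything that is not an s(...) fact
            }
            // Prolog doubles apostrophes inside quoted atoms; undo that here.
            String word = m.group(2).replace("''", "'").toLowerCase();
            List<String> words = groups.computeIfAbsent(m.group(1), k -> new ArrayList<>());
            if (!words.contains(word)) {
                words.add(word);
            }
        }
        List<String> out = new ArrayList<>();
        for (List<String> words : groups.values()) {
            if (words.size() > 1) { // a synset with one word has no synonyms to list
                out.add(String.join(",", words));
            }
        }
        return out;
    }

    public static void main(String[] args) throws IOException {
        // args[0] is the path to wn_s.pl; output goes to stdout so it can be redirected.
        List<String> lines = Files.readAllLines(Paths.get(args[0]), StandardCharsets.ISO_8859_1);
        for (String s : toSynonymLines(lines)) {
            System.out.println(s);
        }
    }
}
```

Compiled the same way as above (without the -classpath option), it could be run as java WordNetSynonymSketch wn_s.pl > index.txt. Words that themselves contain commas are not escaped in this sketch and would need extra handling.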





> Dump WordNet to SOLR Synonym format
> -----------------------------------
>
>                 Key: LUCENE-2347
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2347
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: contrib/analyzers
>    Affects Versions: 3.0.1
>            Reporter: Bill Bell
>         Attachments: Syns2Solr.java
>
>
> This enhancement allows you to dump v2 of WordNet to SOLR synonym format! Get all your syns loaded easily.
> 1. You can load all synonyms from http://wordnetcode.princeton.edu/2.0/ WordNet V2 to SOLR by first using the Syns2Index program
> http://lucene.apache.org/java/2_2_0/api/org/apache/lucene/wordnet/Syns2Index.html
> Get WNprolog from http://wordnetcode.princeton.edu/2.0/
> 2. We modified this program to work with SOLR (See attached) on amidev.kaango.com in /vol/src/lucene/contrib/wordnet
> vi /vol/src/lucene/contrib/wordnet/src/java/org/apache/lucene/wordnet/Syns2Solr.java
> 3. Run ant
> 4. java -classpath /vol/src/lucene/build/contrib/wordnet/lucene-wordnet-3.1-dev.jar org.apache.lucene.wordnet.Syns2Solr prolog/wn_s.pl solr > index_synonyms.txt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



