lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Király Péter (JIRA) <j...@apache.org>
Subject [jira] [Commented] (LUCENE-2347) Dump WordNet to SOLR Synonym format
Date Thu, 02 Jun 2011 21:33:47 GMT

    [ https://issues.apache.org/jira/browse/LUCENE-2347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13043062#comment-13043062
] 

Király Péter commented on LUCENE-2347:
--------------------------------------

It seems, that the current version of this class does not make use any Lucene or Solr class,
so you can delete/comment out all org.apache.* imports, and thus no need for adding the jar
to the classpath.

I found a problem, that both notification messages like "Opening Prolog file" goes into the
same output as the main content, the synonyms, which is not an optimal solution.

The second parameter, the index directory is not used at all, just checked, which is not necessary.

Anyway: thanks for publishing this code.

> Dump WordNet to SOLR Synonym format
> -----------------------------------
>
>                 Key: LUCENE-2347
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2347
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: modules/analysis
>    Affects Versions: 3.0.1
>            Reporter: Bill Bell
>         Attachments: Syns2Solr.java
>
>
> This enhancement allows you to dump v2 of WordNet to SOLR synonym format! Get all your
syns loaded easily.
> 1. You can load all synonyms from http://wordnetcode.princeton.edu/2.0/ WordNet V2 to
SOLR by first using the Sys2Index program
> http://lucene.apache.org/java/2_2_0/api/org/apache/lucene/wordnet/Syns2Index.html
> Get WNprolog from http://wordnetcode.princeton.edu/2.0/
> 2. We modified this program to work with SOLR (See attached) on amidev.kaango.com in
/vol/src/lucene/contrib/wordnet
> vi /vol/src/lucene/contrib/wordnet/src/java/org/apache/lucene/wordnet/Syns2Solr.java
> 3. Run ant
> 4. java -classpath /vol/src/lucene/build/contrib/wordnet/lucene-wordnet-3.1-dev.jar org.apache.lucene.wordnet.Syns2Solr
prolog/wn_s.pl solr > index_synonyms.txt

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message