lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lance Norskog (JIRA)" <j...@apache.org>
Subject [jira] [Created] (SOLR-4007) Morfologik dictionaries not available in Solr field type
Date Mon, 29 Oct 2012 03:53:12 GMT
Lance Norskog created SOLR-4007:
-----------------------------------

             Summary: Morfologik dictionaries not available in Solr field type
                 Key: SOLR-4007
                 URL: https://issues.apache.org/jira/browse/SOLR-4007
             Project: Solr
          Issue Type: Bug
          Components: Schema and Analysis
    Affects Versions: 4.1
            Reporter: Lance Norskog
            Priority: Minor


The Polish Morfologik type does not find its dictionaries when used in Solr. To demonstrate:

1) Add this to example/solr/collection1/conf/schema.xml:
{noformat}
    <!-- Polish -->
    <fieldType name="text_pl" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.MorfologikFilterFactory" dictionary="MORFOLOGIK" />
      </analyzer>
    </fieldType>
{noformat}

2) Add this to example/solr/collection1/conf/solrconfig.xml:

{noformat}
  <lib dir="../../../../lucene/build/analysis/morfologik/" regex=".*\.jar" />
  <lib dir="../../../contrib/analysis-extras/lib" regex=".*\.jar" />
  <lib dir="../../../dist/" regex="apache-solr-analysis-extras-\d.*\.jar" />
{noformat}

3) Test 'text_pl' in the analysis page. You will get an exception.
{noformat}
Oct 28, 2012 8:27:19 PM org.apache.solr.core.SolrCore execute
INFO: [collection1] webapp=/solr path=/analysis/field params={analysis.showmatch=true&analysis.query=&wt=json&analysis.fieldvalue=blah+blah&analysis.fieldtype=text_pl}
status=500 QTime=26 
Oct 28, 2012 8:27:19 PM org.apache.solr.common.SolrException log
SEVERE: null:java.lang.RuntimeException: Default dictionary resource for language 'plnot found.
	at morfologik.stemming.Dictionary.getForLanguage(Dictionary.java:163)
	at morfologik.stemming.PolishStemmer.<init>(PolishStemmer.java:64)
	at org.apache.lucene.analysis.morfologik.MorfologikFilter.<init>(MorfologikFilter.java:70)
	at org.apache.lucene.analysis.morfologik.MorfologikFilterFactory.create(MorfologikFilterFactory.java:63)
	at org.apache.solr.handler.AnalysisRequestHandlerBase.analyzeValue(AnalysisRequestHandlerBase.java:125)
	at org.apache.solr.handler.FieldAnalysisRequestHandler.analyzeValues(FieldAnalysisRequestHandler.java:220)
	at org.apache.solr.handler.FieldAnalysisRequestHandler.handleAnalysisRequest(FieldAnalysisRequestHandler.java:181)
	at org.apache.solr.handler.FieldAnalysisRequestHandler.doAnalysis(FieldAnalysisRequestHandler.java:100)
	at 

[...........]

Caused by: java.io.IOException: Could not locate resource: morfologik/dictionaries/pl.dict
	at morfologik.util.ResourceUtils.openInputStream(ResourceUtils.java:56)
	at morfologik.stemming.Dictionary.getForLanguage(Dictionary.java:156)
	... 38 more

{noformat}

{{morfologik-polish-1.5.3.jar}} has {{morfologik/dictionaries/pl.dict}}.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message