lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Dawid Weiss (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-4007) Morfologik dictionaries not available in Solr field type
Date Wed, 31 Oct 2012 09:19:12 GMT

    [ https://issues.apache.org/jira/browse/SOLR-4007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13487630#comment-13487630
] 

Dawid Weiss commented on SOLR-4007:
-----------------------------------

Solr doesn't set context class loader and this is used by Morfologik internally to look up
classes. If you look at my commit you'll see that the fix is to temporarily set context class
loader to the one that loaded PolishStemmer (where the dictionaries reside).  I don't know
how it applies to your patch/code.
                
> Morfologik dictionaries not available in Solr field type
> --------------------------------------------------------
>
>                 Key: SOLR-4007
>                 URL: https://issues.apache.org/jira/browse/SOLR-4007
>             Project: Solr
>          Issue Type: Bug
>          Components: Schema and Analysis
>    Affects Versions: 4.1
>            Reporter: Lance Norskog
>            Assignee: Dawid Weiss
>            Priority: Minor
>             Fix For: 4.1
>
>
> The Polish Morfologik type does not find its dictionaries when used in Solr. To demonstrate:
> 1) Add this to example/solr/collection1/conf/schema.xml:
> {noformat}
>     <!-- Polish -->
>     <fieldType name="text_pl" class="solr.TextField" positionIncrementGap="100">
>       <analyzer>
>         <tokenizer class="solr.StandardTokenizerFactory"/>
>         <filter class="solr.MorfologikFilterFactory" dictionary="MORFOLOGIK" />
>       </analyzer>
>     </fieldType>
> {noformat}
> 2) Add this to example/solr/collection1/conf/solrconfig.xml:
> {noformat}
>   <lib dir="../../../../lucene/build/analysis/morfologik/" regex=".*\.jar" />
>   <lib dir="../../../contrib/analysis-extras/lib" regex=".*\.jar" />
>   <lib dir="../../../dist/" regex="apache-solr-analysis-extras-\d.*\.jar" />
> {noformat}
> 3) Test 'text_pl' in the analysis page. You will get an exception.
> {noformat}
> Oct 28, 2012 8:27:19 PM org.apache.solr.core.SolrCore execute
> INFO: [collection1] webapp=/solr path=/analysis/field params={analysis.showmatch=true&analysis.query=&wt=json&analysis.fieldvalue=blah+blah&analysis.fieldtype=text_pl}
status=500 QTime=26 
> Oct 28, 2012 8:27:19 PM org.apache.solr.common.SolrException log
> SEVERE: null:java.lang.RuntimeException: Default dictionary resource for language 'plnot
found.
> 	at morfologik.stemming.Dictionary.getForLanguage(Dictionary.java:163)
> 	at morfologik.stemming.PolishStemmer.<init>(PolishStemmer.java:64)
> 	at org.apache.lucene.analysis.morfologik.MorfologikFilter.<init>(MorfologikFilter.java:70)
> 	at org.apache.lucene.analysis.morfologik.MorfologikFilterFactory.create(MorfologikFilterFactory.java:63)
> 	at org.apache.solr.handler.AnalysisRequestHandlerBase.analyzeValue(AnalysisRequestHandlerBase.java:125)
> 	at org.apache.solr.handler.FieldAnalysisRequestHandler.analyzeValues(FieldAnalysisRequestHandler.java:220)
> 	at org.apache.solr.handler.FieldAnalysisRequestHandler.handleAnalysisRequest(FieldAnalysisRequestHandler.java:181)
> 	at org.apache.solr.handler.FieldAnalysisRequestHandler.doAnalysis(FieldAnalysisRequestHandler.java:100)
> 	at 
> [...........]
> Caused by: java.io.IOException: Could not locate resource: morfologik/dictionaries/pl.dict
> 	at morfologik.util.ResourceUtils.openInputStream(ResourceUtils.java:56)
> 	at morfologik.stemming.Dictionary.getForLanguage(Dictionary.java:156)
> 	... 38 more
> {noformat}
> {{morfologik-polish-1.5.3.jar}} has {{morfologik/dictionaries/pl.dict}}.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message