lucene-solr-user mailing list archives

From Emir Arnautović <emir.arnauto...@sematext.com>
Subject Re: pf2
Date Fri, 26 Jan 2018 13:14:06 GMT
Hi Rick,
Does it fail in every case, or only in some cases, e.g. for something like l’avion?
You could try sow=false and see if it helps.
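
For reference, here is a minimal sketch of where that parameter could go, assuming the query is served by a /select handler with edismax defaults in solrconfig.xml. The handler name and the qf/pf2/q.op values are assumptions pieced together from this thread, not your actual config. As far as I know, sow only exists from Solr 6.5 on, so on 5.2.2 it would most likely be ignored:

```xml
<!-- Sketch only: the handler name and default values are assumed, not taken from a real solrconfig.xml -->
<requestHandler name="/select" class="solr.SearchHandler">
  <lst name="defaults">
    <str name="defType">edismax</str>
    <str name="q.op">AND</str>
    <str name="qf">keywords^10.0</str>
    <str name="pf2">keywords^11.0</str>
    <!-- Analyze the whole query string instead of splitting it on whitespace first -->
    <str name="sow">false</str>
  </lst>
</requestHandler>
```

While testing, the same parameter can also be appended to a single request as &sow=false.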

Cheers,
Emir
--
Monitoring - Log Management - Alerting - Anomaly Detection
Solr & Elasticsearch Consulting Support Training - http://sematext.com/



> On 26 Jan 2018, at 13:38, Rick Leir <rleir@leirtech.com> wrote:
> 
> Emir
> Thanks, I will do that when I get off this bus.
> 
> I have run the text through the SolrAdmin Analyzer, and it looks fine.
> 
> According to the debugQuery output, individual words match in the qf, but the pair
> that pf2 should match does not.
> 
> I compared the configs for English and French, and they are the same apart from the
> analysis chain, which is below. Only French fails. I will take out filters one by one
> to find which one is causing this.
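> 
> As a starting point for that bisection, here is a rough sketch of a stripped-down French
> type. The name text_fr_debug is hypothetical, and using a single shared analyzer is an
> assumption made only to keep the test small. The filters are the ones from the chain
> below, to be re-enabled one at a time while watching the pf2 clause in the debugQuery output:
> 
> ```xml
> <!-- Sketch only: text_fr_debug is a hypothetical name for a throwaway test type -->
> <fieldType class="solr.TextField" name="text_fr_debug" positionIncrementGap="100">
>   <!-- An analyzer without a type attribute is used at both index and query time -->
>   <analyzer>
>     <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
>     <tokenizer class="solr.ClassicTokenizerFactory"/>
>     <filter class="solr.LowerCaseFilterFactory"/>
>     <!-- Uncomment one filter per test run, reindex, then re-check pf2 in debugQuery -->
>     <!-- <filter class="solr.ElisionFilterFactory" ignoreCase="true" articles="lang/contractions_fr.txt"/> -->
>     <!-- <filter class="solr.StopFilterFactory" format="snowball" ignoreCase="true" words="lang/stopwords_fr.txt"/> -->
>     <!-- <filter class="solr.FrenchMinimalStemFilterFactory"/> -->
>     <!-- <filter class="solr.StemmerOverrideFilterFactory" dictionary="lang/stemdict_fr.txt" ignoreCase="true"/> -->
>     <!-- <filter class="solr.ASCIIFoldingFilterFactory"/> -->
>   </analyzer>
> </fieldType>
> ```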
> Cheers -- Rick
> 
> On January 26, 2018 4:09:51 AM EST, "Emir Arnautović" <emir.arnautovic@sematext.com> wrote:
>> Hi Rick,
>> Can you include a sample of your query and the text that it should match?
>> 
>> Thanks,
>> Emir
>> 
>> 
>> 
>>> On 25 Jan 2018, at 23:13, Rick Leir <rleir@leirtech.com> wrote:
>>> 
>>> 
>>> 
>>> Hi all
>>> My pf2 keywords^11.0 works for English but not for French. Here are the
>>> fieldtypes, actually from two schema.xml's in separate cores. Solr
>>> 5.2.2, edismax, q.op AND
>>> I suspect there are several problems with the French schema. Maybe I
>>> only needed to show the query analyzer, not the index analyzer?
>>> 
>>> The pf2 does not show a match in the debugQuery=true output for the
>>> French. However, a qf keywords^10.0 does show a match. The keywords
>>> field is copied (via copyField) into text, which is the df. Is there
>>> any other field I should be showing?
>>> Thanks
>>> Rick
>>> 
>>> <fieldType class="solr.TextField" name="text_en" positionIncrementGap="100">
>>>   <analyzer type="index">
>>>     <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
>>>     <tokenizer class="solr.ClassicTokenizerFactory"/>
>>>     <filter class="solr.SynonymFilterFactory" expand="false" ignoreCase="true"
>>>             synonyms="synonyms.txt" tokenizerFactory="solr.StandardTokenizerFactory"/>
>>>     <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_en.txt"/>
>>>     <filter class="solr.LowerCaseFilterFactory"/>
>>>     <filter class="solr.EnglishPossessiveFilterFactory"/>
>>>     <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
>>>     <filter class="solr.StemmerOverrideFilterFactory" dictionary="lang/stemdict_en.txt"
>>>             ignoreCase="true"/>
>>>     <filter class="solr.EnglishMinimalStemFilterFactory"/>
>>>     <filter class="solr.SnowballPorterFilterFactory" language="English"/>
>>>     <filter class="solr.ASCIIFoldingFilterFactory"/>
>>>   </analyzer>
>>>   <analyzer type="query">
>>>     <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
>>>     <tokenizer class="solr.ClassicTokenizerFactory"/>
>>>     <filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_en.txt"/>
>>>     <filter class="solr.LowerCaseFilterFactory"/>
>>>     <filter class="solr.EnglishPossessiveFilterFactory"/>
>>>     <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
>>>     <filter class="solr.StemmerOverrideFilterFactory" dictionary="lang/stemdict_en.txt"
>>>             ignoreCase="true"/>
>>>     <filter class="solr.EnglishMinimalStemFilterFactory"/>
>>>     <filter class="solr.SnowballPorterFilterFactory" language="English"/>
>>>     <filter class="solr.ASCIIFoldingFilterFactory"/>
>>>   </analyzer>
>>> </fieldType>
>>> 
>>> <fieldType class="solr.TextField" name="text_fr" positionIncrementGap="100">
>>>   <analyzer type="index">
>>>     <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
>>>     <tokenizer class="solr.ClassicTokenizerFactory"/>
>>>     <filter class="solr.SynonymFilterFactory" expand="false" ignoreCase="true"
>>>             synonyms="synonyms.txt" tokenizerFactory="solr.StandardTokenizerFactory"/>
>>>     <filter class="solr.ElisionFilterFactory" ignoreCase="true"
>>>             articles="lang/contractions_fr.txt"/>
>>>     <filter class="solr.LowerCaseFilterFactory"/>
>>>     <filter class="solr.StopFilterFactory" format="snowball" ignoreCase="true"
>>>             words="lang/stopwords_fr.txt"/>
>>>     <filter class="solr.FrenchMinimalStemFilterFactory"/>
>>>     <filter class="solr.StemmerOverrideFilterFactory" dictionary="lang/stemdict_fr.txt"
>>>             ignoreCase="true"/>
>>>     <filter class="solr.ASCIIFoldingFilterFactory"/>
>>>   </analyzer>
>>>   <analyzer type="query">
>>>     <charFilter class="solr.MappingCharFilterFactory" mapping="mapping-ISOLatin1Accent.txt"/>
>>>     <tokenizer class="solr.ClassicTokenizerFactory"/>
>>>     <filter class="solr.ElisionFilterFactory" ignoreCase="true"
>>>             articles="lang/contractions_fr.txt"/>
>>>     <filter class="solr.LowerCaseFilterFactory"/>
>>>     <filter class="solr.StopFilterFactory" format="snowball" ignoreCase="true"
>>>             words="lang/stopwords_fr.txt"/>
>>>     <filter class="solr.FrenchMinimalStemFilterFactory"/>
>>>     <filter class="solr.StemmerOverrideFilterFactory" dictionary="lang/stemdict_fr.txt"
>>>             ignoreCase="true"/>
>>>     <filter class="solr.ASCIIFoldingFilterFactory"/>
>>>   </analyzer>
>>> </fieldType>
>>> 
>>> -- 
>>> Sorry for being brief. Alternate email is rickleir at yahoo dot com
> 

