lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Vernon Chapman <chapman.li...@gmail.com>
Subject Re: Problems with synonyms
Date Tue, 31 Mar 2009 20:20:38 GMT
Walter,

Thanks for clarifying my mistake there.
I wouldn't want to send someone down the wrong path.

Thanks
Vernon


On 3/31/09 4:17 PM, "Walter Underwood" <wunderwood@netflix.com> wrote:

> It looks like you are using synonyms at query time. Don't do that, it
> works very strangely. Only use them at index time. That does the right
> matching and also gives the right idf for scoring.
> 
> More details are here:
> 
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#head-2c461ac74b4
> ddd82e453dc68fcfc92da77358d46
> 
> wunder
> 
> On 3/31/09 1:04 PM, "Vernon Chapman" <chapman.lists@gmail.com> wrote:
> 
>> Leonardo,
>> 
>> The only other thing I can think of is check the
>> Field type in the schema.xml file make sure that you are using the same
>> filters.
>> 
>> For example if in your index analyzer you use the solr.SynonymFilterFactory
>> filter make sure your query analyzer also uses the same filter class.
>> 
>> Other than that I am stuck, hope that helps
>> 
>> Vernon
>> 
>> 
>> 
>> On 3/31/09 3:39 PM, "Leonardo Dias" <leonardo@catho.com.br> wrote:
>> 
>>> Hi, Vernon!
>>> 
>>> We tried both approaches: OR and AND. In both cases, the results were
>>> smaller
>>> when the synonyms was set up, with no change at all when it comes to
>>> synonyms.
>>> 
>>> Any other ideas? Is it likely to be a bug?
>>> 
>>> Best,
>>> 
>>> Leonardo
>>> 
>>> Vernon Chapman escreveu:
>>>>  
>>>> Leonardo,
>>>> 
>>>> I am no expert but I would check to make sure that the
>>>> DefaultOperator parameter in your schema.xml file is set to
>>>> OR rather thank AND.
>>>> 
>>>> Vernon
>>>> 
>>>> On 3/31/09 3:24 PM, "Leonardo Dias" <leonardo@catho.com.br>
>>>> <mailto:leonardo@catho.com.br>  wrote:
>>>> 
>>>>   
>>>>  
>>>>>  
>>>>> Hello there. How are you guys?
>>>>> 
>>>>> We're having problems with synonyms here and I thought that maybe you
>>>>> guys could help us on how SOLR works for synonyms.
>>>>> 
>>>>> The problem is the following: I'd like to setup a synonym like "dba,
>>>>> database administrator".
>>>>> 
>>>>> Instead of increasing the number of results for the keyword "dba", the
>>>>> results got smaller and it only brought me back results that had both
>>>>> the keywords "dba" and "database administrator" at the same time instead
>>>>> of bringing back both "dba" and "database administrator" as expected
>>>>> since our synonym configuration is using expand=true.
>>>>> 
>>>>> Since in the past this was not the expected behavior, I'd like to know
>>>>> whether something changed in the solr/lucene internals so that this
>>>>> functionality is now lost, or if I'm doing something wrong with my setup.
>>>>> 
>>>>> Currently all fields pass through the Synonym filter factory. The
>>>>> analysis shows me that it tries to search for database administrator
and
>>>>> DBA. A debug query also shows me that the query it's trying to do is
>>>>> something like this:
>>>>> 
>>>>> +DisjunctionMaxQuery((title:"(dba datab) administr")~0.1)
>>>>> DisjunctionMaxQuery((title:"(dba datab) administr"^100000.0 |
>>>>> observation:"(dba datab) administr"^10.0 | description:"(dba datab)
>>>>> administr"^10.0 | company:"(dba datab) administr")~0.1)
>>>>> 
>>>>> The problem is: when I search for this, I get 5 results. When I search
>>>>> for dba only, without the "dba, database administrator" line in the
>>>>> synonyms.txt file, I get more than 100 results.
>>>>> 
>>>>> Do you guys know why this is happening?
>>>>> 
>>>>> Thank you,
>>>>> 
>>>>> Leonardo
>>>>>     
>>>>>  
>>>>  
>>>> 
>>>> 
>>>> 
>>>>   
>>>>  
>> 
>> 
>> on 3/31/09 3:39 pm, "leonardo dias" <leonardo@catho.com.br> wrote:
>> 
>> hi, vernon!
>> 
>> we tried both approaches: or and and. in both cases, the results were
>> smaller when the synonyms was set up, with no change at all when it comes to
>> synonyms.
>> 
>> any other ideas? is it likely to be a bug?
>> 
>> best,
>> 
>> leonardo
>> 
>> vernon chapman escreveu:
>>  
>> leonardo,
>> 
>> i am no expert but i would check to make sure that the
>> defaultoperator parameter in your schema.xml file is set to
>> or rather thank and.
>> 
>> vernon
>> 
>> on 3/31/09 3:24 pm, "leonardo dias" <leonardo@catho.com.br>
>> <mailto:leonardo@catho.com.br>  wrote:
>> 
>>   
>>  
>>  
>> hello there. how are you guys?
>> 
>> we're having problems with synonyms here and i thought that maybe you
>> guys could help us on how solr works for synonyms.
>> 
>> the problem is the following: i'd like to setup a synonym like "dba,
>> database administrator".
>> 
>> instead of increasing the number of results for the keyword "dba", the
>> results got smaller and it only brought me back results that had both
>> the keywords "dba" and "database administrator" at the same time instead
>> of bringing back both "dba" and "database administrator" as expected
>> since our synonym configuration is using expand=true.
>> 
>> since in the past this was not the expected behavior, i'd like to know
>> whether something changed in the solr/lucene internals so that this
>> functionality is now lost, or if i'm doing something wrong with my setup.
>> 
>> currently all fields pass through the synonym filter factory. the
>> analysis shows me that it tries to search for database administrator and
>> dba. a debug query also shows me that the query it's trying to do is
>> something like this:
>> 
>> +disjunctionmaxquery((title:"(dba datab) administr")~0.1)
>> disjunctionmaxquery((title:"(dba datab) administr"^100000.0 |
>> observation:"(dba datab) administr"^10.0 | description:"(dba datab)
>> administr"^10.0 | company:"(dba datab) administr")~0.1)
>> 
>> the problem is: when i search for this, i get 5 results. when i search
>> for dba only, without the "dba, database administrator" line in the
>> synonyms.txt file, i get more than 100 results.
>> 
>> do you guys know why this is happening?
>> 
>> thank you,
>> 
>> leonardo
>>     
>>  
>>  
>> 
>> 
>> 
>>   
>> 
> 



Mime
View raw message