lucene-solr-user mailing list archives

From Erick Erickson <erickerick...@gmail.com>
Subject Re: Multi Word searches in Solr
Date Wed, 17 Nov 2010 20:56:04 GMT
Nope, you've got it right. Parentheses are what's necessary. This is
actually similar to the Lucene world if you consider <defaultSearchField>
in your schema.xml to be equivalent to specifying a default field when
you instantiate a parser. But that's a stretch.....
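
To illustrate the scoping rule discussed here, below is a toy Python sketch
(not Lucene's or Solr's actual parser) of how a field prefix attaches only to
the term or parenthesized group that immediately follows it. The function name
assign_fields and the default field name "text" are assumptions made up for
this illustration, and operators such as OR and AND are not modeled.

```python
import re

# Assumption: "text" stands in for whatever default search field is configured.
DEFAULT_FIELD = "text"


def assign_fields(query, default_field=DEFAULT_FIELD):
    """Toy model: return (field, term) pairs for a simple query string.

    A "field:" prefix applies to the single term or the parenthesized
    group that follows it; every other term falls back to default_field.
    """
    tokens = re.findall(r"\w+:|\(|\)|\w+", query)
    clauses = []
    pending = None      # field prefix waiting for its term or group
    group_field = None  # field that applies inside the current parentheses
    in_group = False
    for tok in tokens:
        if tok.endswith(":"):
            pending = tok[:-1]
        elif tok == "(":
            group_field = pending or default_field
            pending = None
            in_group = True
        elif tok == ")":
            group_field = None
            in_group = False
        else:
            field = group_field if in_group else (pending or default_field)
            pending = None
            clauses.append((field, tok))
    return clauses


print(assign_fields("abstract: mouse genome informatics"))
# [('abstract', 'mouse'), ('text', 'genome'), ('text', 'informatics')]
print(assign_fields("abstract: (mouse genome informatics)"))
# [('abstract', 'mouse'), ('abstract', 'genome'), ('abstract', 'informatics')]
```

In the first call only "mouse" is tied to abstract; the remaining terms go to
the default field, which matches the behaviour described in the quoted
messages.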

That said, it is surprising that you are getting exactly the same
number of results, unless your default field isn't one that's
actually in your documents....
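
One way to check how a query was actually parsed is Solr's debugQuery=true
request parameter, which adds the parsed query to the response. The sketch
below only builds the request URLs and does not contact a server. The base
URL http://localhost:8983/solr/select and the helper name debug_url are
assumptions, so adjust them to your installation.

```python
from urllib.parse import urlencode

# Assumption: a local Solr instance with the standard /select handler.
SOLR_SELECT = "http://localhost:8983/solr/select"


def debug_url(query, base=SOLR_SELECT):
    """Build a select URL that asks Solr to report how it parsed the query."""
    params = {"q": query, "debugQuery": "true"}
    return base + "?" + urlencode(params)


# Compare the parsed form of the two query styles from this thread.
print(debug_url("abstract: mouse genome informatics"))
print(debug_url("abstract:(mouse genome informatics)"))
```

Comparing the parsed query in the two responses should show which field each
term ended up in.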

Best
Erick

On Wed, Nov 17, 2010 at 1:48 PM, Matthew Hall <mhall@informatics.jax.org> wrote:

> Oh and to clarify what I expect to see.
>
> I expect to see the terms in a multiword query combined with ORs
> (unless I've set the default to be AND in the solrconfig.xml).
>
> I'm guessing that what I need to do here is wrap the query in
> parentheses for any field where I expect multi-word queries. This
> isn't what I was expecting, having come from Lucene before this.
>
> So instead of my query looking like this in the Solr admin
>
> abstract: mouse genome informatics
>
> I really need it looking like:
>
> abstract: (mouse genome informatics)
>
> so that the query parser understands that these are multiple terms for
> a single field.
>
> Or am I misunderstanding here?
>
> Matt
>
>
> On 11/17/2010 1:44 PM, Matthew Hall wrote:
>
>> I'm getting the result set that matches what it would be if I just
>> searched for the first word in the query.
>>
>> So I'm getting the results for mouse.
>>
>> And yes, abstract: is the name of the field.
>>
>> So a search for
>>
>> abstract: mouse
>>
>> would yield 69103 results
>>
>> abstract: mouse anythingelseIputhere
>> yields 69103 results, even if the second token doesn't exist anywhere in
>> the indexes.
>>
>> abstract: mouse verycommonterm
>> yields 69103 results, even if the second token would normally result in
>> say 150000 matches when searched on its own.
>>
>> I think what you describe as possible is what is happening: the query
>> parser is taking what's being passed to it
>>
>> abstract: foo bar
>> and really converting it into
>>
>> abstract: foo OR default:bar
>>
>> Matt
>>
>> On 11/17/2010 1:38 PM, kenf_nc wrote:
>>
>>> Multi-word queries are the bread and butter of Solr/Lucene, so I'm not
>>> sure I understand the complete issue here. For clarity, is 'abstract'
>>> the name of your default text field, or is your query
>>>
>>> q=abstract: mouse genome
>>>
>>> If the latter, my thought is that the query may be getting converted
>>> into q=abstract:mouse genome, where mouse is looked for in the field
>>> abstract and genome is compared to the default text field. This is a
>>> stab in the dark; I don't know what your data looks like.
>>>
>>> You say it doesn't work the way you expect, but you don't really say what
>>> you do see. Are you getting zero results, or fewer than you expected, or
>>> only results that match all fields (the AND proposition)?
>>>
>>
>>
>
>
