lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Amit Jha <shanuu....@gmail.com>
Subject Re: Search strategy - improving search quality for short search terms such as "doll"
Date Wed, 16 Jan 2013 16:41:00 GMT
Its all about the data data set, here I mean index. If you have documents containing "toy"
and "doll" it will return that in result set. 

What I understood that you are talking about the context of the query. For example if you
search "books on MK Gandhi" and "books by MK Gandhi" both queries have different context.

Context based search at some level achieved by natural language processing. This one you can
look at for better search.

Look for solr wiki & mailing list would be great source of learning.


Rgds
AJ

On 16-Jan-2013, at 15:10, "David Parks" <davidparks21@yahoo.com> wrote:

> I'm a beginner-intermediate solr admin, I've set up the basics for our
> application and it runs well.
> 
> 
> 
> Now it's time for me to dig in and start tuning and improving queries.
> 
> 
> 
> My next target is searches on simple terms such as "doll" which, in google,
> would return documents about, well, "toy dolls", because that's the most
> common usage of the simple term "doll". But in my index it predominantly
> returns documents about CDs with the song "Doll Face", and "My baby doll" in
> them.
> 
> 
> 
> I'm not directly asking how to solve this as much as I'm asking what
> direction I should be looking in to learn what I need to know to tackle the
> general issue myself.
> 
> 
> 
> Left on my own I would start looking at categorizing the CD's into a facet
> called "music", reasonably doable in my dataset. Then I need to reduce the
> boost-value of the entire facet/category of music unless certain pre-defined
> query terms exist, such as [music, cd, song, listen, dvd, <analyze actual
> user queries to come up with a more exhaustive list>, etc.]. 
> 
> 
> 
> I don't yet know how to do all of this, but after a couple more good books I
> should be "dangerous".
> 
> 
> 
> So the question to this list:
> 
> 
> 
> -          Am I on the right track here?  If not, can you point me in a
> direction to go?
> 
> 
> 
> 
> 

Mime
View raw message