lucene-java-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From parnab kumar <parnab.2...@gmail.com>
Subject Re: Boolean Search Query is not workng
Date Fri, 23 Jan 2015 15:31:16 GMT
Hi,

     While indexing , a norm value is calculated for each field  and
injected in the index. This norm value is used as field level boosting
which is also multiplied with other factors like tf-idf and query level
boost which you specify with setBoost. so you see setting boosting is one
of the several factors. Moreover, the ranking model used also influences
the scoring. Lucene default ranking which is traditional vector space model
is known to bias short documents. Experiment with BM 25 also.

Additionally, if you are used to using Solr, try using Dismax query parser
which is known to address most of the limitations of the default lucene
query parser and scoring mechanism.


Using Luke.. check the norm for each fields. To prevent norms to be
injected during indexing ... index with option No Norms.

Thanks,
Parnab..
CSE, IIT-KGP
BIS, UCC, Ireland

On Fri, Jan 23, 2015 at 2:54 PM, Rajendra Rao <rajendra.rao@launchship.com>
wrote:

> Hello
> Reply to the mail, sent by Nitin We tried and this is what we got :
>
> My query was dotNet^10.0 Resume:jdbc Resume:C# Resume:MVC
>
> Documents:
> .1. java jdbc
> 2.Java dotNet
> 3.java jdbc apache
> 4.java dotNet
> 5.java
>          6java jdbc apache dotNet C# MVC
>          7.jakarta
> 8",apache
> 9.java jdbc apache dotNet C# MVC  tomcat;
>         10java jdbc apache dotNet",
>
> and result is doc id.1)  score :0.031800564  which is unexpected as
> weightage is given to Dotnet
>
> Thanks
>
>
>
>
> On Fri, Jan 23, 2015 at 4:34 PM, Ian Lea <ian.lea@gmail.com> wrote:
>
> > Use IndexSearcher.explain() to help figure out what matched, why.  And
> > watch out for typos: jakarta != jakarata.
> >
> > If you still can't figure it out, post here a very small completely
> > self-contained program or test case, using RAMDirectory, that
> > demonstrates the problem.
> >
> >
> > --
> > Ian.
> >
> >
> > On Fri, Jan 23, 2015 at 10:27 AM, Priyanka Tufchi
> > <priyanka.tufchi@launchship.com> wrote:
> > > Hi Ian
> > >
> > > I tried with what u sent
> > >
> > > Query-java~5 jakarta~5 apache  tomcat
> > > Document : "1", "java jakarta tomcat"
> > >  "2", java jakarata""
> > > "3", "java jakarta  apache"
> > >
> > >  Score  : 1 :0.27094576
> > > 3 :0.27094576
> > > 2 :0.010494952
> > >
> > >
> > > If we go  by query it is giving same score ..It is not working.
> > >
> > > Thanks
> > > Priyanka
> > >
> > >
> > > On Fri, Jan 23, 2015 at 3:19 PM, Ian Lea <ian.lea@gmail.com> wrote:
> > >
> > >> How about "home~10 house~10 flat". See
> > >> http://lucene.apache.org/core/4_10_3/queryparser/index.html
> > >>
> > >>
> > >> --
> > >> Ian.
> > >>
> > >>
> > >> On Fri, Jan 23, 2015 at 7:17 AM, Priyanka Tufchi
> > >> <priyanka.tufchi@launchship.com> wrote:
> > >> > Hi ALL
> > >> >
> > >> > I am  working on a project which uses lucene for searching . I am
> > >> > struggling with boolean based Query : Actual Scenario is
> > >> >
> > >> > e.g
> > >> >  In Query, if I give house home flat
> > >> >  then
> > >> >  inside It should search house or home or flat  but I want to give
> > them
> > >> > with weightage  like house and home  should get high weigh and flat
> > >> should
> > >> > get less then rest.
> > >> > If document contain "Home"  .Lucene search should not go for house
> > and
> > >> > flat.
> > >> >
> > >> > I searched on Internet for some good stuff but not able to find any
> > code
> > >> > sample or proper syntax for reference .
> > >> >
> > >> >
> > >> > Thanks
> > >> > Priyanka
> > >> >
> > >> > --
> > >> > Launchship Technology  respects your privacy. This email is intended
> > only
> > >> > for the use of the party to which it is addressed and may contain
> > >> > information that is privileged, confidential, or protected by law.
> If
> > you
> > >> > have received this message in error, or do not want to receive any
> > >> further
> > >> > emails from us, please notify us immediately by replying to the
> > message
> > >> and
> > >> > deleting it from your computer.
> > >>
> > >> ---------------------------------------------------------------------
> > >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > >> For additional commands, e-mail: java-user-help@lucene.apache.org
> > >>
> > >>
> > >
> > > --
> > > Launchship Technology  respects your privacy. This email is intended
> only
> > > for the use of the party to which it is addressed and may contain
> > > information that is privileged, confidential, or protected by law. If
> you
> > > have received this message in error, or do not want to receive any
> > further
> > > emails from us, please notify us immediately by replying to the message
> > and
> > > deleting it from your computer.
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > For additional commands, e-mail: java-user-help@lucene.apache.org
> >
> >
>
> --
> Launchship Technology  respects your privacy. This email is intended only
> for the use of the party to which it is addressed and may contain
> information that is privileged, confidential, or protected by law. If you
> have received this message in error, or do not want to receive any further
> emails from us, please notify us immediately by replying to the message and
> deleting it from your computer.
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message