lucene-solr-user mailing list archives

From Erick Erickson <erickerick...@gmail.com>
Subject Re: steps to improve search
Date Mon, 05 Jul 2010 01:07:44 GMT
Yes, when you change the index-time analysis in the schema,
you have to reindex the data. You can change the query-time
analysis without reindexing.
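
To see why, here is a minimal Python sketch of the "paying for it" case from the quoted thread below. It is a toy model, not Solr code: `STOP_WORDS`, `toy_stem`, `analyze`, and `matches` are hypothetical stand-ins for the stop filter, the stemmer, an analysis chain, and term lookup. The point is only that terms already in the index keep the form the old index-time chain gave them.

```python
# Toy stand-ins for Solr analysis chains; these are NOT Solr's real filters.
STOP_WORDS = {"for", "it"}  # assumed stop list, for illustration only


def toy_stem(token):
    # Crude suffix stripping, standing in for a Porter-style stemmer.
    return token[:-3] if token.endswith("ing") and len(token) > 5 else token


def analyze(text, stem):
    # Lowercase, split on whitespace, drop stop words, optionally stem.
    tokens = [t for t in text.lower().split() if t not in STOP_WORDS]
    return [toy_stem(t) if stem else t for t in tokens]


def matches(query_terms, indexed_terms):
    # A document matches only if every query term is present in the index.
    return all(t in indexed_terms for t in query_terms)


# The existing index was built with the old chain (stemming on).
indexed_terms = set(analyze("paying for it", stem=True))

# The chain was then edited to drop stemming, but the index was not rebuilt.
query_terms = analyze("paying for it", stem=False)

print(indexed_terms, query_terms, matches(query_terms, indexed_terms))

# After reindexing with the new chain, the terms line up again.
reindexed_terms = set(analyze("paying for it", stem=False))
print(matches(query_terms, reindexed_terms))
```

In this sketch the stale index holds "pay" while the query asks for "paying", so nothing matches until the document is reindexed with the new chain.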

Also, see this page:
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters
See the CommonGramsFilterFactory section, which contains
this tidbit:
<<<CommonGramsFilter is useful for issuing phrase queries (i.e. "the cat")
that contain stop words. Normally phrases containing stop words would not
match their intended target and instead, the query "the cat" would match all
documents containing "cat", which can be undesirable behavior.>>>
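
A rough Python sketch of the idea behind common grams, under simplifying assumptions: it ignores token positions, and `COMMON_WORDS` and `common_grams` are hypothetical names, not Solr's API. Each token is kept, and a joined two-word token is added whenever either word of an adjacent pair is a common word, so a phrase such as "the cat" still has a distinctive term to match on.

```python
# Assumed common-word list, for illustration only.
COMMON_WORDS = {"the", "a", "is", "for", "it"}


def common_grams(tokens, common=COMMON_WORDS, sep="_"):
    # Keep every token; add a bigram when either side is a common word.
    out = []
    for i, tok in enumerate(tokens):
        out.append(tok)
        if i + 1 < len(tokens):
            nxt = tokens[i + 1]
            if tok in common or nxt in common:
                out.append(tok + sep + nxt)
    return out


print(common_grams(["the", "cat"]))
print(common_grams(["paying", "for", "it"]))
```

With these assumptions, "the cat" yields the extra term "the_cat", and "paying for it" yields "paying_for" and "for_it", which a document containing only "paying" would not have.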

HTH
Erick

On Fri, Jul 2, 2010 at 11:38 AM, Frederico Azeiteiro <
Frederico.Azeiteiro@cision.com> wrote:

> Thanks Leonardo, I didn't know that tool, very good!
>
> So I see what is wrong:
>
> SnowballPorterFilterFactory and StopFilterFactory. (both used on index and
> query)
>
> I tried removing the Snowball filter and changing the stop filter to
> "ignoreCase=false" on QUERY, and restarted Solr.
>
> But now I get no results :(.
>
> On index analysis I get (result of filters):
> paying  for     it
> paying
> paying
> paying
> pay
>
> For Query analysis (result of filters):
> paying  for     it
> paying  for     it
> paying
> paying
> paying
>
> Does this mean that, in the end, the indexed term is "pay" and the searched
> term is "paying"?
>
> Is it necessary to reindex the data?
>
> Thanks
>
> -----Original Message-----
> From: Leonardo Menezes [mailto:leonardo.menezess@googlemail.com]
> Sent: Friday, July 2, 2010 12:58
> To: solr-user@lucene.apache.org
> Subject: Re: steps to improve search
>
> most likely due to:
> EnglishPorterFilterFactory
> RemoveDuplicatesTokenFilterFactory
> StopFilterFactory
>
> you get those "fake" matches. Try the analysis section of the admin
> interface: there you can "simulate" indexing or searching a document and
> see how it is actually indexed/searched. It will give you some clues...
>
> On Fri, Jul 2, 2010 at 1:50 PM, Frederico Azeiteiro <
> Frederico.Azeiteiro@cision.com> wrote:
>
> > For the example given, I need the full expression "paying for it", so
> > yes all the words.
> > -----Original Message-----
> > From: Ahmet Arslan [mailto:iorixxx@yahoo.com]
> > Sent: Friday, July 2, 2010 12:30
> > To: solr-user@lucene.apache.org
> > Subject: RE: steps to improve search
> >
> > > I need to know how to achieve more accurate queries (like
> > > the example below...) using these filters.
> >
> > Do you want all the terms you search for to appear in the returned
> > documents?
> >
> > You can change the default operator of the QueryParser to AND, either in
> > schema.xml or by appending &q.op=AND to your search URL. I am assuming you
> > are not using dismax.
> >
> >
> >
> >
>
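
The `&q.op=AND` suggestion quoted above can be sketched as a small Python helper that builds the search URL. The host, port, and path in `SOLR_SELECT_URL` are assumptions for illustration, and `build_query_url` is a hypothetical name; only the `q` and `q.op` parameters come from the thread.

```python
from urllib.parse import urlencode

# Hypothetical base URL; adjust host, port, and path to your deployment.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def build_query_url(user_query, require_all_terms=True):
    # q carries the user query; q.op=AND makes every term mandatory.
    params = {"q": user_query}
    if require_all_terms:
        params["q.op"] = "AND"
    return SOLR_SELECT_URL + "?" + urlencode(params)


url = build_query_url("paying for it")
print(url)
```

As the quoted message notes, this assumes the standard query parser rather than dismax.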
