lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From donato <ddice...@outlook.com>
Subject Re: How on EARTH do I remove 's in schema file?
Date Sun, 19 Mar 2017 03:34:50 GMT
Thank you so much, Erick! I will try that!

I do have one other question though... what sections do I do all of this in? I see like four
or five sections with different things in them. Do I use all of those in each section or just
in some? What is each section? What do they do?

Thanks again for your time. Truly. Thank you!

________________________________
From: Erick Erickson [via Lucene] <ml-node+s472066n4325826h88@n3.nabble.com>
Sent: Saturday, March 18, 2017 11:29:49 PM
To: donato
Subject: Re: How on EARTH do I remove 's in schema file?

First, uncheck the "verbose" checkbox. The nitty-gritty information
isn't relevant at this point.

Second, hover over each of the light-gray like "MCF", "PRCF" and such.
You'll see the element of the analysis chain that stands for, and the
difference between the line before and this line is the effect of that
element. For instance, on the query side you see that "patrick" is
turned into "patrick", "patricks" and "patrick's" by "SF" which I'd
guess is your SynonymFilter. But hovering over that will tell you
exactly what element is producing those changes.

Then it looks like you're using HTMLStripCharFilter, MappingCharFilter
and PatternReplaceCharFilter (Factories all). Why do you think all
those are necessary?

So stop. Take a deep breath. My guess is that you've been trying a
bunch of different approaches and the interactions of all the
different parts are throwing you off. Start simple, with say
StandardTokenizerFactory
LowercaseFilterFactory
EnglishPosessiveFilterFactory
PorterStemFilterFactory

Use the analysis page and work your way toward complexity. Concentrate
on the indexing side first. Enter all three of your variants (jack
jacks jack's) in the box and press the button. Do not pass go. Do not
collection $200 until you see the effects of your changes on the
analysis page.

Your stated goal here is that all of your variants reduce to "jack" in
the example above. Don't bother querying until you see that result in
your index.

Tip: It is a bit clumsy to have to restart Solr every time you make
changes in your schema (although if you're running stand-alone you can
reload the core). So I often define several different field types with
different possibilities and compare them after a single reload.

Best,
Erick

On Sat, Mar 18, 2017 at 8:12 PM, vishal jain <[hidden email]</user/SendEmail.jtp?type=node&node=4325826&i=0>>
wrote:

> Try "stemEnglishPossessive" to remove.
>
> On Sat, Mar 18, 2017 at 4:00 AM, donato <[hidden email]</user/SendEmail.jtp?type=node&node=4325826&i=1>>
wrote:
>
>> I have been racking my brain for days... I need to remove 's from say
>> "patrick's" If I search for "patrick" or "patricks" I get the same number
>> of
>> results, however, if I search for "patrick's" it's a different number. I
>> just want solr to ignore the 'sCan someone PLEASE help me!!!! It is driving
>> me nuts!!!!Here is my schema file...
>> Id                          Name
>>
>>
>>
>> --
>> View this message in context: http://lucene.472066.n3.
>> nabble.com/How-on-EARTH-do-I-remove-s-in-schema-file-tp4325709.html
>> Sent from the Solr - User mailing list archive at Nabble.com.


________________________________
If you reply to this email, your message will be added to the discussion below:
http://lucene.472066.n3.nabble.com/How-on-EARTH-do-I-remove-s-in-schema-file-tp4325709p4325826.html
To unsubscribe from How on EARTH do I remove 's in schema file?, click here<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_code&node=4325709&code=ZGRpY2VjY2FAb3V0bG9vay5jb218NDMyNTcwOXwtMTcwNTcxMzYyNg==>.
NAML<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml>




--
View this message in context: http://lucene.472066.n3.nabble.com/How-on-EARTH-do-I-remove-s-in-schema-file-tp4325709p4325827.html
Sent from the Solr - User mailing list archive at Nabble.com.
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message