Mailing-List: contact solr-user-help@lucene.apache.org; run by ezmlm
Precedence: bulk
Reply-To: solr-user@lucene.apache.org
Received-SPF: pass (athena.apache.org: domain of erickerickson@gmail.com
 designates 209.85.220.171 as permitted sender)
MIME-Version: 1.0
In-Reply-To: 
 <CACMMrEMi8cWf2_qwQvx8joRYr3i=MWKrdhBkdFrxQOjq_3Epaw@mail.gmail.com>
References: 
 <CACMMrEMi8cWf2_qwQvx8joRYr3i=MWKrdhBkdFrxQOjq_3Epaw@mail.gmail.com>
Date: Mon, 7 Oct 2013 21:43:16 -0400
Message-ID: 
 <CAN4YXvc1V8e2k4uBz33dOkkqFWbhGHrir4ZBVJcOPOm0tAfWvw@mail.gmail.com>
Subject: Re: Regarding edismax parsing
From: Erick Erickson <erickerickson@gmail.com>
To: solr-user@lucene.apache.org
Content-Type: text/plain; charset=ISO-8859-1

You're probably running into the distinction between
query parsing and analysis, which has been discussed many
times.

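The issue is that the query parser breaks the query up into individual
tokens on whitespace and _then_ sends them to the analyzer chain one
token at a time (usually). So a filter that needs to see several words
at once, such as a multi-word synonym, never gets the whole phrase.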
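
As a rough illustration only (a toy model in Python, not Solr's actual
code), the order of operations looks something like this. The synonym
rule and both function names are made up for the example:

```python
# Hypothetical multi-word synonym rule, for illustration only.
SYNONYMS = {"red shoes": "crimson footwear"}


def analyze(text):
    """Toy analysis chain: lowercase, then look up a synonym for the whole input."""
    text = text.lower()
    return SYNONYMS.get(text, text)


def parse_then_analyze(query):
    """Toy query parser: split on whitespace first, then analyze each piece alone."""
    return [analyze(piece) for piece in query.split()]


# The parser hands over "red" and "shoes" separately, so the rule never matches.
print(parse_then_analyze("red shoes"))  # ['red', 'shoes']

# The same analysis chain matches when it is given the whole phrase.
print(analyze("red shoes"))  # crimson footwear
```

Moving the synonym filter earlier in the analyzer chain doesn't change
this, because the split happens before the chain is ever called.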

Try escaping your spaces.
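
For example, something along these lines builds a request with the space
backslash-escaped. This is only a sketch: the host, the collection name
and the escape_spaces helper are assumptions, and field1/field2 are the
field names from your question.

```python
from urllib.parse import urlencode


def escape_spaces(text):
    """Backslash-escape each space so the query parser keeps the phrase as one chunk."""
    return text.replace(" ", "\\ ")


params = {
    "defType": "edismax",
    "q": escape_spaces("red shoes"),  # sent to Solr as: red\ shoes
    "qf": "field1 field2",            # field names taken from the question
    "debugQuery": "true",             # shows the parsed query in the response
}

# Hypothetical host and collection name; adjust for your installation.
url = "http://localhost:8983/solr/collection1/select?" + urlencode(params)
print(url)
```

Whether the synonym filter then sees the phrase still depends on the
tokenizer in the field's query analyzer: one that splits on whitespace
will break the chunk up again before the filter runs. The debugQuery
output is the quickest way to check what actually happened.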

Best
Erick

On Mon, Oct 7, 2013 at 8:28 AM, Prashant Golash
<prashant.golash@gmail.com> wrote:
> Hi,
>
> I have a question about how the edismax parser handles tokens, and a
> follow-up question on the same topic.
>
>    - Each field has a list of analyzers and tokenizers configured in
>    schema.xml (index and query time). Say I search for the query "red shoes".
>    When forming the disjunction query over the fields, does edismax first
>    apply the analyzers configured for each field and then build the
>    query? E.g., if field1's analysis changes red to rd and field2's
>    changes red to re, will the query be (field1:rd) | (field2:re)?
>
>
>    - If the above holds, then when I reordered the analyzers and put
>    "SynonymFilterFactory" first (in schema.xml), edismax still split the
>    query on spaces first and only then applied the synonym filter,
>    which makes me think this is not what happens.
>    My use case: before any tokenizer runs, I want to apply
>    phrase-level synonym replacement and then do the rest of the analysis.
>
> Thanks,
> Prashant Golash