lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Erick Erickson <erickerick...@gmail.com>
Subject Re: Doubts in Result Grouping in solr 3.6.1
Date Fri, 07 Sep 2012 04:32:34 GMT
Grouping isn't defined for tokenized fields I don't think. See:
http://wiki.apache.org/solr/FieldCollapsing where it says for
group.field:
"..The field must currently be single-valued..."

Are you sure you don't want faceting?

Best
Erick

On Tue, Sep 4, 2012 at 5:27 AM, mechravi25 <mechravi25@yahoo.co.in> wrote:
> Hi,
>
> I am currently using solr 3.6.1 version and for indexing data, i am using
> the data import handler for 3.5 because of the reason posted in the
> following forum link
> http://lucene.472066.n3.nabble.com/Dataimport-Handler-in-solr-3-6-1-td4001149.html
>
> I am trying to achieve result grouping based on a field "grpValue" which has
> value like this "Name XYZ|Company". There are totally 359 docs that were
> indexed and the field "grpValue" in all the 359 docs contains the word
> "Company" in its value.
>
> I gave the following in my schema.xml for splitting the word while indexing
> and querying
>
> <fieldType name="groupField" class="solr.TextField"
> positionIncrementGap="100">
>  <analyzer type="index">
> <tokenizer class="solr.PatternTokenizerFactory" pattern="\s+|\|"/>
>
>       <filter class="solr.LowerCaseFilterFactory"/>
> <filter class="solr.StopFilterFactory" ignoreCase="true"
> words="stopwords_new.txt" enablePositionIncrements="true" />
>  </analyzer>
>  <analyzer type="query">
>
>  <tokenizer class="solr.PatternTokenizerFactory" pattern="\s+|\|"/>
>       <filter class="solr.LowerCaseFilterFactory"/>
> <filter class="solr.StopFilterFactory" ignoreCase="true"
> words="stopwords_new.txt" enablePositionIncrements="true" />
>  </analyzer>
> </fieldType>
>
>
> I am trying to split the words if I have a single space or an “|” symbol in
> my data when i use the pattern="\s+|\|" in PatternTokenizerFactory.
>
> When I gave the analyze option in solr, the sample value was split inot 3
> words "Name","XYZ","Company" in both my index and query analyzer.
>
> When i gave the following url
>
> http://localhost:8080/solr/core1/select/?q=*%3A*&version=2.2&start=0&rows=359&indent=on&group=true&group.field=grpValue&group.limit=0
>
> I noticed that I have a grouping name called Company which has numFound as
> 73 but the particular field "grpValue" has the word "Company" in its value
> in all the 359 docs. Ideally, i should have got 359 docs as numFound under
> my group
>
> - <lst name="grouped">
> - <lst name="grpValue">
>   <int name="matches">359</int>
> - <arr name="groups">
> - <lst>
>   <str name="groupValue">Company</str>
>   <result name="doclist" numFound="73" start="0" />
>   </lst>
>
> Please someone guide me as to why only 73 docs is present in that group
> instead of 359.
>
> I also noticed that when I counted the numFound in all the groups, it
> totalled upto 359.
>
>
> Please guide me on this and I am not sure what I am missing. Please let me
> know in case more details is needed.
>
> Thanks in advance.
>
>
>
>
>
>
> --
> View this message in context: http://lucene.472066.n3.nabble.com/Doubts-in-Result-Grouping-in-solr-3-6-1-tp4005239.html
> Sent from the Solr - User mailing list archive at Nabble.com.

Mime
View raw message