lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Lance Norskog (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-2242) Get distinct count of names for a facet field
Date Mon, 13 Jun 2011 05:40:51 GMT

    [ https://issues.apache.org/jira/browse/SOLR-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13048471#comment-13048471
] 

Lance Norskog commented on SOLR-2242:
-------------------------------------

Yeah, my itch started just now also :)

"Constraint" means any facet value: terms, numerical ranges, query results.

Range queries have the same situation: when I give range endpoints and a gap, I want to know
how many intervals it made from the gap.That would be the analog of this count. 

I'm not saying this patch has to do range counts also, but pointing out the eventual scope
of this feature. Therefore, 'numTerms' is not the word we're looking for. 'count' or 'total'
seem right.

Below, both *features:{* and *popularity:{* need counts. 
 
{code}
"facet_counts":{
    "facet_queries":{
      "*:*":27},
    "facet_fields":{
      "features":[
        "facet_terms",[
          "2",7,]]}
    "facet_ranges":{
      "popularity":{
        "counts":[
          "0",3,
          "2",0,
          "4",1,
          "6",9],
        "gap":2,
        "start":0,
        "end":8}}}}
{code}


p.s.
I got the above from the example electronic shop database with this query:
[click to see|http://localhost:8983/solr/select/?q=*%3A*&version=2.2&start=0&rows=0&indent=on&facet.field=popularity&facet=true&facet.numTerms=true&facet.query=*:*&wt=json&facet.range.start=0&facet.range.end=7&facet.range.gap=2&facet.range=popularity]

> Get distinct count of names for a facet field
> ---------------------------------------------
>
>                 Key: SOLR-2242
>                 URL: https://issues.apache.org/jira/browse/SOLR-2242
>             Project: Solr
>          Issue Type: New Feature
>          Components: Response Writers
>    Affects Versions: 4.0
>            Reporter: Bill Bell
>            Priority: Minor
>             Fix For: 4.0
>
>         Attachments: SOLR-2242.patch, SOLR-2242.shard.patch, SOLR-2242.solr3.1.patch,
SOLR.2242.solr3.1.patch, SOLR.2242.v2.patch
>
>
> When returning facet.field=<name of field> you will get a list of matches for distinct
values. This is normal behavior. This patch tells you how many distinct values you have (#
of rows). Use with limit=-1 and mincount=1.
> The feature is called "namedistinct". Here is an example:
> http://localhost:8983/solr/select?q=*:*&facet=true&facet.field=manu&facet.mincount=1&facet.limit=-1&f.manu.facet.namedistinct=0&facet.field=price&f.price.facet.namedistinct=1
> Here is an example on field "hgid" (without namedistinct):
> {code}
> - <lst name="facet_fields">
> - <lst name="hgid">
>   <int name="HGPY0000045FD36D4000A">1</int> 
>   <int name="HGPY00000FBC6690453A9">1</int> 
>   <int name="HGPY00001E44ED6C4FB3B">1</int> 
>   <int name="HGPY00001FA631034A1B8">1</int> 
>   <int name="HGPY00003317ABAC43B48">1</int> 
>   <int name="HGPY00003A17B2294CB5A">5</int> 
>   <int name="HGPY00003ADD2B3D48C39">1</int> 
>   </lst>
>   </lst>
> {code}
> With namedistinct (HGPY0000045FD36D4000A, HGPY00000FBC6690453A9, HGPY00001E44ED6C4FB3B,
HGPY00001FA631034A1B8, HGPY00003317ABAC43B48, HGPY00003A17B2294CB5A, HGPY00003ADD2B3D48C39).
This returns number of rows (7), not the number of values (11).
> {code}
> - <lst name="facet_fields">
> - <lst name="hgid">
>   <int name="_count_">7</int> 
>   </lst>
>   </lst>
> {code}
> This works actually really good to get total number of fields for a group.field=hgid.
Enjoy!

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message