lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Bill Bell (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SOLR-2242) Get distinct count of names for a facet field
Date Thu, 28 Apr 2011 03:52:03 GMT

    [ https://issues.apache.org/jira/browse/SOLR-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13026103#comment-13026103
] 

Bill Bell commented on SOLR-2242:
---------------------------------

Lance Norskog,

What do you want it to be called? I would use a committer to take this issue on. It has several
votes, and lots of downloads. People are using it successfully already.

Do you want me to switch the numFacetTerms to numFacetNames ? Anything else? I feel like we
are going in circles on this issue.

{code}

This will output the numFacetTerms AND hgid:
http://localhost:8983/solr/select?q=*:*&facet=true&facet.field=hgid&facet.mincount=1&f.hgid.facet.numFacetTerms=2

<lst name="facet_fields">
  <lst name="hgid">
   <int name="numFacetTerms">7</int>  <!-- this is not 11 -->
   <lst name="counts">
   	<int name="HGPY0000045FD36D4000A">1</int>
   	<int name="HGPY00000FBC6690453A9">1</int>
   	<int name="HGPY00001E44ED6C4FB3B">1</int>
   	<int name="HGPY00001FA631034A1B8">1</int>
   	<int name="HGPY00003317ABAC43B48">1</int>
   	<int name="HGPY00003A17B2294CB5A">5</int>
   	<int name="HGPY00003ADD2B3D48C39">1</int>
   </lst>
  </lst>
</lst>

{code}

> Get distinct count of names for a facet field
> ---------------------------------------------
>
>                 Key: SOLR-2242
>                 URL: https://issues.apache.org/jira/browse/SOLR-2242
>             Project: Solr
>          Issue Type: New Feature
>          Components: Response Writers
>    Affects Versions: 4.0
>            Reporter: Bill Bell
>            Priority: Minor
>             Fix For: 4.0
>
>         Attachments: SOLR-2242.patch, SOLR.2242.solr3.1.patch, SOLR.2242.v2.patch
>
>
> When returning facet.field=<name of field> you will get a list of matches for distinct
values. This is normal behavior. This patch tells you how many distinct values you have (#
of rows). Use with limit=-1 and mincount=1.
> The feature is called "namedistinct". Here is an example:
> http://localhost:8983/solr/select?q=*:*&facet=true&facet.field=manu&facet.mincount=1&facet.limit=-1&f.manu.facet.namedistinct=0&facet.field=price&f.price.facet.namedistinct=1
> Here is an example on field "hgid" (without namedistinct):
> {code}
> - <lst name="facet_fields">
> - <lst name="hgid">
>   <int name="HGPY0000045FD36D4000A">1</int> 
>   <int name="HGPY00000FBC6690453A9">1</int> 
>   <int name="HGPY00001E44ED6C4FB3B">1</int> 
>   <int name="HGPY00001FA631034A1B8">1</int> 
>   <int name="HGPY00003317ABAC43B48">1</int> 
>   <int name="HGPY00003A17B2294CB5A">5</int> 
>   <int name="HGPY00003ADD2B3D48C39">1</int> 
>   </lst>
>   </lst>
> {code}
> With namedistinct (HGPY0000045FD36D4000A, HGPY00000FBC6690453A9, HGPY00001E44ED6C4FB3B,
HGPY00001FA631034A1B8, HGPY00003317ABAC43B48, HGPY00003A17B2294CB5A, HGPY00003ADD2B3D48C39).
This returns number of rows (7), not the number of values (11).
> {code}
> - <lst name="facet_fields">
> - <lst name="hgid">
>   <int name="_count_">7</int> 
>   </lst>
>   </lst>
> {code}
> This works actually really good to get total number of fields for a group.field=hgid.
Enjoy!

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message