lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bill Bell <billnb...@gmail.com>
Subject Re: [jira] Commented: (SOLR-2242) Get distinct count of names for a facet field
Date Tue, 15 Mar 2011 01:16:12 GMT
Thanks. 

Not sure how to get the facet distinct count without looping, but I'll
look into that. Not sure what "constraints" means?

I agree that you should not have to specify limit, but mincount should
apply, since many times I want 1 or higher.

Would be always include this or just add it as an option?

f.hgid.facet.namedistinct=1 ?

Proposed:
{code}
"facet fields" : {"hgid" : {
  "missing" : 25,
  "namedistinct" : 1250,
  "counts" : ["constraint",10,...]
}}
{code}


Then we add others as needed?

Or do you mean?

f.hgid.facet.constraints = namedistinct() with the option to specify more
than one?

f.hgid.facet.constraints = namedistinct(),missing()


Proposed:
{code}
"facet fields" : {"hgid" : {
  "constraints" : ["missing()",25,"namedistinct()",1250],
  "counts" : ["constraint",10,...]
}}
{code}



On 3/14/11 7:05 PM, "Yonik Seeley (JIRA)" <jira@apache.org> wrote:

>
>    [ 
>https://issues.apache.org/jira/browse/SOLR-2242?page=com.atlassian.jira.pl
>ugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13006750#comm
>ent-13006750 ] 
>
>Yonik Seeley commented on SOLR-2242:
>------------------------------------
>
>It feels like we should have an option to return the number of
>constraints that match the criteria (mincount, etc) w/o having to specify
>facet.limit=-1, and you should be able to get this info in addition to
>the normal facet counts.  We can also improve the efficiency by not
>building the complete list in memory just to return it's count.
>
>We've also talked before about having an extra metadata level for each
>facet.
>
>Current:
>{code}
>"facet fields" : {"hgid" : ["constraint",10,...]}
>{code}
>
>Proposed:
>{code}
>"facet fields" : {"hgid" : {
>  "missing" : 25,
>  "constraints" : 1250,
>  "counts" : ["constraint",10,...]
>}}
>{code}
>
>> Get distinct count of names for a facet field
>> ---------------------------------------------
>>
>>                 Key: SOLR-2242
>>                 URL: https://issues.apache.org/jira/browse/SOLR-2242
>>             Project: Solr
>>          Issue Type: New Feature
>>          Components: Response Writers
>>    Affects Versions: 4.0
>>            Reporter: Bill Bell
>>            Priority: Minor
>>             Fix For: 4.0
>>
>>         Attachments: SOLR-2242-distinctFacet.patch
>>
>>
>> When returning facet.field=<name of field> you will get a list of
>>matches for distinct values. This is normal behavior. This patch tells
>>you how many distinct values you have (# of rows). Use with limit=-1 and
>>mincount=1.
>> The feature is called "namedistinct". Here is an example:
>> 
>>http://localhost:8983/solr/select?q=*:*&facet=true&facet.field=manu&facet
>>.mincount=1&facet.limit=-1&f.manu.facet.namedistinct=0&facet.field=price&
>>f.price.facet.namedistinct=1
>> Here is an example on field "hgid" (without namedistinct):
>> {code}
>> - <lst name="facet_fields">
>> - <lst name="hgid">
>>   <int name="HGPY0000045FD36D4000A">1</int>
>>   <int name="HGPY00000FBC6690453A9">1</int>
>>   <int name="HGPY00001E44ED6C4FB3B">1</int>
>>   <int name="HGPY00001FA631034A1B8">1</int>
>>   <int name="HGPY00003317ABAC43B48">1</int>
>>   <int name="HGPY00003A17B2294CB5A">5</int>
>>   <int name="HGPY00003ADD2B3D48C39">1</int>
>>   </lst>
>>   </lst>
>> {code}
>> With namedistinct (HGPY0000045FD36D4000A, HGPY00000FBC6690453A9,
>>HGPY00001E44ED6C4FB3B, HGPY00001FA631034A1B8, HGPY00003317ABAC43B48,
>>HGPY00003A17B2294CB5A, HGPY00003ADD2B3D48C39). This returns number of
>>rows (7), not the number of values (11).
>> {code}
>> - <lst name="facet_fields">
>> - <lst name="hgid">
>>   <int name="_count_">7</int>
>>   </lst>
>>   </lst>
>> {code}
>> This works actually really good to get total number of fields for a
>>group.field=hgid. Enjoy!
>
>--
>This message is automatically generated by JIRA.
>For more information on JIRA, see: http://www.atlassian.com/software/jira
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
>For additional commands, e-mail: dev-help@lucene.apache.org
>



---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message