lucene-dev mailing list archives

From "Yonik Seeley (JIRA)" <j...@apache.org>
Subject [jira] Commented: (SOLR-153) Facet Index
Date Fri, 15 Oct 2010 17:44:33 GMT

    [ https://issues.apache.org/jira/browse/SOLR-153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12921460#action_12921460 ]

Yonik Seeley commented on SOLR-153:
-----------------------------------

I think this facet algorithm could do well when both the number of unique terms and the
number of values per document are high.  That's really the only case where our existing
algorithms fall down.

There's more info about how this should work, starting here:
http://search.lucidimagination.com/search/document/6ccbec5e602687ae/facet_optimizing
And then the comments in the code of course.

bq. How much work would it be to integrate your work into facets? E.g. to get an idea on
real data?

Not sure... it's been a long time, and I was brainstorming in code - I never tried running
it, so I guarantee there are tons of bugs.  Cool stuff though - wish I had time to work on
it again.

> Facet Index
> -----------
>
>                 Key: SOLR-153
>                 URL: https://issues.apache.org/jira/browse/SOLR-153
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Yonik Seeley
>         Attachments: facettree.patch, facettree.patch
>
>
> A facet index, initially for non-hierarchical facets.
> Start with all terms, and a set of documents for each term.  Group lower level nodes
> by taking the union of the sets, but keep track of the largest set going back all the way
> to the leaves (the max doc-freq for that node).
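
The structure described in the issue can be sketched roughly as follows. This is a minimal illustration in Python, not the code in facettree.patch: the names Node, build_tree and top_facets, the fixed fanout, and the pruning rule used at query time are all assumptions made for the example. Each leaf holds one term and its document set; each parent holds the union of its children's sets plus the largest single-term set size found below it (the max doc-freq). At query time, the count of any term under a node can be no larger than min(max doc-freq, size of the node's union intersected with the base set), so a subtree whose bound cannot beat the current k-th best count can be skipped.

```python
import heapq
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    docs: frozenset               # union of the doc sets of all terms below this node
    max_df: int                   # largest single-term doc set below this node
    term: Optional[str] = None    # set only on leaves
    children: List["Node"] = field(default_factory=list)


def build_tree(postings, fanout=2):
    """Build the tree bottom-up from a dict of term -> iterable of doc ids.

    The fanout and the term-sorted grouping are illustrative choices only.
    """
    if not postings:
        raise ValueError("postings must contain at least one term")
    level = []
    for term, docs in sorted(postings.items()):
        doc_set = frozenset(docs)
        level.append(Node(doc_set, len(doc_set), term=term))
    while len(level) > 1:
        parents = []
        for i in range(0, len(level), fanout):
            group = level[i:i + fanout]
            union = frozenset().union(*(n.docs for n in group))
            parents.append(Node(union, max(n.max_df for n in group), children=group))
        level = parents
    return level[0]


def top_facets(root, base_docs, k):
    """Return up to k (term, count) pairs with the highest counts in base_docs."""
    if k <= 0:
        return []
    base = frozenset(base_docs)
    heap = []  # min-heap of (count, term); heap[0] is the current k-th best

    def visit(node):
        overlap = len(node.docs & base)
        bound = min(node.max_df, overlap)  # upper bound for any term below node
        if len(heap) == k and bound <= heap[0][0]:
            return  # nothing below this node can beat the current k-th best
        if node.term is not None:
            if overlap > 0:  # for a leaf, overlap is the exact facet count
                if len(heap) < k:
                    heapq.heappush(heap, (overlap, node.term))
                elif overlap > heap[0][0]:
                    heapq.heapreplace(heap, (overlap, node.term))
            return
        # Visit the most promising children first so pruning kicks in sooner.
        for child in sorted(node.children, key=lambda n: n.max_df, reverse=True):
            visit(child)

    visit(root)
    ordered = sorted(heap, key=lambda e: (-e[0], e[1]))
    return [(term, count) for count, term in ordered]


postings = {"red": [1, 2, 3, 4], "green": [2, 3], "blue": [5], "pink": [5, 6]}
tree = build_tree(postings)
print(top_facets(tree, {1, 2, 3, 5}, k=2))  # [('red', 3), ('green', 2)]
```

In this toy run the "blue" leaf is never counted, because its bound of 1 cannot beat the second-best count of 2 already in the heap. Plain Python sets stand in for whatever document-set representation a real implementation would use.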

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



