lucene-solr-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Shalin Shekhar Mangar (JIRA)" <>
Subject [jira] Updated: (SOLR-1682) Implement CollapseComponent
Date Wed, 30 Dec 2009 10:41:30 GMT


Shalin Shekhar Mangar updated SOLR-1682:

    Attachment: SOLR-236.patch

Here's an implementation based on [Yonik's suggestion|].

This is just a PoC and not fit to be committed. This implementation uses one pass for collapse.threshold=1
and two passes for collapse.threshold>1 so it should be a lot faster than the previous
method. Though, I haven't benchmarked yet. Memory consumption should be proportional to start+count
instead of index size.

What is covered:
# Non-adjacent collapsing
# collapse.threshold
# [New response format|]
# Includes DocSetAwareCollector interface from SOLR-1680

What is not covered:
# Adjacent collapsing
# Aggregate functions (should be easy to add)
# Faceting (it doesn't keep/return the docsets needed for FacetComponent)
# Caching
# This implementation does not return the correct numFound

The response adds special fields to only the first document in a group. Here's a sample of
the first document in a group:
      <int name="id">1</int>
      <str name="name_s1">author1</str>
      <str name="title_s1">a tree</str>
      <date name="timestamp">2009-12-30T10:16:51.944Z</date>
      <arr name="multiDefault">
      <int name="intDefault">42</int>
      <str name="collapse.value">author1</str>
      <int name="collapse.count">1</int>
      <float name="score">0.67107505</float>

See for example usage.

> Implement CollapseComponent
> ---------------------------
>                 Key: SOLR-1682
>                 URL:
>             Project: Solr
>          Issue Type: Sub-task
>          Components: search
>            Reporter: Martijn van Groningen
>            Assignee: Shalin Shekhar Mangar
>             Fix For: 1.5
>         Attachments: field-collapsing.patch, SOLR-236.patch
> Child issue of SOLR-236. This issue is dedicated to field collapsing in general and all
its code (CollapseComponent, DocumentCollapsers and CollapseCollectors). The main goal is
the finalize the request parameters and response format.

This message is automatically generated by JIRA.
You can reply to this email to add a comment to the issue online.

View raw message