lucene-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Alejandro Marqués (Commented) (JIRA) <j...@apache.org>
Subject [jira] [Commented] (SOLR-1709) Distributed Date and Range Faceting
Date Wed, 18 Apr 2012 10:19:43 GMT

    [ https://issues.apache.org/jira/browse/SOLR-1709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13256419#comment-13256419
] 

Alejandro Marqués commented on SOLR-1709:
-----------------------------------------

Should the facet.mincount parameter work with distributed date faceting?

I have two Solr servers (http://localhost:8090/solr1/ and http://localhost:8090/solr2/) each
one with part of the docs from exampledocs

If I make the next query:
http://localhost:8090/solr1/select/?q=*%3A*&version=2.2&start=0&rows=10&indent=on&facet=true&facet.mincount=1&facet.date=manufacturedate_dt&facet.date.start=2005-05-01T00:00:00Z&facet.date.end=2006-05-00T00:00:00Z&facet.date.gap=%2B1MONTH

The mincount parameter works properly, however, if I add the shards parameter (shards=localhost:8090/solr1,localhost:8090/solr2
or even shards=localhost:8090/solr1):
http://localhost:8090/solr1/select/?q=*%3A*&version=2.2&start=0&rows=10&indent=on&shards=localhost:8090/solr1,localhost:8090/solr2&facet=true&facet.mincount=1&facet.date=manufacturedate_dt&facet.date.start=2005-05-01T00:00:00Z&facet.date.end=2006-05-00T00:00:00Z&facet.date.gap=%2B1MONTH

mincount parameter is being ignored retrieving facets with 0 results.

Should the mincount parameter work as in the single search or it is not supported for date
facets?

Thanks in advance
                
> Distributed Date and Range Faceting
> -----------------------------------
>
>                 Key: SOLR-1709
>                 URL: https://issues.apache.org/jira/browse/SOLR-1709
>             Project: Solr
>          Issue Type: Improvement
>          Components: SearchComponents - other
>    Affects Versions: 1.4
>            Reporter: Peter Sturge
>            Assignee: Simon Willnauer
>            Priority: Minor
>             Fix For: 3.6, 4.0
>
>         Attachments: FacetComponent.java, FacetComponent.java, ResponseBuilder.java,
SOLR-1709.patch, SOLR-1709_3x.patch, SOLR-1709_distributed_date_faceting_v3x.patch, solr-1.4.0-solr-1709.patch
>
>
> This patch is for adding support for date facets when using distributed searches.
> Date faceting across multiple machines exposes some time-based issues that anyone interested
in this behaviour should be aware of:
> Any time and/or time-zone differences are not accounted for in the patch (i.e. merged
date facets are at a time-of-day, not necessarily at a universal 'instant-in-time', unless
all shards are time-synced to the exact same time).
> The implementation uses the first encountered shard's facet_dates as the basis for subsequent
shards' data to be merged in.
> This means that if subsequent shards' facet_dates are skewed in relation to the first
by >1 'gap', these 'earlier' or 'later' facets will not be merged in.
> There are several reasons for this:
>   * Performance: It's faster to check facet_date lists against a single map's data, rather
than against each other, particularly if there are many shards
>   * If 'earlier' and/or 'later' facet_dates are added in, this will make the time range
larger than that which was requested
>         (e.g. a request for one hour's worth of facets could bring back 2, 3 or more
hours of data)
>     This could be dealt with if timezone and skew information was added, and the dates
were normalized.
> One possibility for adding such support is to [optionally] add 'timezone' and 'now' parameters
to the 'facet_dates' map. This would tell requesters what time and TZ the remote server thinks
it is, and so multiple shards' time data can be normalized.
> The patch affects 2 files in the Solr core:
>   org.apache.solr.handler.component.FacetComponent.java
>   org.apache.solr.handler.component.ResponseBuilder.java
> The main changes are in FacetComponent - ResponseBuilder is just to hold the completed
SimpleOrderedMap until the finishStage.
> One possible enhancement is to perhaps make this an optional parameter, but really, if
facet.date parameters are specified, it is assumed they are desired.
> Comments & suggestions welcome.
> As a favour to ask, if anyone could take my 2 source files and create a PATCH file from
it, it would be greatly appreciated, as I'm having a bit of trouble with svn (don't shoot
me, but my environment is a Redmond-based os company).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


Mime
View raw message