lucene-solr-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Mikhail Khludnev <mkhlud...@griddynamics.com>
Subject Re: Solr 4.0 - Join performance
Date Thu, 02 Aug 2012 18:38:13 GMT
Hello,

You can check my record.
https://issues.apache.org/jira/browse/SOLR-3076?focusedCommentId=13415644&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13415644

I'm still working on precise performance measurement.

On Thu, Aug 2, 2012 at 6:45 PM, Eric Khoury <ekhoury72@hotmail.com> wrote:

>
>
>
>
>
>
> Hello all,
>
>
>
> I’m testing out the new join feature, hitting some perf
> issues, as described in Erick’s article (
> http://architects.dzone.com/articles/solr-experimenting-join).
>
> Basically, I’m using 2 objects in solr (this is a simplified
> view):
>
>
>
> Item
>
> - Id
>
> - Name
>
>
>
> Grant
>
> - ItemId
>
> - AvailabilityStartTime
>
> - AvailabilityEndTime
>
>
>
> Each item can have multiple grants attached to it.
>
>
>
> The query I'm using is the following, to find items by
> name, filtered by grants availability window:
>
>
>
> solr/select?fq=Name:XXX&q={!join
> from=ItemId to=Id} AvailabilityStartTime:[* TO NOW] AND
> -AvailabilityEndTime:[*
> TO NOW]
>
>
>
> With a hundred thousand items, this query can take multiple seconds
> to perform, due to the large number or ItemIds returned from the join
> query.
>
> Has anyone come up with a better way to use joins for these types of
> queries?  Are there improvements planned in 4.0 rtm in this area?
>
>
>
> Btw, I’ve explored simply adding Start-End times to items, but
> the flat data model makes it hard to maintain start-end pairs.
>
>
>
> Thanks for the help!
>
> Eric.
>
>
>
>




-- 
Sincerely yours
Mikhail Khludnev
Tech Lead
Grid Dynamics

<http://www.griddynamics.com>
 <mkhludnev@griddynamics.com>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message