lucene-solr-user mailing list archives

From Praveen Babu <subramani....@gmail.com>
Subject Re: Solr Grouping, Aggregations and Custom Functions
Date Thu, 08 Sep 2016 04:11:21 GMT
Hi All,

I am also new to Solr. I have gone through the Solr documentation and tested
aggregations using Solr-Presto (Parallel SQL) and Stream.

I am getting a very good response from these two technologies. My concern is
that I am unable to group by a multi-valued field, which the standard Solr API
supports but the latest version of Solr-Presto/Stream does not.

I want to aggregate/group by the "app.name" field using Stream/Parallel SQL.
Please suggest how.

Input:

{
  id: 1,
  field1: [1,2,3],
  app.name: [watsapp,facebook,... ]
}

{
  id: 2,
  field1: [1,2,3],
  app.name: [watsapp,facebook,... ]
}



Expected result:

watsapp: 2
facebook: 2
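
The grouping semantics above can be sketched in plain Python. This is not Solr
code, only an illustration of the desired counts: each document contributes
once to every value in its multi-valued field. The helper name
count_by_multivalued is hypothetical, and the elided "..." values from the
input are left out.

```python
from collections import Counter

# Two sample documents mirroring the input above (elided "..." values omitted).
docs = [
    {"id": 1, "field1": [1, 2, 3], "app.name": ["watsapp", "facebook"]},
    {"id": 2, "field1": [1, 2, 3], "app.name": ["watsapp", "facebook"]},
]


def count_by_multivalued(docs, field):
    """Count documents per distinct value of a multi-valued field (hypothetical helper)."""
    counts = Counter()
    for doc in docs:
        # set() ensures a value repeated inside one document is counted once.
        counts.update(set(doc.get(field, [])))
    return counts


counts = count_by_multivalued(docs, "app.name")
for name, n in sorted(counts.items(), key=lambda kv: kv[0], reverse=True):
    print(f"{name}: {n}")
```

Any Stream or Parallel SQL answer would need to produce the same per-value
counts across the whole collection.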


I have 2 TB of data and want to execute this with aggmode=map_reduce. Any
suggestions?



Regards,
S.Praveen
Technical Architect
LinkedIn:
https://www.linkedin.com/in/praveen-babu-73232889?trk=nav_responsive_tab_profile




On Thu, Sep 8, 2016 at 6:01 AM, Roshni Rajagopal <roshknee80@gmail.com>
wrote:

> Hi Solr Gurus,
>
>        I have these requirements
>
> 1. Need to group data in Solr on multiple fields and compute aggregations
> like SUM(field)
>
> 2. Need to compute some custom calculations - sum(field1)/sum(field2) on
> the grouped data.
>
> Options I've tried
>
> 1. Group - this does not allow grouping by more than one field, and
> aggregations are not supported.
>
> 2. Stats - this, along with facet.pivot, gets results for basic group
> aggregations like SUM. Custom calculations are not supported. Also, the
> format is messy, with stats calculated at every level. Cannot paginate.
>
> 3. Facet JSON API - gets results for basic group aggregations like SUM.
> The format is less messy and we can paginate. Custom calculations like
> DIV(sum(field1), sum(field2)) are still not supported.
>
> So the last resort is the /sql handler for parallel queries. Is it tested
> and stable, and will it meet my requirements? I'm on Solr 6.10.
>
> Or would you recommend adding Spark... I would prefer to handle all
> requirements in Solr, as I don't want to maintain another moving part like
> Spark.
>
> Do advise!
>
> Regards
>
> Roshni
>
