Return-Path: X-Original-To: apmail-lucene-dev-archive@www.apache.org Delivered-To: apmail-lucene-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 5C20F19D43 for ; Sun, 10 Apr 2016 16:06:27 +0000 (UTC) Received: (qmail 9113 invoked by uid 500); 10 Apr 2016 16:06:25 -0000 Delivered-To: apmail-lucene-dev-archive@lucene.apache.org Received: (qmail 9054 invoked by uid 500); 10 Apr 2016 16:06:25 -0000 Mailing-List: contact dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@lucene.apache.org Delivered-To: mailing list dev@lucene.apache.org Received: (qmail 9044 invoked by uid 99); 10 Apr 2016 16:06:25 -0000 Received: from arcas.apache.org (HELO arcas) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Sun, 10 Apr 2016 16:06:25 +0000 Received: from arcas.apache.org (localhost [127.0.0.1]) by arcas (Postfix) with ESMTP id 808282C14E1 for ; Sun, 10 Apr 2016 16:06:25 +0000 (UTC) Date: Sun, 10 Apr 2016 16:06:25 +0000 (UTC) From: "Joel Bernstein (JIRA)" To: dev@lucene.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Created] (SOLR-8965) Add Path reduce operation to aggregate paths taken in a session MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 Joel Bernstein created SOLR-8965: ------------------------------------ Summary: Add Path reduce operation to aggregate paths taken in a session Key: SOLR-8965 URL: https://issues.apache.org/jira/browse/SOLR-8965 Project: Solr Issue Type: New Feature Reporter: Joel Bernstein One of the things it's tricky to do at scale is session aggregation. MapReduce of course makes this easy. Now that we have MapReduce it would be good to add some session aggregations to the base library. The Path reduce operation can be used with the *reduce* function to concatenate the path taken in a session into a single field. These path records can then be added to another SolrCloud collection using the update stream. Once they have been consolidated in another collection aggregations can be run on the paths using the RollupStream. A HashRollupStream could also be developed to aggregate the paths as they are reduced. The HashRollupStream would keep all the paths in a hash map during the aggregation so it would not require the paths to be received in order. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org For additional commands, e-mail: dev-help@lucene.apache.org