Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id B9445E519 for ; Wed, 6 Feb 2013 09:27:13 +0000 (UTC) Received: (qmail 724 invoked by uid 500); 6 Feb 2013 09:27:13 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 586 invoked by uid 500); 6 Feb 2013 09:27:13 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 578 invoked by uid 500); 6 Feb 2013 09:27:13 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 575 invoked by uid 99); 6 Feb 2013 09:27:13 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 06 Feb 2013 09:27:13 +0000 Date: Wed, 6 Feb 2013 09:27:13 +0000 (UTC) From: "PRETTY SITHARA (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Updated] (HIVE-2238) Support for Median and Mode UDAFs MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-2238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] PRETTY SITHARA updated HIVE-2238: --------------------------------- Attachment: input.txt Test_Cases.tar.gz Attaching the test cases and input text file for HIVE-2238 > Support for Median and Mode UDAFs > --------------------------------- > > Key: HIVE-2238 > URL: https://issues.apache.org/jira/browse/HIVE-2238 > Project: Hive > Issue Type: New Feature > Components: UDF > Reporter: Travis Powell > Labels: patch > Attachments: HIVE-2238.1.patch.txt, input.txt, Test_Cases.tar.gz > > > Median and Mode are essential functions for reducing/refining the data set, and would allow for greater control over the selection of data. More involved analytics are probably best handled by relational databases or OLAP cubes, but Median and Mode are very practical for Hive solely in terms of delivering a smaller data set, where items selected only have a certain mode. (Rows that describe an object to which the table is joined where that object has a column value frequency threshold.) > Comments are more than welcome. Would be happy to support. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira