Return-Path: X-Original-To: apmail-hive-dev-archive@www.apache.org Delivered-To: apmail-hive-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 80AA21043D for ; Thu, 8 Aug 2013 21:02:51 +0000 (UTC) Received: (qmail 83237 invoked by uid 500); 8 Aug 2013 21:02:50 -0000 Delivered-To: apmail-hive-dev-archive@hive.apache.org Received: (qmail 83188 invoked by uid 500); 8 Aug 2013 21:02:50 -0000 Mailing-List: contact dev-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@hive.apache.org Delivered-To: mailing list dev@hive.apache.org Received: (qmail 83178 invoked by uid 500); 8 Aug 2013 21:02:50 -0000 Delivered-To: apmail-hadoop-hive-dev@hadoop.apache.org Received: (qmail 83175 invoked by uid 99); 8 Aug 2013 21:02:50 -0000 Received: from arcas.apache.org (HELO arcas.apache.org) (140.211.11.28) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 08 Aug 2013 21:02:50 +0000 Date: Thu, 8 Aug 2013 21:02:50 +0000 (UTC) From: "Brenden Matthews (JIRA)" To: hive-dev@hadoop.apache.org Message-ID: In-Reply-To: References: Subject: [jira] [Commented] (HIVE-1545) Add a bunch of UDFs and UDAFs MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-1545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13733976#comment-13733976 ] Brenden Matthews commented on HIVE-1545: ---------------------------------------- Where's the rest of the source? > Add a bunch of UDFs and UDAFs > ----------------------------- > > Key: HIVE-1545 > URL: https://issues.apache.org/jira/browse/HIVE-1545 > Project: Hive > Issue Type: New Feature > Components: UDF > Reporter: Jonathan Chang > Assignee: Jonathan Chang > Priority: Minor > Attachments: core.tar.gz, ext.tar.gz, UDFEndsWith.java, UDFFindInString.java, UDFLtrim.java, UDFRtrim.java, udfs.tar.gz, udfs.tar.gz, UDFStartsWith.java, UDFTrim.java > > > Here some UD(A)Fs which can be incorporated into the Hive distribution: > UDFArgMax - Find the 0-indexed index of the largest argument. e.g., ARGMAX(4, 5, 3) returns 1. > UDFBucket - Find the bucket in which the first argument belongs. e.g., BUCKET(x, b_1, b_2, b_3, ...), will return the smallest i such that x > b_{i} but <= b_{i+1}. Returns 0 if x is smaller than all the buckets. > UDFFindInArray - Finds the 1-index of the first element in the array given as the second argument. Returns 0 if not found. Returns NULL if either argument is NULL. E.g., FIND_IN_ARRAY(5, array(1,2,5)) will return 3. FIND_IN_ARRAY(5, array(1,2,3)) will return 0. > UDFGreatCircleDist - Finds the great circle distance (in km) between two lat/long coordinates (in degrees). > UDFLDA - Performs LDA inference on a vector given fixed topics. > UDFNumberRows - Number successive rows starting from 1. Counter resets to 1 whenever any of its parameters changes. > UDFPmax - Finds the maximum of a set of columns. e.g., PMAX(4, 5, 3) returns 5. > UDFRegexpExtractAll - Like REGEXP_EXTRACT except that it returns all matches in an array. > UDFUnescape - Returns the string unescaped (using C/Java style unescaping). > UDFWhich - Given a boolean array, return the indices which are TRUE. > UDFJaccard > UDAFCollect - Takes all the values associated with a row and converts it into a list. Make sure to have: set hive.map.aggr = false; > UDAFCollectMap - Like collect except that it takes tuples and generates a map. > UDAFEntropy - Compute the entropy of a column. > UDAFPearson (BROKEN!!!) - Computes the pearson correlation between two columns. > UDAFTop - TOP(KEY, VAL) - returns the KEY associated with the largest value of VAL. > UDAFTopN (BROKEN!!!) - Like TOP except returns a list of the keys associated with the N (passed as the third parameter) largest values of VAL. > UDAFHistogram -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira