hadoop-hive-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Jerome Boulon (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HIVE-259) Add PERCENTILE aggregate function
Date Tue, 23 Feb 2010 17:50:28 GMT

     [ https://issues.apache.org/jira/browse/HIVE-259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Jerome Boulon updated HIVE-259:
-------------------------------

    Attachment: Percentile.xlsx
                jb2.txt

Percentile test file + validation using Excep Percentile function:
CREATE TABLE JB2
(
duration bigint,
code string
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ' ' LINES TERMINATED BY '\n'
    STORED AS TEXTFILE;

LOAD DATA LOCAL INPATH '/jb2.txt' INTO TABLE JB2;



Result:
hive> select percentile(duration,"25,50,99") from JB2;
Ended Job = job_201002201654_0006
OK
[14.0,33.0,416.4000000000001]
Time taken: 36.261 seconds

hive> select code,percentile(duration,"25,50,99") from JB2 group by code;
Ended Job = job_201002201654_0007
OK
a	[2.0,17.5,427.2299999999999]
b	[22.75,44.5,345.84999999999997]
c	[18.0,29.0,58.760000000000005]
Time taken: 23.419 seconds
hive> quit;


> Add PERCENTILE aggregate function
> ---------------------------------
>
>                 Key: HIVE-259
>                 URL: https://issues.apache.org/jira/browse/HIVE-259
>             Project: Hadoop Hive
>          Issue Type: New Feature
>          Components: Query Processor
>            Reporter: Venky Iyer
>            Assignee: Jerome Boulon
>         Attachments: HIVE-259-2.patch, HIVE-259.1.patch, HIVE-259.patch, jb2.txt, Percentile.xlsx
>
>
> Compute atleast 25, 50, 75th percentiles

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message