hive-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stephen Sprague <sprag...@gmail.com>
Subject Re: computing median and percentiles
Date Wed, 19 Mar 2014 23:58:08 GMT
not a hive question is it?   its more like a math question.



On Wed, Mar 19, 2014 at 1:30 PM, Seema Datar <sdatar@yahoo-inc.com> wrote:

>
>
>   I understand the percentile function is supported in Hive in the latest
> versions. However, how does once calculate percentiles when the data is
> across two columns. So say -
>
>  Value  Count
>
>  100 2   ( so basically 100 occurred twice)
> 200 4
> 300 1
> 400 6
> 500 3
>
>
>  I want to find out the 0.25 percentile for the value distribution. How
> can I do it using the Hive percentile function?
>
>
>

Mime
View raw message