hive-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "jayapriya surendran (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (HIVE-9688) Support SAMPLE operator in hive
Date Mon, 23 Mar 2015 04:14:12 GMT

    [ https://issues.apache.org/jira/browse/HIVE-9688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14375365#comment-14375365
] 

jayapriya surendran commented on HIVE-9688:
-------------------------------------------

Hello Prasanth,
Greetings! I'm Jayapriya Surendran, currently pursuing MS in Computer Engineering at San Jose
State University. I would like to work on this idea for Google Summer of Code 2015. I'm really
interested in distributed systems and I've learnt the basics of Hadoop and MapReduce from
this Udacity course (https://www.udacity.com/course/ud617) offered by Cloudera. I've familiarized
myself with Java,JUnit,Maven,IntelliJ and Git while implementing algorithms (https://github.com/jayapriya90/algorithms).
I'd be really thankful if you could help me out on how to start with this project.  

Thanks and Regards
Jayapriya Surendran

> Support SAMPLE operator in hive
> -------------------------------
>
>                 Key: HIVE-9688
>                 URL: https://issues.apache.org/jira/browse/HIVE-9688
>             Project: Hive
>          Issue Type: New Feature
>            Reporter: Prasanth Jayachandran
>              Labels: gsoc, gsoc2015, hive, java
>
> Hive needs SAMPLE operator to support parallel order by, skew joins and count + distinct
optimizations. Random, Reservoir and Stratified sampling should cover most of the cases.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message