hadoop-mapreduce-issues mailing list archives

From "Arnab Guin (JIRA)" <j...@apache.org>
Subject [jira] [Updated] (MAPREDUCE-5591) K-ranker
Date Mon, 21 Oct 2013 23:34:42 GMT

     [ https://issues.apache.org/jira/browse/MAPREDUCE-5591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arnab Guin updated MAPREDUCE-5591:
----------------------------------

    Attachment: k-ranking.tgz

> K-ranker 
> ---------
>
>                 Key: MAPREDUCE-5591
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5591
>             Project: Hadoop Map/Reduce
>          Issue Type: Task
>          Components: examples
>    Affects Versions: 2.2.0
>            Reporter: Arnab Guin
>         Attachments: k-ranking.tgz
>
>
> Hi,
> I recently wrote some code to find the K largest integers for each group.
> Given one or more input files containing lines of the following form:
> "key",value
> where key is a string
>       value is any integer
> the program prints the top K elements corresponding to each key.
> e.g.
> "a",1
> "b",1
> "a",2
> "a",5
> "b",17
> "c",5
> "b",6
> if k = 2, the program prints
> "a" [2,5]
> "b" [6,17]
> "c" [5]
> Compile steps:
> mvn clean
> mvn package javadoc:javadoc
> Run steps:
> hadoop jar <ranking jar file> <main class> <K> <input directory> <output directory>
> e.g. hadoop jar target/ranking-1.0-SNAPSHOT.jar org.ml.MaxKRanker 5 data/input data/output
> I wanted to know whether there is a component (examples, maybe) where the code can be contributed. I am also open to any suggestions for improvements.
> Thanks,
> Arnab
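
For readers without the attachment, the top-K-per-key computation described in the issue can be sketched in plain Java. This is only an illustrative, single-process sketch and not the code in k-ranking.tgz: the class name TopKPerKey and the method topK are hypothetical, and it assumes each line ends in ",<integer>" with everything before the last comma treated as the key. It keeps a min-heap of at most K values per key, which is one common way a reducer could bound memory for this kind of task.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.PriorityQueue;
import java.util.TreeMap;

// Hypothetical illustration; not the attached MaxKRanker implementation.
public class TopKPerKey {

    // Returns, for each key, its k largest values in ascending order.
    // Assumes every well-formed line looks like: "key",<integer>
    static Map<String, List<Integer>> topK(List<String> lines, int k) {
        Map<String, PriorityQueue<Integer>> heaps = new TreeMap<>();
        for (String line : lines) {
            int comma = line.lastIndexOf(',');
            if (comma < 0) {
                continue; // skip lines without a key/value separator
            }
            String key = line.substring(0, comma).trim();
            int value = Integer.parseInt(line.substring(comma + 1).trim());
            // Min-heap: the smallest retained value sits at the head.
            PriorityQueue<Integer> heap =
                    heaps.computeIfAbsent(key, unused -> new PriorityQueue<>());
            heap.offer(value);
            if (heap.size() > k) {
                heap.poll(); // evict the smallest so at most k values remain
            }
        }
        Map<String, List<Integer>> result = new TreeMap<>();
        for (Map.Entry<String, PriorityQueue<Integer>> entry : heaps.entrySet()) {
            List<Integer> values = new ArrayList<>(entry.getValue());
            Collections.sort(values); // heap iteration order is not sorted
            result.put(entry.getKey(), values);
        }
        return result;
    }

    public static void main(String[] args) {
        // Sample input taken from the issue description.
        List<String> input = Arrays.asList(
                "\"a\",1", "\"b\",1", "\"a\",2", "\"a\",5",
                "\"b\",17", "\"c\",5", "\"b\",6");
        topK(input, 2).forEach((key, values) ->
                System.out.println(key + " " + values));
    }
}
```

In a MapReduce job, the same per-key heap logic would typically live in the reducer (and possibly a combiner), with the mapper emitting each parsed key and value; how the attached code splits this work is not stated in the issue.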



--
This message was sent by Atlassian JIRA
(v6.1#6144)
