hadoop-mapreduce-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Sonal Goyal <sonalgoy...@gmail.com>
Subject Re: mapreduce for proxy log file analysis
Date Sun, 01 Aug 2010 14:57:38 GMT

Have you checked Hive? Seems to fit your needs perfectly.

Thanks and Regards,

On Sun, Aug 1, 2010 at 1:40 AM, Bright D L <brightdl@gmail.com> wrote:

> Hi all,
>        I am doing a simple project to analyze http proxy server logs by
> hadoop mapreduce approach (in Java). The log file contains logs for a week
> or some times more than that.
>        I  have following requirements:
>                1) Find the top 50 bandwidth consumers (IPs) for each day
>                2) Find the hour of the day where there is maximum bandwidth
> utilization
>        Please help me out with some directions. Sample code is highly
> appreciated.
> Thank you all,
> Bright

View raw message