hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Christian Schneider <cschneiderpub...@gmail.com>
Subject Re: How to Archive the Task Logs (Stdout, Stderr, Syslogs)
Date Tue, 19 Mar 2013 11:01:08 GMT
Hi Jagat,
could you give me a short hint which source and Sink I should use?

Because it would be pretty good to have a 1:1 copy from the log folder of
the task nodes. Smth. like:
/<hostname of the
TaskTracker>/var/log/hadoop-0.20-mapreduce/userlogs/job_201303181503_0248/attempt_201303181503_0248_m_000023_0/*

Best Regards,
Christian.


2013/3/19 Christian Schneider <cschneiderpublic@gmail.com>

> Hi Jagat,
> Thank you. That sounds good. I will have a  look at it.
>
> Best Regards,
> Christian.
>
>
> 2013/3/19 Jagat Singh <jagatsingh@gmail.com>
>
>> Hello,
>>
>> You should be looking at Flume.
>>
>> Its made for this
>>
>> http://flume.apache.org/
>>
>> Thanks,
>>
>> Jagat Singh
>>
>>
>> On Tue, Mar 19, 2013 at 9:03 PM, Christian Schneider <
>> cschneiderpublic@gmail.com> wrote:
>>
>>> Hi,
>>> I found out that these logs are stored directly at the TaskNodes.
>>>
>>> We need to have them stored over a long time (some months or better a
>>> year). What is a good way of doing that?
>>>
>>> With my current knowledge I would write a cron job that picks up all the
>>> files every few minutes.
>>> But I guess thats not the best approach...
>>>
>>> Best Regards,
>>> Christian.
>>>
>>
>>
>

Mime
View raw message