hama-dev mailing list archives

From "Edward J. Yoon" <edwardy...@apache.org>
Subject Re: Fault tolerant test
Date Thu, 27 Sep 2012 09:19:17 GMT
Interesting. Was it a "Too many open files" error, or something else?

On Thu, Sep 27, 2012 at 6:13 PM, Yuesheng Hu <yueshenghu@gmail.com> wrote:
> Jobs with TB-scale or hundreds of GB of data, and long-running jobs.
> I tested a 200GB dataset for kmeans this afternoon; every superstep took
> about 30 minutes (our cluster is small), and it occasionally throws a
> "Filesystem closed" exception.
>
> 2012/9/27 Edward J. Yoon <edwardyoon@apache.org>
>
>> Hi,
>>
>> Today I tested Hama TRUNK on a 1152-core cluster; everything seems OK
>> except for the AvroMessageManager and memory issues.
>>
>> I'm planning to test the new FT system in 2 weeks (I'll be on vacation
>> next week). So, could you please let me know where I should
>> concentrate my efforts?
>>
>> --
>> Best Regards, Edward J. Yoon
>> @eddieyoon
>>



-- 
Best Regards, Edward J. Yoon
@eddieyoon
