hadoop-hdfs-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From xiaofei du <xiaofei.du...@gmail.com>
Subject Re: what kind of improvement for HDFS could possibly be done within 3 months
Date Tue, 28 Sep 2010 12:25:03 GMT
Hi Thomas,

Thanks for your suggestions, I would take well consideration of them.
However, I need coding work for finishing my diploma project. Thus, writing
documentation may not be the good choice for me. And my work needs to be
related to HDFS. For example, I know random modification of file is not
supported, I don't know if this function could be added to HDFS (if it is
feasible). If it is feasible, how is the workload? is it hard? could it be
finished within 3 months?

I am wondering if someone is already working on random modification or
something related to HDFS. And I may need some guidance from the veteran.

On Tue, Sep 28, 2010 at 6:05 PM, Thomas Koch <thomas@koch.ro> wrote:

> xiaofei du:
> > Hi All,
> >
> > I am a graduate student, I am preparing for my diploma project. I have
> > about 3 months to finish the project. I want to do some work on HDFS.
> > However, I have no concept what I could do for improving HDFS. So could
> you
> > guys please give me some suggestions?
> >
> > I hope the suggested project could be done within 3 months, I cannot
> afford
> > more time. So the project should not be too hard (at the time, it should
> > not be easy, otherwise, I cannot reach the graduation requirement :-) )
> >
> > thank you !!!
> Hi,
> you could write a developer documentation of the inner workings of HDFS
> (+HBASE, +MAPREDUCE?) that could be understood by HDFS users. Additionally
> to
> the documentation of the current state, you could include:
> - Different strategies to make the NameNode distributed
> - The different Approaches to append
> - How does Security with Kerberos work?
> One of the challenges of such a work would be to make it as easy as
> possible
> for developers to understand some part of HDFS they're interested in.
> Another challenge is to choose a documentation format and workflow that
> would
> make it easy to keep this documentation current without much effort.
> A totally other project that I also consider important for Hadoop: Help
> Apache
> to implement an infrastructure based on GIT. This could help many projects
> in
> the long run. If you're interested in this, you should subscribe to
> infrastructure-dev@apache.org and get in contact with Jukka Zitting
> <jukka.zitting@gmail.com>.
> Best regards,
> Thomas Koch, http://www.koch.ro

Best wishes,
Xiaofei Du(Gregory)

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message