hadoop-general mailing list archives

From "Hosseinzadeh, Jafar" <Jafar.Hosseinza...@childrens.harvard.edu>
Subject Re: How to get hadoop issues data for research?
Date Tue, 09 Dec 2014 15:13:52 GMT
Hi Feixue,

It all depends on how much data you have and what kind of work you plan
to do.  The easiest way to get a Hadoop cluster going is Serengeti.  You
can create it using VMware, and the performance is not too bad.  You can
also look at Amazon to host your project, but that will be more
expensive.  Here is a link that might help:


Please let me know if you need more information.


On 12/9/14 9:40 AM, "Akira AJISAKA" <ajisakaa@oss.nttdata.co.jp> wrote:

>You can use the REST API. Example:
>This general@ mailing list is for announcements and project management.
>For end-user questions and discussions, please use the user@ mailing list.
>(12/9/14, 18:22), zfx wrote:
>> Hi, all
>> I am a graduate student at Peking University, and our lab does research
>>on open source projects.
>> This is our introduction:
>> https://passion-lab.org/
>> We now need Hadoop issue data for our research. I found the issue list:
>> https://issues.apache.org/jira/issues/?jql=project%20%3D%20HADOOP
>> I want to download the Hadoop issue data. Could anyone tell me how to
>>download it? Or are there any links or an API for downloading the data?
>> Many thanks!
>> Best regards,
>> Feixue, Zhang
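
The REST API suggestion in the quoted reply could look roughly like the
sketch below. It is not the example from the original message. It assumes
that issues.apache.org serves the standard JIRA REST v2 search resource
under /jira/rest/api/2/search and that each response carries "issues" and
"total" fields. The function names (build_search_url, fetch_json,
iter_issues) are made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: Apache's JIRA exposes the standard JIRA REST v2 search resource here.
BASE_URL = "https://issues.apache.org/jira/rest/api/2/search"


def build_search_url(jql, start_at=0, max_results=50):
    """Return the search URL for one page of results (hypothetical helper)."""
    query = urlencode({"jql": jql, "startAt": start_at, "maxResults": max_results})
    return f"{BASE_URL}?{query}"


def fetch_json(url):
    """Download one page and decode the JSON body (performs a network request)."""
    with urlopen(url) as response:
        return json.load(response)


def iter_issues(jql, fetch=fetch_json, page_size=50):
    """Yield every issue matching the JQL query, requesting one page at a time."""
    start_at = 0
    while True:
        page = fetch(build_search_url(jql, start_at, page_size))
        issues = page.get("issues", [])
        if not issues:
            break  # nothing more to read
        yield from issues
        start_at += len(issues)
        if start_at >= page.get("total", 0):
            break  # reached the total reported by the server
```

Looping over iter_issues("project = HADOOP") would then walk the same issue
set as the JQL link in the original question. The fetch parameter is there
so the paging logic can be tried with a stub before sending real requests.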
