hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Josh Patterson <j...@cloudera.com>
Subject Re: Hadoop and Cloud computing
Date Wed, 11 Aug 2010 16:57:57 GMT
Hadoop and MapReduce should not be thought of in the realm of only
text based processing; projects like the openPDC have shown Map Reduce
to be a very good at processing TBs of high resolution timeseries data
(from binary formats). Take a look at the "powered by" page on
hadoop.apache.org to look for other ideas as well.

In terms of architecture, there are some interesting discussions in
the HBase realm about dealing with hot spots and how HBase and HDFS
work together. You might ask Jonathan Gray or Michael Stack over in
the hbase irc channel about what they are facing, I'm sure they would
provide you with an interesting discussion.


On Wed, Aug 11, 2010 at 12:42 PM, Jackob Carlsson
<jackob.carlsson@gmail.com> wrote:
> Hi Josh,
> I would say the second case. As you know MapReduce algorithms more or less
> fits with text pressing but I'm looking for some issues such as large scale
> data handling. I would appreciate if you can point me to some related topics
> if you know.
> Best regards,
> Jackob
> On Wed, Aug 11, 2010 at 5:10 PM, Josh Patterson <josh@cloudera.com> wrote:
>> Jackob,
>> Are you looking for problems to solve with Map Reduce on Hadoop or
>> open problems to be solved in the architecture of hadoop?
>> Josh Patterson
>> Cloudera
>> On Tue, Aug 10, 2010 at 10:00 AM, Jackob Carlsson
>> <jackob.carlsson@gmail.com> wrote:
>> > Hi,
>> >
>> > I am trying to write a thesis proposal about my PhD about usage of hadoop
>> in
>> > cloud computing. I need to find some open problems in cloud computing
>> which
>> > can be addressed by hadoop. I would appreciate if somebody could help me
>> to
>> > find some topics.
>> >
>> > Thanks in advance
>> > Jackob
>> >

View raw message