hadoop-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Anthony Ikeda" <Anthony.Ik...@cardlink.com.au>
Subject RE: Active-Active Performance
Date Tue, 25 May 2010 02:12:37 GMT
Thanks Hemanth,

In regards to different locations of the HADOOP home this is low
priority more for testing not production. I was trying to install HADOOP
for testing over 2 machines with only a Windows XP machine running
Cygwin and a Mac running Darwin. Not a priority.

In regards to my last question about operating in a detached fashion, we
are trying to factor in what happens when the link between both sites is
cut. Will both sites operate independently until the connection is
re-established? Is there any particular setup required to ensure we can
cover this scenario or is it an out-of-the-box feature?


-----Original Message-----
From: Hemanth Yamijala [mailto:yhemanth@gmail.com] 
Sent: Tuesday, 25 May 2010 12:08 PM
To: general@hadoop.apache.org
Subject: Re: Active-Active Performance


I'm new to Hadoop and I've been given the task to see how we might
> Hadoop and HBase to implement an Active-Active site layer for sharing
> information across a distributed application.
> I've been able to:
> *         Install and get Hadoop running on a single node and am in
> process of configure a 2 node setup.
> *         Install HBase on a single node and create a table and
mapping as
> well as insert data into the system
> Once I've got the mutli-node configured I hope to run some tests as
> I've noticed that trying to start Hadoop in distributed mode, the
> will ssh to the master to start it as well (bin/start-all.sh) provided
> same path is setup on the remote machine.
> Questions:
> Can I configure the system IF the Hadoop installation is not in the
> location per machine?

I would think configuring and managing such a system would get very
- for e.g. if you'll want to add nodes to expand in future. You would
not be able to take advantage of the very helpful scripts that come with
Hadoop. Is there a reason why you want to do this ?

> If the master node goes down (say due to electrical fault or system
> how do the slave nodes react? Will they continue to run? Will the
nodes be
> back in sync once the master starts again?

Hadoop slaves will continue. They will enter a retry loop trying to
to the master until it comes up. In doing so, they could fill up log
very fast though. If the master starts with the same configuration,
host, ports), they should be able to connect and resume.

> Would I require a particular configuration to ensure that both our
> can operate within the cluster as well as in a detached fashion (due
> maintenance or network issues)?
I did not quite follow this. Can you explain a little more about how you
want to setup your system ?


This e-mail has been scanned for viruses by MCI's Internet Managed 
Scanning Services - powered by MessageLabs. For further information 
visit http://www.mci.com

This e-mail message and any attachments are intended only for the use of the addressee(s)
named above and may contain information that is privileged and confidential. If you are not
the intended recipient, any display, dissemination, distribution, or copying is strictly prohibited.
  If you believe you have received this e-mail message in error, please immediately notify
the sender by replying to this e-mail message or by telephone to (02) 9646 9222. Please delete
the email and any attachments and do not retain the email or any attachments in any form.

View raw message