hadoop-hdfs-user mailing list archives

From Alex Lee <eliy...@hotmail.com>
Subject RE: Realtime sensor's tcpip data to hadoop
Date Thu, 15 May 2014 09:50:11 GMT
There are so many choices: OpenTSDB, Flume, ActiveMQ, Lustre, Splunk,
Ganglia, or Nagios. I am not sure which one fits best.
Date: Fri, 9 May 2014 15:46:24 +0200
Subject: Re: Realtime sensor's tcpip data to hadoop
From: dechouxb@gmail.com
To: user@hadoop.apache.org; user@flume.apache.org

Flume is indeed something you should look into. 'Log files' is a simplification. Flume really
handles events, and yes, logs are a common kind of event, but not the only one.
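
To make the "events, not just logs" point concrete, here is a minimal sketch of how a sensor's TCP byte stream could be cut into one event per reading. It is not Flume's API: the Event class and the frames_to_events function are hypothetical names used only for illustration, and the sketch assumes each reading is a newline-terminated line, which may not match the actual sensor protocol.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, Iterator


@dataclass
class Event:
    """Hypothetical stand-in for an event: string headers plus an opaque byte body."""
    body: bytes
    headers: Dict[str, str] = field(default_factory=dict)


def frames_to_events(chunks: Iterable[bytes], sensor_id: str) -> Iterator[Event]:
    """Turn raw TCP chunks into one event per newline-terminated reading.

    TCP delivers arbitrary chunks, so a reading may be split across two
    of them; the incomplete tail is buffered until its newline arrives.
    Assumption: readings are newline-delimited (not stated in the thread).
    """
    buffer = b""
    for chunk in chunks:
        buffer += chunk
        while b"\n" in buffer:
            line, buffer = buffer.split(b"\n", 1)
            line = line.rstrip(b"\r")  # tolerate CRLF line endings
            if line:  # skip empty lines
                yield Event(body=line, headers={"sensor": sensor_id})
    # Bytes left in the buffer here never received a newline and are dropped.


if __name__ == "__main__":
    # Simulated chunks as they might arrive from one sensor's socket.
    incoming = [b"t=21.5\nt=2", b"1.7\r\n", b"t=22"]
    for event in frames_to_events(incoming, sensor_id="s1"):
        print(event.headers, event.body)
```

Once readings are framed this way, each one is just a small record with metadata, which is the kind of unit an event pipeline can route to a store without first writing intermediate files.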

On Wed, May 7, 2014 at 4:48 AM, Alex Lee <eliyart@hotmail.com> wrote:

Sensors may send TCP/IP data to a server. Each sensor may send its data as a stream, and
both the number of sensors and the data rate are high.
First, how can the TCP/IP data be put into Hadoop? It needs some processing and then has to
be stored in HBase. Does it have to be saved to data files first and then loaded into Hadoop,
or can it be ingested directly from TCP/IP? Is there a software module that takes care of this?
My searches suggested that Ganglia, Nagios, and Flume may do it. But looking into the details,
Ganglia and Nagios are more for monitoring the Hadoop cluster itself, and Flume is for log files.

Second, if the total network traffic from the sensors exceeds the capacity of one LAN port, how
can the load be shared? Is there a component in Hadoop that does this automatically?
Any suggestions are welcome. Thanks.
