flink-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Robert Metzger <rmetz...@apache.org>
Subject Re: about hdfs in flink cluster
Date Thu, 16 Oct 2014 10:17:51 GMT

good to hear that you are using Flink.

HDFS is a distributed file system for reliably storing huge amounts of
data. Many users of Flink use HDFS to store all kinds of data on it.
This includes both input data for their jobs and also results.
Often, Flink and HDFS are installed next to each other in a cluster so that
the same machines that contain the data also process it.

An example for you could be that you store the data from MySQL in HDFS. Or
you could join data from MySQL with data in HDFS.

Let us know if you have more questions. We are happy to help!

On Thu, Oct 16, 2014 at 9:56 AM, Márton Balassi <balassi.marton@gmail.com>

> Flink does not require HDFS if you decide not to use it. The version you
> are currently using should fit you then. You might change it if you need/do
> not need YARN support.
> On Thu, Oct 16, 2014 at 4:27 AM, 杨洪波 <hongbosoftware@163.com> wrote:
> > hey all,
> > i am new in flink and i have a question:  i use flink cluster to read
> > mysql datasource,my biz don't need hdfs, do i must set HDFS in my flink
> > cluster? if yes,then for what flink use it ?
> > thanks!

  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message