flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Lydia Ickler <ickle...@googlemail.com>
Subject cluster execution
Date Thu, 28 Jan 2016 08:53:22 GMT
Hi all,

I am doing some operations on a DataSet<Tuple3<Integer,Integer,Double>> … (see
code below)
When I run my program on a cluster with 3 machines I can see within the web client that only
my master is executing the program. 
Do I have to specify somewhere that all machines have to participate? Usually the cluster
executes in parallel.

Any suggestions?

Best regards, 
Lydia
DataSet<Tuple3<Integer, Integer, Double>> matrixA = readMatrix(env, input);
DataSet<Tuple3<Integer, Integer, Double>> initial = matrixA.groupBy(0).sum(2);

//normalize by maximum value
initial = initial.cross(initial.max(2)).map(new normalizeByMax());
matrixA.join(initial).where(1).equalTo(0)
      .map(new ProjectJoinResultMapper()).groupBy(0, 1).sum(2);


Mime
View raw message