flink-user mailing list archives

From Hilmi Yildirim <hilmi.yildi...@neofonie.de>
Subject Reading from HBase problem
Date Mon, 08 Jun 2015 13:04:47 GMT
I implemented a simple Flink batch job that reads from an HBase cluster 
of 13 machines holding nearly 100 million rows. The HBase version is 
1.0.0-cdh5.4.1, so I imported hbase-client 1.0.0-cdh5.4.1.
I implemented a flatMap that creates a tuple ("a", 1L) for each row. 
Then I call groupBy(0).sum(1).writeAsText. The result should be the 
number of rows, but it is not correct: I ran the job multiple 
times and the result fluctuates by ±5. I also ran the job on a smaller 
table with 100,000 rows, and there the result is correct.

Does anyone know the reason for that?
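
For reference, here is a minimal sketch of the counting logic described above. It is written with plain java.util.stream rather than the Flink DataSet API, so it runs without a cluster and only illustrates the expected result, not the actual job. The class name RowCountSketch, the method countRows, and the generated row keys are hypothetical stand-ins for the rows read from HBase.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

public class RowCountSketch {

    // Mirrors flatMap -> groupBy(0) -> sum(1): emit one ("a", 1L) pair per row,
    // group the pairs by their key, and sum the values within each group.
    static Map<String, Long> countRows(List<String> rowKeys) {
        return rowKeys.stream()
                .map(row -> new SimpleEntry<String, Long>("a", 1L))
                .collect(Collectors.groupingBy(
                        e -> e.getKey(),
                        Collectors.summingLong(e -> e.getValue())));
    }

    public static void main(String[] args) {
        // Hypothetical row keys standing in for the rows of the smaller table.
        List<String> rowKeys = IntStream.range(0, 100_000)
                .mapToObj(i -> "row-" + i)
                .collect(Collectors.toList());

        Map<String, Long> result = countRows(rowKeys);
        // Every row contributes exactly 1, so the sum under key "a"
        // should equal the number of input rows.
        System.out.println(result.getOrDefault("a", 0L));
    }
}
```

Because every row contributes exactly one pair with the same key, the sum under "a" must equal the number of rows that reached the flatMap. A total that changes between runs would therefore suggest that the set of rows being read differs between runs, rather than a problem in the aggregation itself.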

Best Regards,

Hilmi Yildirim
Software Developer R&D


Neofonie GmbH | Robert-Koch-Platz 4 | 10115 Berlin
Commercial register Berlin-Charlottenburg: HRB 67460
Managing Director: Thomas Kitlitschko
