while the data values you generated might be following a true random distribution, your row key, UUID, is not (because it is created on the same machines by the same software with a certain window of time)
For example, if you were using the UUID class in Java, these would be composed from several components (related to dimensions such as time and version), so you can not expect a random distribution over the whole space.
I'm observing very strange behavior. I have 3 node cluster with
ByteOrderPartitioner. (I run 1.1.5)
I created a key space with replication factor of 1.
Then I created one column family and populated it with random data.
I use UUID as a row key, and Integer as a column name.
Row keys were generated as
UUID uuid = UUID.randomUUID();
I populated about 100000 rows with 100 column each.
I would expect equal load on each node, but the result is totally
different. This is what nodetool gives me:
Address DC Rack Status State Load
127.0.0.1 datacenter1 rack1 Up Normal 27.61 MB
127.0.0.3 datacenter1 rack1 Up Normal 206.47 KB
127.0.0.2 datacenter1 rack1 Up Normal 13.86 MB
one node (127.0.0.3) is almost empty.
Any ideas what is wrong?