mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Suneel Marthi <suneel_mar...@yahoo.com>
Subject Re: Using Mahout to cluster a large CSV file
Date Fri, 31 Jan 2014 13:17:23 GMT
Use Mahout's CSVVectorIterator.java to read ur input CSV file and generate vectors.

You pass in a java.io.Reader to your CSV file and it generates Dense Vectors (from CSV).

U could then feed the generated vectors into KMeans clustering.




On Friday, January 31, 2014 7:55 AM, "Allen, Ronald L." <allenrl1@ornl.gov> wrote:
 
Hi all,

Has anyone had any success using Mahout kmeans to cluster a data in a single large CSV file? 
If so, how did you do it?

Thanks,
Ronnie
Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message