mahout-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Bertrand Dechoux <decho...@gmail.com>
Subject Re: Using Mahout to cluster a large CSV file
Date Fri, 31 Jan 2014 13:28:51 GMT
I guess the big (no pun intended) question is what is your definition of a
large CSV.

Bertrand


On Fri, Jan 31, 2014 at 2:17 PM, Suneel Marthi <suneel_marthi@yahoo.com>wrote:

> Use Mahout's CSVVectorIterator.java to read ur input CSV file and generate
> vectors.
>
> You pass in a java.io.Reader to your CSV file and it generates Dense
> Vectors (from CSV).
>
> U could then feed the generated vectors into KMeans clustering.
>
>
>
>
> On Friday, January 31, 2014 7:55 AM, "Allen, Ronald L." <allenrl1@ornl.gov>
> wrote:
>
> Hi all,
>
> Has anyone had any success using Mahout kmeans to cluster a data in a
> single large CSV file?  If so, how did you do it?
>
> Thanks,
> Ronnie
>

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message