spark-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Minudika Malshan <>
Subject Get data from CSV files to feed SparkML library methods
Date Wed, 10 Aug 2016 11:16:10 GMT
Hi all,

I'm using spark ml library and need to train a model using data extracted
from a CSV file.
I found that we can load datasets from LibSVM files to spark ML methods.
As far as i understood, the data should be represented as labeled points
in-order to feed the ml methods.
Is there a way to load dataset from a CSV file instead of a LibSVM file?
Or do I need to convert the CSV file to LibSVM format? If so, could you
please let me know a way to do that.?
Your help would be much appreciated.

Thank you!

View raw message