systemml-issues mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Deron Eriksson (JIRA)" <j...@apache.org>
Subject [jira] [Resolved] (SYSTEMML-153) Allow input data file without requiring corresponding metadata file
Date Fri, 16 Sep 2016 19:15:21 GMT

     [ https://issues.apache.org/jira/browse/SYSTEMML-153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Deron Eriksson resolved SYSTEMML-153.
-------------------------------------
       Resolution: Won't Fix
    Fix Version/s: Not Applicable

This issue was address by [PR66|https://github.com/apache/incubator-systemml/pull/66], which
I closed since no consensus could be reached.

As I menioned on the PR, "Since no consensus could be reached and the MLContext API allows
data input without what I would consider burdensome mandatory metadata requirements, I'll
close this PR."

> Allow input data file without requiring corresponding metadata file
> -------------------------------------------------------------------
>
>                 Key: SYSTEMML-153
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-153
>             Project: SystemML
>          Issue Type: Improvement
>            Reporter: Deron Eriksson
>             Fix For: Not Applicable
>
>
> Right now a metadata file is required for an input data file. For example, a matrix.csv
file would typically require a matrix.csv.mtd file. Creating a .mtd manually is a minor annoyance
in terms of consumability of SystemML. It would be nice if there were some mechanism so that
a metadata file does not need to be provided in all cases.
> One possibility is that if no metadata file is present, SystemML could assume a particular
default format (for example, a comma-separated delimited file). The number of rows and columns
could be determined by parsing the file. This might work well for small files but not necessarily
well for enormous files.
> A possible way to solve this would be to use a file extension to indicate that you have
a small input data file and you don't want to have to provide a metadata file. For example,
you could have a matrix.csv-nomtd file. The .csv part of the extension indicates that it's
a csv file, and the -nomtd part of the extension indicates that you don't want to provide
metadata.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Mime
View raw message