systemml-issues mailing list archives

From "Glenn Weidner (JIRA)" <j...@apache.org>
Subject [jira] [Commented] (SYSTEMML-1474) Index out of bounds error in test_naive_bayes1 of test_mllearn_numpy.py
Date Fri, 07 Apr 2017 18:26:41 GMT

    [ https://issues.apache.org/jira/browse/SYSTEMML-1474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15961231#comment-15961231 ]

Glenn Weidner commented on SYSTEMML-1474:
-----------------------------------------

Note: this appears to be a regression, since the same test runs successfully on the same local
system using the latest release, systemml-0.13.0-incubating-python.tgz.

> Index out of bounds error in test_naive_bayes1 of test_mllearn_numpy.py
> -----------------------------------------------------------------------
>
>                 Key: SYSTEMML-1474
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1474
>             Project: SystemML
>          Issue Type: Bug
>            Reporter: Glenn Weidner
>            Priority: Minor
>
> The following error was observed while running the Python tests from the command line with spark-submit:
> {code}
> ======================================================================
> ERROR: test_naive_bayes1 (__main__.TestMLLearn)
> ----------------------------------------------------------------------
> Traceback (most recent call last):
>   File "/home/spark/test_mllearn_numpy.py", line 184, in test_naive_bayes1
>     mllearn_predicted = nb.fit(vectors, newsgroups_train.target).predict(vectors_test)
>   File "/usr/lib/python2.7/site-packages/systemml/mllearn/estimators.py", line 142, in fit
>     self.fit_numpy(X, y)
>   File "/usr/lib/python2.7/site-packages/systemml/mllearn/estimators.py", line 95, in fit_numpy
>     self._fit_numpy()
>   File "/usr/lib/python2.7/site-packages/systemml/mllearn/estimators.py", line 88, in _fit_numpy
>     self.model = self.estimator.fit(convertToMatrixBlock(self.sc, self.X), y_mb)
>   File "/usr/lib/python2.7/site-packages/systemml/converters.py", line 106, in convertToMatrixBlock
>     [ _copyRowBlock(i, sc, ret, src, numRowsPerBlock,  rlen, clen) for i in range(0, src.shape[0], numRowsPerBlock) ]
>   File "/usr/lib/python2.7/site-packages/systemml/converters.py", line 83, in _copyRowBlock
>     mb = _convertSPMatrixToMB(sc, src[i:i+numRowsPerBlock,]) if isinstance(src, spmatrix) else _convertDenseMatrixToMB(sc, src[i:i+numRowsPerBlock,])
>   File "/usr/lib64/python2.7/site-packages/scipy/sparse/csr.py", line 304, in __getitem__
>     return self._get_submatrix(row, col)
>   File "/usr/lib64/python2.7/site-packages/scipy/sparse/csr.py", line 447, in _get_submatrix
>     check_bounds(i0, i1, M)
>   File "/usr/lib64/python2.7/site-packages/scipy/sparse/csr.py", line 443, in check_bounds
>     " %d <= %d" % (i0, num, i1, num, i0, i1))
> IndexError: index out of bounds: 0 <= 2030 <= 2034, 0 <= 2059 <= 2034, 2030 <= 2059
> {code}
> The IndexError was first observed when running the test in a Notebook cloud environment with Spark 2.0.2, then reproduced at the command line on a local system with Spark 2.1.
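
For reference, the bounds in the IndexError suggest that the last row block is sliced past the end of the matrix: the slice runs from row 2030 to 2059, but the matrix has only 2034 rows. Below is a minimal sketch of clamping the block end, assuming that is indeed the cause. Note that row_block_bounds is a hypothetical helper for illustration, not part of systemml, and the block size of 29 is only inferred from 2059 - 2030.

```python
def row_block_bounds(num_rows, num_rows_per_block):
    """Return (start, stop) row ranges that never run past num_rows.

    Hypothetical helper for illustration; not part of systemml.
    """
    if num_rows_per_block <= 0:
        raise ValueError("num_rows_per_block must be positive")
    bounds = []
    for start in range(0, num_rows, num_rows_per_block):
        # Clamp the end so the last, shorter block stops at the matrix edge.
        stop = min(start + num_rows_per_block, num_rows)
        bounds.append((start, stop))
    return bounds


# Values from the traceback: 2034 rows; block size 29 is inferred (2059 - 2030).
bounds = row_block_bounds(2034, 29)
print(bounds[-1])  # (2030, 2034)
```

Each (start, stop) pair could then be used to slice the source as src[start:stop, :], so the final block covers only the remaining 4 rows. Whether this matches the eventual fix in converters.py is not something this report confirms.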


