mahout-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "yumeng (JIRA)" <j...@apache.org>
Subject [jira] Created: (MAHOUT-583) Loss some data when create sequence files from directory
Date Thu, 13 Jan 2011 11:11:45 GMT
Loss some data when create sequence files from directory
--------------------------------------------------------

                 Key: MAHOUT-583
                 URL: https://issues.apache.org/jira/browse/MAHOUT-583
             Project: Mahout
          Issue Type: Bug
          Components: Utils
    Affects Versions: 0.5
         Environment: All situation
            Reporter: yumeng
            Priority: Minor
             Fix For: 0.5


Loss some data when create sequence files from directory. It will happen when we need more
than one output chunk file. It create chunk-0 twice. The first chunk-0 file is overwrite by
the second chunk-0 file. That's because the name of the second chunk file starts from 0 not
1.
For example, it creates files in the sequence, chunk-0, chunk-0, chunk-1, chunk-2, chunk-3,
chunk-*.  So we loss the first chunk-0 file if we create more than one chunk files.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message