hadoop-common-dev mailing list archives

From "Runping Qi (JIRA)" <j...@apache.org>
Subject [jira] Updated: (HADOOP-2052) distcp mapper's status report misleading
Date Sat, 13 Oct 2007 17:51:50 GMT

     [ https://issues.apache.org/jira/browse/HADOOP-2052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Runping Qi updated HADOOP-2052:
-------------------------------

    Description: 
When the mappers of distcp finish, the status page in the web GUI reports the amount of data copied.
However, the reported number is far from the actual size, which is very misleading.
For example, a particular mapper, task_200710131713_0001_m_000000_0, reported:

Finished. Bytes copied: 4.3g

However, it does not say which file was copied.
I assumed it was part-00000, but the size of part-00000
is about 9GB.

It would be much clearer if the status report said something like:

Finished copy file-xxxx: 4.3g

That way, I could easily check whether the size is correct.
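
The status line suggested above could be built roughly as follows. This is only an illustrative Python sketch, not distcp's actual Java code: the helper names `human_size` and `finished_status` are hypothetical, and the one-letter suffixes with 1024-based units are an assumption made to match the "4.3g" style shown in the report.

```python
def human_size(num_bytes: int) -> str:
    """Format a byte count with a one-letter suffix (assumed 1024-based)."""
    units = ["", "k", "m", "g", "t"]
    value = float(num_bytes)
    for unit in units:
        # Stop at the first unit where the value fits, or at the largest unit.
        if value < 1024 or unit == units[-1]:
            return f"{value:.1f}{unit}" if unit else str(int(value))
        value /= 1024


def finished_status(file_name: str, bytes_copied: int) -> str:
    """Build a status line that names the copied file next to its size."""
    return f"Finished copy {file_name}: {human_size(bytes_copied)}"


if __name__ == "__main__":
    # Hypothetical example: a 9 GiB file reported together with its name.
    print(finished_status("part-00000", 9 * 1024**3))
```

With the file name in the message, the reported size can be compared directly against the size of that file on the file system.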

 



  was:
When the mappers of distcp finish, the status page in the web GUI reports the amount of data copied.
However, the reported number is far from the actual size, which is very misleading.
For example, a particular mapper reported:

Finished. Bytes copied: 4.3g

However, the actual file size is about 9GB.




> distcp mapper's status report misleading
> ----------------------------------------
>
>                 Key: HADOOP-2052
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2052
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>            Reporter: Runping Qi
>
> When the mappers of distcp finish, the status page in the web GUI reports the amount of data copied.
> However, the reported number is far from the actual size, which is very misleading.
> For example, a particular mapper, task_200710131713_0001_m_000000_0, reported:
> Finished. Bytes copied: 4.3g
> However, it does not say which file was copied.
> I assumed it was part-00000, but the size of part-00000
> is about 9GB.
> It would be much clearer if the status report said something like:
> Finished copy file-xxxx: 4.3g
> That way, I could easily check whether the size is correct.
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

