hadoop-common-dev mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Matei Zaharia (JIRA)" <j...@apache.org>
Subject [jira] Commented: (HADOOP-3937) Job history may get disabled due to overly long job names
Date Wed, 13 Aug 2008 17:44:44 GMT

    [ https://issues.apache.org/jira/browse/HADOOP-3937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12622293#action_12622293
] 

Matei Zaharia commented on HADOOP-3937:
---------------------------------------

If the history start time has to go, then the best thing to do is to ensure that JobID's are
globally unique rather than hoping that the (jobid, job name, user) combination will be unique.
First of all, are JobID's actually unique right now? My impression was that they weren't -
they include jobtracker start time and job number, but two trackers could give the same JobID.
However, if we keep the machine name in there, then the (machine, jobid) combination could
be unique. If you want to make sure JobID's are unique however, just use a GUID generator,
such as http://java.sun.com/j2se/1.5.0/docs/api/java/util/UUID.html#randomUUID().

> Job history may get disabled due to overly long job names
> ---------------------------------------------------------
>
>                 Key: HADOOP-3937
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3937
>             Project: Hadoop Core
>          Issue Type: Bug
>    Affects Versions: 0.17.0, 0.17.1, 0.18.0, 0.19.0
>            Reporter: Matei Zaharia
>         Attachments: HADOOP-3937.patch
>
>
> Since Hadoop 0.17, the job history logs include the job's name in the filename. However,
this can lead to overly long filenames, because job names may be arbitrarily long. When a
filename is too long for the underlying OS, file creation fails and the JobHistory class silently
disables history from that point on. This can lead to days of lost history until somebody
notices the error in the log.
> Proposed solution: Trim the job name to a reasonable length when selecting a filename
for the history file.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message