Date: Wed, 28 May 2014 16:04:02 +0000 (UTC)
From: "jay vyas (JIRA)"
To: mapreduce-issues@hadoop.apache.org
Reply-To: mapreduce-issues@hadoop.apache.org
Mailing-List: contact mapreduce-issues-help@hadoop.apache.org; run by ezmlm
Subject: [jira] [Commented] (MAPREDUCE-5902) JobHistoryServer (HistoryFileManager) needs more debug logs, fails to pick up jobs with % characters in the name.

    [ https://issues.apache.org/jira/browse/MAPREDUCE-5902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011240#comment-14011240 ]

jay vyas commented on MAPREDUCE-5902:
-------------------------------------

Sure, I can try those. In general, what is the contract for a Hadoop file system: should it support any character in a file name? Are there certain escape sequences that have a particular meaning?

> JobHistoryServer (HistoryFileManager) needs more debug logs, fails to pick up jobs with % characters in the name.
> -----------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-5902
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5902
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: jobhistoryserver
>            Reporter: jay vyas
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> 1) JobHistoryServer sometimes skips over certain history files and does not serve them as completed.
> 2) In addition to skipping these files, the JobHistoryServer doesn't effectively log which files are being skipped, and why.
> So, in addition to determining why certain types of files are skipped (file name length doesn't appear to be the reason; rather, it appears that % characters throw the JobHistoryServer filter off), we should log completed .jhist files that are available in the mr-history/tmp directory yet are skipped for some reason.
> *Regarding the actual bug: Skipping completed jhist files*
> We will need an author of the JobHistoryServer, I think, to chime in on what types of paths for jobs are actually valid. It appears that at least some characters, if present in a job name, will make the JobHistoryServer fail to recognize a completed jhist file.
> *Regarding logging*
> It would be extremely useful, then, to have a couple of guarded logs at this level of the code, so that we can see, in the log folders, why files are being filtered out, i.e. whether it is due to filtering or visibility.
> {noformat}
>   private static List<FileStatus> scanDirectory(Path path, FileContext fc,
>       PathFilter pathFilter) throws IOException {
>     path = fc.makeQualified(path);
>     List<FileStatus> jhStatusList = new ArrayList<FileStatus>();
>     RemoteIterator<FileStatus> fileStatusIter = fc.listStatus(path);
>     while (fileStatusIter.hasNext()) {
>       FileStatus fileStatus = fileStatusIter.next();
>       Path filePath = fileStatus.getPath();
>       if (fileStatus.isFile() && pathFilter.accept(filePath)) {
>         jhStatusList.add(fileStatus);
>       }
>     }
>     return jhStatusList;
>   }
> {noformat}
> *Reproducing*
> I was able to reproduce this bug by writing a custom MapReduce job whose job name contained % characters. I have also seen this with a version of the Mahout ParallelALSFactorizationJob, which includes "-" characters in its name; these wind up getting replaced by "%2D" at some later stage in the job pipeline.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
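
The guarded logging requested in the issue description might look roughly like the sketch below. It is not the HistoryFileManager code: to stay self-contained it assumes plain file-name strings in place of FileStatus entries and a java.util.function.Predicate in place of Hadoop's PathFilter, and the names ScanLoggingSketch, scan, and skipped are hypothetical. It only illustrates recording and debug-logging each entry the filter rejects, so that a skipped .jhist file leaves a trace.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Predicate;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical sketch; not part of Hadoop.
final class ScanLoggingSketch {
  private static final Logger LOG =
      Logger.getLogger(ScanLoggingSketch.class.getName());

  // Stand-in for scanDirectory: 'names' plays the role of the directory
  // listing and 'filter' the role of the PathFilter (both are assumptions).
  // Rejected names are collected in 'skipped' and logged at FINE level.
  static List<String> scan(List<String> names, Predicate<String> filter,
      List<String> skipped) {
    List<String> accepted = new ArrayList<>();
    for (String name : names) {
      if (filter.test(name)) {
        accepted.add(name);
      } else {
        skipped.add(name);
        // Guard so the message string is only built when debug is enabled.
        if (LOG.isLoggable(Level.FINE)) {
          LOG.fine("Skipping " + name + ": rejected by filter");
        }
      }
    }
    return accepted;
  }

  public static void main(String[] args) {
    List<String> skipped = new ArrayList<>();
    List<String> accepted = scan(
        Arrays.asList("job_1.jhist", "job_2%2D.tmp"),
        n -> n.endsWith(".jhist"),
        skipped);
    System.out.println("accepted=" + accepted + " skipped=" + skipped);
  }
}
```

In the real method the same pattern would presumably wrap the existing `fileStatus.isFile() && pathFilter.accept(filePath)` check, logging separately for the two conditions so the reason for a skip is visible.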