Return-Path: X-Original-To: archive-asf-public-internal@cust-asf2.ponee.io Delivered-To: archive-asf-public-internal@cust-asf2.ponee.io Received: from cust-asf.ponee.io (cust-asf.ponee.io [163.172.22.183]) by cust-asf2.ponee.io (Postfix) with ESMTP id 62663200B13 for ; Wed, 1 Jun 2016 07:19:46 +0200 (CEST) Received: by cust-asf.ponee.io (Postfix) id 61374160A47; Wed, 1 Jun 2016 05:19:46 +0000 (UTC) Delivered-To: archive-asf-public@cust-asf.ponee.io Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by cust-asf.ponee.io (Postfix) with SMTP id AB308160A41 for ; Wed, 1 Jun 2016 07:19:45 +0200 (CEST) Received: (qmail 86839 invoked by uid 500); 1 Jun 2016 05:19:44 -0000 Mailing-List: contact dev-help@apex.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@apex.apache.org Delivered-To: mailing list dev@apex.apache.org Received: (qmail 86826 invoked by uid 99); 1 Jun 2016 05:19:44 -0000 Received: from pnap-us-west-generic-nat.apache.org (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Jun 2016 05:19:44 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id 3D9641A57C8 for ; Wed, 1 Jun 2016 05:19:44 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: -5.446 X-Spam-Level: X-Spam-Status: No, score=-5.446 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RCVD_IN_DNSWL_HI=-5, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, RP_MATCHES_RCVD=-1.426] autolearn=disabled Received: from mx1-lw-eu.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 8anuSwD_Z0FH for ; Wed, 1 Jun 2016 05:19:42 +0000 (UTC) Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by mx1-lw-eu.apache.org (ASF Mail Server at mx1-lw-eu.apache.org) with SMTP id 7DF475F296 for ; Wed, 1 Jun 2016 05:19:41 +0000 (UTC) Received: (qmail 86808 invoked by uid 99); 1 Jun 2016 05:19:40 -0000 Received: from git1-us-west.apache.org (HELO git1-us-west.apache.org) (140.211.11.23) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 01 Jun 2016 05:19:40 +0000 Received: by git1-us-west.apache.org (ASF Mail Server at git1-us-west.apache.org, from userid 33) id 9A0F9DFB74; Wed, 1 Jun 2016 05:19:40 +0000 (UTC) From: chaithu14 To: dev@apex.incubator.apache.org Reply-To: dev@apex.incubator.apache.org References: In-Reply-To: Subject: [GitHub] incubator-apex-malhar pull request: APEXMALHAR-2103 Fixed the scanner issue ... Content-Type: text/plain Message-Id: <20160601051940.9A0F9DFB74@git1-us-west.apache.org> Date: Wed, 1 Jun 2016 05:19:40 +0000 (UTC) archived-at: Wed, 01 Jun 2016 05:19:46 -0000 Github user chaithu14 commented on a diff in the pull request: https://github.com/apache/incubator-apex-malhar/pull/300#discussion_r65303185 --- Diff: library/src/main/java/com/datatorrent/lib/io/fs/FileSplitterInput.java --- @@ -375,11 +374,18 @@ public void run() lastScannedInfo = null; numDiscoveredPerIteration = 0; for (String afile : files) { - String filePath = new File(afile).getAbsolutePath(); - LOG.debug("Scan started for input {}", filePath); - Map lastModifiedTimesForInputDir; - lastModifiedTimesForInputDir = referenceTimes.get(filePath); - scan(new Path(afile), null, lastModifiedTimesForInputDir); + Path filePath = new Path(afile); + LOG.debug("Scan started for input {}", filePath.toString()); + Map lastModifiedTimesForInputDir = null; + if (fs.exists(filePath)) { + FileStatus fileStatus = fs.getFileStatus(filePath); + if (fileStatus.isDirectory()) { + lastModifiedTimesForInputDir = referenceTimes.get(fileStatus.getPath().toString()); + } else { + lastModifiedTimesForInputDir = referenceTimes.get(fileStatus.getPath().getParent().toString()); --- End diff -- @Priyanka: If we maintain 2 different keys, then fileSplitter will emit multiple filemetadata's for /home/myDir/file1.txt. Is it expected behavior. Please correct it, if I am wrong. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastructure@apache.org or file a JIRA ticket with INFRA. ---