Return-Path: Delivered-To: apmail-hadoop-hive-dev-archive@minotaur.apache.org Received: (qmail 29788 invoked from network); 11 Dec 2009 03:05:42 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 11 Dec 2009 03:05:42 -0000 Received: (qmail 43359 invoked by uid 500); 11 Dec 2009 03:05:42 -0000 Delivered-To: apmail-hadoop-hive-dev-archive@hadoop.apache.org Received: (qmail 43314 invoked by uid 500); 11 Dec 2009 03:05:40 -0000 Mailing-List: contact hive-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hive-dev@hadoop.apache.org Delivered-To: mailing list hive-dev@hadoop.apache.org Received: (qmail 43304 invoked by uid 99); 11 Dec 2009 03:05:40 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Dec 2009 03:05:40 +0000 X-ASF-Spam-Status: No, hits=-10.5 required=5.0 tests=AWL,BAYES_00,RCVD_IN_DNSWL_HI X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 11 Dec 2009 03:05:38 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 2277E234C04C for ; Thu, 10 Dec 2009 19:05:18 -0800 (PST) Message-ID: <913230820.1260500718118.JavaMail.jira@brutus> Date: Fri, 11 Dec 2009 03:05:18 +0000 (UTC) From: "Paul Yang (JIRA)" To: hive-dev@hadoop.apache.org Subject: [jira] Commented: (HIVE-976) test outputs should compare the file/directory for sampling In-Reply-To: <1078576415.1260386658071.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-JIRA-FingerPrint: 30527f35849b9dde25b450d4833f0394 [ https://issues.apache.org/jira/browse/HIVE-976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789088#action_12789088 ] Paul Yang commented on HIVE-976: -------------------------------- I'm looking at sample7.q.out, which has the lines that are quoted in the description. As I understand it, the test should check lines of the form 'file:/[directory]/[file]' and verify that the file part (or the last token) is the same as those in the reference test files. But aren't there lines in the test output that begin with 'file:/' that should be ignored completely? Anyway, assuming that we can look at just the last token of lines that begin with 'file:/', it doesn't seem like there is a way to use only diff to handle this case. The man page for diff does not describe any useful options. I have two ideas 1. Instead of using diff directly, write a script that uses diff + some additional logic to detect this condition. Probably will be a little slower. 2. in mapredWork, add an additional function that would display the last token of the paths. Sort of an ugly idea, but would work with diff in the current form. > test outputs should compare the file/directory for sampling > ------------------------------------------------------------ > > Key: HIVE-976 > URL: https://issues.apache.org/jira/browse/HIVE-976 > Project: Hadoop Hive > Issue Type: Bug > Components: Testing Infrastructure > Reporter: Namit Jain > Assignee: Paul Yang > Fix For: 0.5.0 > > > Currently, all lines starting with file: are ignored. > It means that > file:/Users/heyongqiang/Documents/workspace/Hive-Test/build/ql/test/data/warehouse/srcbucket/srcbucket0.txt [s] > and > file:/Users/heyongqiang/Documents/workspace/Hive-Test/build/ql/test/data/warehouse/srcbucket [s] > are same - that is not good because it will hide some of the optimizations of sampling. > This should be changed to compare the last token. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.