Return-Path: Delivered-To: apmail-hadoop-core-dev-archive@www.apache.org Received: (qmail 96921 invoked from network); 3 Mar 2009 01:38:28 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 3 Mar 2009 01:38:28 -0000 Received: (qmail 61150 invoked by uid 500); 3 Mar 2009 01:38:24 -0000 Delivered-To: apmail-hadoop-core-dev-archive@hadoop.apache.org Received: (qmail 61118 invoked by uid 500); 3 Mar 2009 01:38:24 -0000 Mailing-List: contact core-dev-help@hadoop.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: core-dev@hadoop.apache.org Delivered-To: mailing list core-dev@hadoop.apache.org Received: (qmail 61107 invoked by uid 99); 3 Mar 2009 01:38:24 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Mon, 02 Mar 2009 17:38:24 -0800 X-ASF-Spam-Status: No, hits=-2000.0 required=10.0 tests=ALL_TRUSTED X-Spam-Check-By: apache.org Received: from [140.211.11.140] (HELO brutus.apache.org) (140.211.11.140) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 03 Mar 2009 01:38:19 +0000 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id C406B234C4B4 for ; Mon, 2 Mar 2009 17:37:58 -0800 (PST) Message-ID: <537944668.1236044278801.JavaMail.jira@brutus> Date: Mon, 2 Mar 2009 17:37:58 -0800 (PST) From: "Eric Yang (JIRA)" To: core-dev@hadoop.apache.org Subject: [jira] Commented: (HADOOP-5087) Regex for Cmd parsing contains an error In-Reply-To: <1007051468.1232495879528.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ https://issues.apache.org/jira/browse/HADOOP-5087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12678190#action_12678190 ] Eric Yang commented on HADOOP-5087: ----------------------------------- +1 on fixftaregex.patch > Regex for Cmd parsing contains an error > --------------------------------------- > > Key: HADOOP-5087 > URL: https://issues.apache.org/jira/browse/HADOOP-5087 > Project: Hadoop Core > Issue Type: Bug > Components: contrib/chukwa > Environment: HADOOP-4947 use regex to parse chukwa commands but there's an error in the regex > the current regex is: > Pattern addCmdPattern = Pattern.compile("[aA][dD][dD]\\s+(\\S+)\\s+(\\S+)\\s+(.*\\S)?\\s*(\\d+)\\s*"); > does not correctly parsed this valid checkpoint entry: > "ADD org.apache.hadoop.chukwa.datacollection.adaptor.filetailer.CharFileTailingAdaptorUTF8NewLineEscaped Syslog 0 /var/log/messages 114027" > Parsing result: > adaptorName org.apache.hadoop.chukwa.datacollection.adaptor.filetailer.CharFileTailingAdaptorUTF8NewLineEscaped > dataType Syslog > params 0 /var/log/messages 11402 > offset 7 > Instead of: > adaptorName org.apache.hadoop.chukwa.datacollection.adaptor.filetailer.CharFileTailingAdaptorUTF8NewLineEscaped > dataType Syslog > params 0 /var/log/messages > offset 114027 > The correct regex is: "[aA][dD][dD]\\s+(\\S+)\\s+(\\S+)\\s+(.*\\s)?\\s*(\\d+)\\s*" > Example of parsing: "ADD org.apache.hadoop.chukwa.datacollection.adaptor.MySpecificAdaptor Syslog 0 my param1 param2 /var/log/messages 114027"; > Parsing result: > adaptorName org.apache.hadoop.chukwa.datacollection.adaptor.MySpecificAdaptor > dataType Syslog > params 0 my param1 param2 /var/log/messages > offset 114027 > Reporter: Jerome Boulon > Assignee: Ari Rabkin > Attachments: fixedregex.patch, fixftaregex.patch, HADOOP-5087-2.patch, HADOOP-5087.patch, reluctantregex.patch > > -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.