Return-Path: Delivered-To: apmail-lucene-hadoop-dev-archive@locus.apache.org Received: (qmail 26436 invoked from network); 11 Nov 2006 01:52:06 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.2) by minotaur.apache.org with SMTP; 11 Nov 2006 01:52:06 -0000 Received: (qmail 23406 invoked by uid 500); 11 Nov 2006 01:52:12 -0000 Delivered-To: apmail-lucene-hadoop-dev-archive@lucene.apache.org Received: (qmail 23370 invoked by uid 500); 11 Nov 2006 01:52:12 -0000 Mailing-List: contact hadoop-dev-help@lucene.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: hadoop-dev@lucene.apache.org Delivered-To: mailing list hadoop-dev@lucene.apache.org Received: (qmail 23361 invoked by uid 99); 11 Nov 2006 01:52:11 -0000 Received: from herse.apache.org (HELO herse.apache.org) (140.211.11.133) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Nov 2006 17:52:11 -0800 X-ASF-Spam-Status: No, hits=0.0 required=10.0 tests= X-Spam-Check-By: apache.org Received: from [140.211.11.4] (HELO brutus.apache.org) (140.211.11.4) by apache.org (qpsmtpd/0.29) with ESMTP; Fri, 10 Nov 2006 17:51:58 -0800 Received: from brutus (localhost [127.0.0.1]) by brutus.apache.org (Postfix) with ESMTP id 75902714317 for ; Fri, 10 Nov 2006 17:51:38 -0800 (PST) Message-ID: <16001731.1163209898478.JavaMail.jira@brutus> Date: Fri, 10 Nov 2006 17:51:38 -0800 (PST) From: "dhruba borthakur (JIRA)" To: hadoop-dev@lucene.apache.org Subject: [jira] Commented: (HADOOP-709) streaming job with Control characters in the command causes runtime exception in the job tracker In-Reply-To: <22924647.1163198079083.JavaMail.jira@brutus> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-Virus-Checked: Checked by ClamAV on apache.org [ http://issues.apache.org/jira/browse/HADOOP-709?page=comments#action_12448922 ] dhruba borthakur commented on HADOOP-709: ----------------------------------------- Another sun issue: http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6371411 Our streaming code creates an XML file for the job. This file as an entry of the form: stream.map.streamprocessor/bin/sort -t'' One option would be that user-input be encoded in Base64 before being XML-ised. This is necessary to support non-ascii user-input. > streaming job with Control characters in the command causes runtime exception in the job tracker > ------------------------------------------------------------------------------------------------ > > Key: HADOOP-709 > URL: http://issues.apache.org/jira/browse/HADOOP-709 > Project: Hadoop > Issue Type: Bug > Components: contrib/streaming > Reporter: dhruba borthakur > Assigned To: dhruba borthakur > > I run a streaming job with a Ctrl-A character in the command line option as follows: > $HSTREAMING -mapper "/bin/sort -t'^A'" -input "input/*" -output output > It causes a exeception as listed below. If i run the same command without the Ctrl-A character the command completes successfully. > Exception in thread "main" org.apache.hadoop.ipc.RemoteException: java.io.IOException: java.lang.RuntimeException: org.xml.sax.SAXParseException: Character reference "" is an invalid XML character. > at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:551) > at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:472) > at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:453) > at org.apache.hadoop.conf.Configuration.get(Configuration.java:201) > at org.apache.hadoop.mapred.JobConf.getUser(JobConf.java:175) > at org.apache.hadoop.mapred.JobInProgress.(JobInProgress.java:88) > at org.apache.hadoop.mapred.JobTracker.submitJob(JobTracker.java:1016) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) > at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > at java.lang.reflect.Method.invoke(Method.java:585) > at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:337) > at org.apache.hadoop.ipc.Server$Handler.run(Server.java:514) > at org.apache.hadoop.ipc.Client$Connection.run(Client.java:248) -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira