hadoop-general mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From venkatesh kavuluri <venkatesh.kowl...@gmail.com>
Subject Re: Already subscribed to general@hadoop.apache.org
Date Wed, 05 Jan 2011 04:11:56 GMT
Hi All,

I am trying to override the TextOutputFormat class default key-value
separator ("\t") with a control character using the configuration parameter
below.

conf.set("mapred.textoutputformat.separator", "\u0008"); and also like
conf.set("mapred.textoutputformat.separator", "\b");

I get the error below,

[Fatal Error] :40:68: Character reference "&#8" is an invalid XML character.
10/12/22 14:39:13 FATAL conf.Configuration: error parsing conf file:
org.xml.sax.SAXParseException: Character reference "&#8" is an invalid XML
character.
Exception in thread "main" java.lang.RuntimeException:
org.xml.sax.SAXParseException: Character reference "&#8" is an invalid XML
character.
at
org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1168)
at
org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1040)
at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:980)
at org.apache.hadoop.conf.Configuration.get(Configuration.java:382)
at
org.apache.hadoop.mapred.JobConf.checkAndWarnDeprecation(JobConf.java:1662)
at org.apache.hadoop.mapred.JobConf.<init>(JobConf.java:215)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.<init>(LocalJobRunner.java:93)
at
org.apache.hadoop.mapred.LocalJobRunner.submitJob(LocalJobRunner.java:373)
at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:800)
at org.apache.hadoop.mapreduce.Job.submit(Job.java:432)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:447)
at com.ebay.ewa2.aggregation.BotVisits.run(BotVisits.java:181)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
at com.ebay.ewa2.aggregation.BotVisits.main(BotVisits.java:185)
Caused by: org.xml.sax.SAXParseException: Character reference "&#8" is an
invalid XML character.
at org.apache.xerces.parsers.DOMParser.parse(Unknown Source)
at org.apache.xerces.jaxp.DocumentBuilderImpl.parse(Unknown Source)
at javax.xml.parsers.DocumentBuilder.parse(Unknown Source)
at
org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1092)
... 14 more

Looks like the jobConf object is not being deserialized properly. Is there
any work around without needing to override the TextOutputFormat's
RecordWriter method. I am using Hadoop 0.20.2 API.

Thanks,
Venkatesh

Mime
  • Unnamed multipart/alternative (inline, None, 0 bytes)
View raw message