hadoop-general mailing list archives

From Harshit Kumar <hkumar.ar...@gmail.com>
Subject java.io.IOException: Cannot open filename /user/root/�s�t�e�p�1�/�p�a�r�t�-�0�0�0�0�0
Date Thu, 04 Feb 2010 08:25:42 GMT

I don't understand the reason for this error.

java.io.IOException: Cannot open filename /user/root/�s�t�e�p�1�/�p�a�r�t�-�0�0�0�0�0
    at org.apache.hadoop.hdfs.DFSClient$DFSInputStream.<init>(DFSClient.java:1385)
    at org.apache.hadoop.hdfs.DFSClient.open(DFSClient.java:338)
    at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:359)
    at org.bike.MakeNPairReduce.reduce(MakeNPairReduce.java:40)
    at org.bike.MakeNPairReduce.reduce(MakeNPairReduce.java:1)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:436)

I have code that scans a folder, step0, to find the names of the files
generated in the previous map-reduce phase. It then creates another file with
one entry per file found. For example, if the scan finds that the first
map-reduce phase produced 2 files, the newly created file will have 2 entries,
step1/part00000 and step1/part00001, i.e. one entry for each file.

Now, when I read this file in another map-reduce job, each line is read as
/user/root/�s�t�e�p�1�/�p�a�r�t�-�0�0�0�0�0 . It seems that a string
written by my code, when read back through FSDataInputStream, has each of its
characters prefixed by a question mark (?). Why is that so?
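
The symptom above is consistent with the entries having been written as
two-byte characters and then read back one byte at a time. Below is a minimal
sketch of that possibility. It assumes the list file was written with
DataOutputStream.writeChars (FSDataOutputStream extends DataOutputStream),
which the code in question does not confirm. The class and method names are
hypothetical, and plain java.io streams stand in for the HDFS streams.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; not part of the original job.
class NulPrefixDemo {

    // Assumed writer: writeChars emits every char as two bytes, high byte
    // first, so each ASCII character is preceded by a 0x00 byte.
    static byte[] writeWithWriteChars(String entry) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buf)) {
            out.writeChars(entry + "\n");
        }
        return buf.toByteArray();
    }

    // Alternative writer: one byte per ASCII character.
    static byte[] writeAsUtf8(String entry) {
        return (entry + "\n").getBytes(StandardCharsets.UTF_8);
    }

    // Reader that decodes one byte per character, similar in effect to a
    // byte-oriented line read.
    static String readFirstLine(byte[] data) throws IOException {
        try (BufferedReader in = new BufferedReader(new InputStreamReader(
                new ByteArrayInputStream(data), StandardCharsets.ISO_8859_1))) {
            return in.readLine();
        }
    }

    public static void main(String[] args) throws IOException {
        String entry = "step1/part-00000";
        String broken = readFirstLine(writeWithWriteChars(entry));
        String clean = readFirstLine(writeAsUtf8(entry));
        // Show the stray NUL characters as '?' to mirror the symptom.
        System.out.println(broken.replace('\0', '?'));
        System.out.println(clean);
    }
}
```

If this is indeed the cause, writing the entries as UTF-8 bytes (or reading
them back with readChar) would keep the writer and the reader consistent.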

The file part-00000 does exist inside the folder step1, but opening it by this
name, /user/root/�s�t�e�p�1�/�p�a�r�t�-�0�0�0�0�0 , throws the
exception above. I can understand that, since there is no file with that name,
but why are these ?'s inserted before each letter?

I would really appreciate it if someone could help me solve this riddle.

Thanks and Regards
H. Kumar
skype: harshit900
Blog: http://harshitkumar.wordpress.com
Website: http://kumarharmuscat.tripod.com
