hadoop-common-user mailing list archives

From Abhinav M Kulkarni <abhinavkulka...@gmail.com>
Subject Unwanted characters in output
Date Wed, 25 Jul 2012 01:00:14 GMT

I wrote a simple program to gather some statistics about bigrams in some 
I print statistics to a custom file.

Path file = new Path(context.getConfiguration().get("mapred.output.dir") 
+ "/bigram.txt");
FSDataOutputStream out = 

My code has the following lines:

Text.writeString(out, "total number of unique bigrams: " + 
uniqBigramCount + "\n");
Text.writeString(out, "total number of bigrams: " + totalBigramCount + 
"\n");
Text.writeString(out, "number of bigrams that appear only once: " + 
onceBigramCount + "\n");

I get the following output:

'total number of unique bigrams: 424462
!total number of bigrams: 1578220
0number of bigrams that appear only once: 296139

Apart from unwanted characters at the beginning of the lines, there are 
some non-printing characters too. What could be the reason behind this?
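
A likely explanation, sketched here without Hadoop on the classpath: Text.writeString is meant for Writable serialization, so it writes the UTF-8 byte length of the string (as a variable-length integer, a single byte for lengths up to 127) before the string itself. The three leading characters match the line lengths exactly: ' is 0x27 = 39, ! is 0x21 = 33 and 0 is 0x30 = 48. The class and method names below are hypothetical and only imitate that framing for short strings; they are not Hadoop APIs.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

public class LengthPrefixDemo {

    // Hypothetical helper: imitates the framing described above for short
    // strings, i.e. one length byte followed by the UTF-8 bytes.
    static byte[] writeWithLengthPrefix(String s) {
        byte[] utf8 = s.getBytes(StandardCharsets.UTF_8);
        if (utf8.length > 127) {
            throw new IllegalArgumentException("sketch only covers single-byte lengths");
        }
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        buf.write(utf8.length);           // the "unwanted character"
        buf.write(utf8, 0, utf8.length);  // the text itself
        return buf.toByteArray();
    }

    // Hypothetical helper: writes only the UTF-8 bytes, with no length prefix.
    static byte[] writePlain(String s) {
        byte[] utf8 = s.getBytes(StandardCharsets.UTF_8);
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        buf.write(utf8, 0, utf8.length);
        return buf.toByteArray();
    }

    public static void main(String[] args) {
        String line = "total number of unique bigrams: " + 424462 + "\n";

        // Prints: 'total number of unique bigrams: 424462
        System.out.print(new String(writeWithLengthPrefix(line), StandardCharsets.UTF_8));

        // Prints: total number of unique bigrams: 424462
        System.out.print(new String(writePlain(line), StandardCharsets.UTF_8));
    }
}
```

If this is indeed the cause, writing the raw bytes to the stream, for example out.write(line.getBytes(StandardCharsets.UTF_8)), or wrapping the stream in a PrintWriter, should give plain text, since FSDataOutputStream is a java.io.DataOutputStream.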

