hadoop-common-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Varsha Raveendran <varsha.raveend...@gmail.com>
Subject Re: Word count on cluster configuration
Date Mon, 01 Apr 2013 09:41:14 GMT
Thanks!


On Mon, Apr 1, 2013 at 12:23 PM, Wenming Ye <yew.boulder@hotmail.com> wrote:

>   because many of the “words” are unicode, check the next blog.
>
> http://blogs.msdn.com/b/hpctrekker/archive/2013/04/01/make-another-small-step-with-the-javascript-console-pig-in-hdinsight.aspx
>
>  *From:* Varsha Raveendran <varsha.raveendran@gmail.com>
> *Sent:* Sunday, March 31, 2013 11:43 PM
> *To:* user@hadoop.apache.org
> *Subject:* Word count on cluster configuration
>
>   Hello!
>
> I did the setup for a cluster configuration of Hadoop. After running the
> word count example the output shown in the part-r-00000 file is as shown :
>
> hduser@MT2012158:/usr/local/hadoop$ head
> /tmp/gutenberg-output/gutenberg-output
>     40
>     2
>     4
> ��� � � � �@��    2
> ��� � � � �@�@��    1
> ���� � � � �@�@��    1
> P�������� j l k m �������� g��������������������EXTH
� j 2004-01-01d
> Leonardo    1
> P�������� � � � � �������� ���������������������EXTH
� t    1
> �P�������� � � � ������������ � �
� ���������EXTH � j 2004-01-01d
> Leonardo    1
> �P�������� � � � ������������ � �
� � �����EXTH � t    1
>
>
>
> Can you please tell me why this is happening?
>
>
>
>
> --
> *-Varsha *
>



-- 
*-Varsha *
Mime
View raw message