hadoop-common-user mailing list archives

From 张茂森 <maosen.zh...@alibaba-inc.com>
Subject error about character set(ASCII, UTF-8, Unicode) using TextInputFormat
Date Mon, 09 Oct 2006 09:54:54 GMT
Hi all: 

I’m trying to use Hadoop to process logs. I’ve written a routine to count
the number of logins from each IP address. However, because my logs contain
a mix of character encodings (ASCII, Unicode, UTF-8, etc.), the TextInputFormat
class in Hadoop fails with an error. Is there a good way to solve this problem?
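
To make the question concrete, here is a rough sketch of the kind of lenient handling that might work. It is plain Java with no Hadoop calls, and it rests on assumptions: the IP address is taken to be the first whitespace-delimited field of each line, undecodable bytes are replaced with U+FFFD instead of being rejected, and the class and method names (LenientLogDecoder, decodeLenient, extractIp, countLogins) are hypothetical, not part of Hadoop.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not a Hadoop class.
class LenientLogDecoder {

    // Decode raw bytes as UTF-8, substituting U+FFFD for any malformed
    // sequence instead of throwing. Pure ASCII decodes unchanged.
    static String decodeLenient(byte[] raw) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPLACE)
                .onUnmappableCharacter(CodingErrorAction.REPLACE);
        try {
            return decoder.decode(ByteBuffer.wrap(raw)).toString();
        } catch (CharacterCodingException e) {
            // Not expected when both error actions are REPLACE.
            throw new IllegalStateException(e);
        }
    }

    // Assumption: the IP address is the first whitespace-delimited field.
    static String extractIp(byte[] rawLine) {
        String line = decodeLenient(rawLine).trim();
        int end = 0;
        while (end < line.length() && !Character.isWhitespace(line.charAt(end))) {
            end++;
        }
        return line.substring(0, end);
    }

    // Count how many lines start with each IP address.
    static Map<String, Integer> countLogins(Iterable<byte[]> rawLines) {
        Map<String, Integer> counts = new HashMap<>();
        for (byte[] raw : rawLines) {
            String ip = extractIp(raw);
            if (!ip.isEmpty()) {
                counts.merge(ip, 1, Integer::sum);
            }
        }
        return counts;
    }
}
```

Because the IP field itself is plain ASCII, replacing undecodable bytes elsewhere on the line should not change the counts. In a Hadoop job the same decoding step would presumably sit in the map function and be applied to the raw bytes of each record; how to get at those bytes depends on the input format in use.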

Thank you very much!
