Return-Path: X-Original-To: apmail-hive-user-archive@www.apache.org Delivered-To: apmail-hive-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 793AA185DC for ; Thu, 30 Jul 2015 19:24:48 +0000 (UTC) Received: (qmail 96116 invoked by uid 500); 30 Jul 2015 19:24:46 -0000 Delivered-To: apmail-hive-user-archive@hive.apache.org Received: (qmail 96046 invoked by uid 500); 30 Jul 2015 19:24:46 -0000 Mailing-List: contact user-help@hive.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: user@hive.apache.org Delivered-To: mailing list user@hive.apache.org Received: (qmail 96031 invoked by uid 99); 30 Jul 2015 19:24:46 -0000 Received: from Unknown (HELO spamd1-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 30 Jul 2015 19:24:46 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd1-us-west.apache.org (ASF Mail Server at spamd1-us-west.apache.org) with ESMTP id E4347D8F59 for ; Thu, 30 Jul 2015 19:24:45 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd1-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 2.998 X-Spam-Level: ** X-Spam-Status: No, score=2.998 tagged_above=-999 required=6.31 tests=[HTML_MESSAGE=3, RCVD_IN_MSPIKE_H2=-0.001, SPF_PASS=-0.001] autolearn=disabled Received: from mx1-eu-west.apache.org ([10.40.0.8]) by localhost (spamd1-us-west.apache.org [10.40.0.7]) (amavisd-new, port 10024) with ESMTP id KwFmT4qzRrwl for ; Thu, 30 Jul 2015 19:24:37 +0000 (UTC) Received: from st13p17im-asmtp002.me.com (st13p17im-asmtp002.me.com [17.164.88.161]) by mx1-eu-west.apache.org (ASF Mail Server at mx1-eu-west.apache.org) with ESMTPS id 5DD2520593 for ; Thu, 30 Jul 2015 19:24:36 +0000 (UTC) Received: from st13p17im-spool001.me.com ([17.164.88.113]) by st13p17im-asmtp002.me.com (Oracle Communications Messaging Server 7.0.5.35.0 64bit (built Mar 31 2015)) with ESMTP id <0NSB000CFF8SVV20@st13p17im-asmtp002.me.com> for user@hive.apache.org; Thu, 30 Jul 2015 19:24:29 +0000 (GMT) X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:5.14.151,1.0.33,0.0.0000 definitions=2015-07-30_09:2015-07-30,2015-07-30,1970-01-01 signatures=0 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 spamscore=0 suspectscore=1 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=7.0.1-1412110000 definitions=main-1507300309 MIME-version: 1.0 Content-type: multipart/alternative; boundary="Boundary_(ID_BI4jjWnkSzS2hfumZfOEOg)" Received: from localhost ([17.164.88.115]) by st13p17im-spool001.mac.com (Oracle Communications Messaging Server 7.0.5.35.0 64bit (built Mar 31 2015)) with ESMTP id <0NSB007S3F8SJB60@st13p17im-spool001.mac.com> for user@hive.apache.org; Thu, 30 Jul 2015 19:24:28 +0000 (GMT) To: user@hive.apache.org From: murali parimi Subject: ISO-8859 character support in HIve Date: Thu, 30 Jul 2015 19:24:28 +0000 (GMT) X-Mailer: iCloud MailClient15D108 MailServer15D87.19522 X-Originating-IP: [66.175.245.2] Message-id: --Boundary_(ID_BI4jjWnkSzS2hfumZfOEOg) Content-type: text/plain; CHARSET=US-ASCII; format=flowed Content-transfer-encoding: 7BIT Hello Team, I am trying to load some data encoded in ISO-8859 into hive tables. Version: 0.13. The ascents and special symbols allowed in this character set are coming as some junk when I query this table. The SERDE is ORC. Did you ever faced similar issue? Does hive has any support of specifying the Character set at the table level? I tried to use encode and deconde functions. but it is of no use. Any pointers here would be highly appreciated! Thanks, Murali --Boundary_(ID_BI4jjWnkSzS2hfumZfOEOg) Content-type: multipart/related; boundary="Boundary_(ID_S6WpdarWGtVR5+2gcFlSKg)"; type="text/html" --Boundary_(ID_S6WpdarWGtVR5+2gcFlSKg) Content-type: text/html; CHARSET=US-ASCII Content-transfer-encoding: 7BIT
Hello Team,

I am trying to load some data encoded in ISO-8859 into hive tables. Version: 0.13. The ascents and special symbols allowed in this character set are coming as some junk when I query this table. The SERDE is ORC.

Did you ever faced similar issue? Does hive has any support of specifying the Character set at the table level? I tried to use encode and deconde functions. but it is of no use. Any pointers here would be highly appreciated!


Thanks,
Murali
--Boundary_(ID_S6WpdarWGtVR5+2gcFlSKg)-- --Boundary_(ID_BI4jjWnkSzS2hfumZfOEOg)--