Return-Path: X-Original-To: apmail-oodt-dev-archive@www.apache.org Delivered-To: apmail-oodt-dev-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 6A8AD17A85 for ; Thu, 9 Oct 2014 03:55:43 +0000 (UTC) Received: (qmail 80887 invoked by uid 500); 9 Oct 2014 03:55:43 -0000 Delivered-To: apmail-oodt-dev-archive@oodt.apache.org Received: (qmail 80852 invoked by uid 500); 9 Oct 2014 03:55:43 -0000 Mailing-List: contact dev-help@oodt.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@oodt.apache.org Delivered-To: mailing list dev@oodt.apache.org Received: (qmail 80839 invoked by uid 99); 9 Oct 2014 03:55:42 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 09 Oct 2014 03:55:42 +0000 X-ASF-Spam-Status: No, hits=-0.0 required=5.0 tests=RCVD_IN_DNSWL_NONE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: local policy) Received: from [67.231.149.40] (HELO mx0a-00183501.pphosted.com) (67.231.149.40) by apache.org (qpsmtpd/0.29) with ESMTP; Thu, 09 Oct 2014 03:55:16 +0000 Received: from pps.filterd (m0047967.ppops.net [127.0.0.1]) by mx0a-00183501.pphosted.com (8.14.5/8.14.5) with SMTP id s993sU9i012925 for ; Wed, 8 Oct 2014 23:55:13 -0400 Received: from ustls.celgene.com ([216.118.82.93]) by mx0a-00183501.pphosted.com with ESMTP id 1pwbqbh97p-1 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=OK) for ; Wed, 08 Oct 2014 23:55:13 -0400 Received: from USSUMSPEXCMBX03.celgene.com ([169.254.4.208]) by USSUMSPEXCCAS01.celgene.com ([10.22.64.41]) with mapi id 14.03.0174.001; Wed, 8 Oct 2014 23:55:11 -0400 From: Konstantinos Mavrommatis To: "dev@oodt.apache.org" Subject: RE: How to ingest files when metadata contain non standard characters? Thread-Topic: How to ingest files when metadata contain non standard characters? Thread-Index: AQHP4sGJE1IbsI8A0kqtCFMghJjVMZwm7mcA//+/BOCAAIi4gP//lXeAgABVX0D//66JAAAKhncA Date: Thu, 9 Oct 2014 03:55:11 +0000 Message-ID: <0ED7EB96706C8E44B4B5B13C07026282421A9467@USSUMSPEXCMBX03.celgene.com> References: <219cf2ca9008419e8d0cb0f8590d96ec@aplex01.dom1.jhuapl.edu> <0ED7EB96706C8E44B4B5B13C07026282421A8ACF@USSUMSPEXCMBX03.celgene.com> <0ED7EB96706C8E44B4B5B13C07026282421A9314@USSUMSPEXCMBX03.celgene.com> <0ED7EB96706C8E44B4B5B13C07026282421A9412@USSUMSPEXCMBX03.celgene.com> In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.22.65.64] Content-Transfer-Encoding: base64 MIME-Version: 1.0 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10432:5.12.52,1.0.28,0.0.0000 definitions=2014-10-09_01:2014-10-09,2014-10-08,1970-01-01 signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 spamscore=0 suspectscore=0 phishscore=0 adultscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=7.0.1-1402240000 definitions=main-1410090039 Content-Type: text/plain; charset="utf-8" X-Virus-Checked: Checked by ClamAV on apache.org SGVyZSBpcyB0aGUgb2ZmZW5kaW5nIGZpbGUgYmVmb3JlIGVzY2FwZToNCg0KDQoNCjxjYXM6bWV0 YWRhdGEgeG1sbnM6Y2FzPSJodHRwOi8vb29kdC5qcGwubmFzYS5nb3YvMS4wL2NhcyI+DQoJPGtl eXZhbD4NCgkJPGtleT5kZXJpdmVkX2Zyb208L2tleT4NCgkJPHZhbD4vZ3Bmcy9jZWxnZW5lL3Jl ZmVyZW5jZS92MS9Ib21vLXNhcGllbnMvR1JDaDM3LnAxMi9TYWlsRmlzaEluZGV4PC92YWw+DQoJ CTx2YWw+L2dwZnMvYXJjaGl2ZS9SRUQvREEwMDAwMDcyL1JOQS1TZXEvUmF3RGF0YS9GYXN0cUZp bGVzL0hNMV8xX1IxLmZhc3RxLmd6PC92YWw+DQoJCTx2YWw+L2dwZnMvYXJjaGl2ZS9SRUQvREEw MDAwMDcyL1JOQS1TZXEvUmF3RGF0YS9GYXN0cUZpbGVzL0hNMV8xX1IyLmZhc3RxLmd6PC92YWw+ DQoJPC9rZXl2YWw+DQoJPGtleXZhbD4NCgkJPGtleT5GaWxlUGF0aDwva2V5Pg0KCQk8dmFsPi9n cGZzL2FyY2hpdmUvUkVEL0RBMDAwMDA3Mi9STkEtU2VxL1Byb2Nlc3NlZC9TYWlsZmlzaC10cmFu c2NyaXB0Q291bnRzL0hNMV8xLlNhaWxmaXNoLnNmaXNoPC92YWw+DQoJPC9rZXl2YWw+DQoJPGtl eXZhbD4NCgkJPGtleT5zdGFydF9leGVjdXRpb248L2tleT4NCgkJPHZhbD5UdWUgT2N0ICA3IDIw OjQ5OjEyIDIwMTQ8L3ZhbD4NCgk8L2tleXZhbD4NCgk8a2V5dmFsPg0KCQk8a2V5PmluZ2VzdF91 c2VyPC9rZXk+DQoJCTx2YWw+a21hdnJvbW1hdGlzPC92YWw+DQoJPC9rZXl2YWw+DQoJPGtleXZh bD4NCgkJPGtleT5lbmRfZXhlY3V0aW9uPC9rZXk+DQoJCTx2YWw+VHVlIE9jdCAgNyAyMTowMzo0 NyAyMDE0PC92YWw+DQoJPC9rZXl2YWw+DQoJPGtleXZhbD4NCgkJPGtleT5ydW5fdXNlcjwva2V5 Pg0KCQk8dmFsPmttYXZyb21tYXRpczwvdmFsPg0KCTwva2V5dmFsPg0KCTxrZXl2YWw+DQoJCTxr ZXk+ZmlsZV9ob3N0PC9rZXk+DQoJCTx2YWw+dXNzZGdzcGhwY2NhczAyPC92YWw+DQoJPC9rZXl2 YWw+DQoJPGtleXZhbD4NCgkJPGtleT5nZW5lcmF0b3I8L2tleT4NCgkJPHZhbD5zYWlsZmlzaDwv dmFsPg0KCTwva2V5dmFsPg0KCTxrZXl2YWw+DQoJCTxrZXk+cnVuX2hvc3Q8L2tleT4NCgkJPHZh bD51c3NkZ3NwaHBjY21wMDE8L3ZhbD4NCgk8L2tleXZhbD4NCgk8a2V5dmFsPg0KCQk8a2V5PnNh bXBsZV9pZDwva2V5Pg0KCQk8dmFsPjI1Njk8L3ZhbD4NCgk8L2tleXZhbD4NCgk8a2V5dmFsPg0K CQk8a2V5PmdlbmVyYXRvcl92ZXJzaW9uPC9rZXk+DQoJCTx2YWw+c2FpbGZpc2hbMC42LjNdPC92 YWw+DQoJPC9rZXl2YWw+DQoJPGtleXZhbD4NCgkJPGtleT5Qcm9kdWN0VHlwZTwva2V5Pg0KCQk8 dmFsPkdlbmVyaWNGaWxlPC92YWw+DQoJPC9rZXl2YWw+DQoJPGtleXZhbD4NCgkJPGtleT5hbmFs eXNpc190YXNrPC9rZXk+DQoJCTx2YWw+Mzg8L3ZhbD4NCgk8L2tleXZhbD4NCgk8a2V5dmFsPg0K CQk8a2V5PmdlbmVyYXRvcl9zdHJpbmc8L2tleT4NCgkJPHZhbD4ic2FpbGZpc2ggcXVhbnQgLS1p bmRleCAvZ3Bmcy9jZWxnZW5lL3JlZmVyZW5jZS92MS9Ib21vLXNhcGllbnMvR1JDaDM3LnAxMi9T YWlsRmlzaEluZGV4IC0tbGlidHlwZSAnVD1QRTpPPT48OlM9QVMnIC0xIDwoZ3VuemlwIC1jIC9n cGZzL2FyY2hpdmUvUkVEL0RBMDAwMDA3Mi9STkEtU2VxL1Jhd0RhdGEvRmFzdHFGaWxlcy9ITTFf MV9SMS5mYXN0cS5neikgLTIgPChndW56aXAgLWMgL2dwZnMvYXJjaGl2ZS9SRUQvREEwMDAwMDcy L1JOQS1TZXEvUmF3RGF0YS9GYXN0cUZpbGVzL0hNMV8xX1IyLmZhc3RxLmd6KSAtbyAvZ3Bmcy9h cmNoaXZlL1JFRC9EQTAwMDAwNzIvUk5BLVNlcS9Qcm9jZXNzZWQvU2FpbGZpc2gtdHJhbnNjcmlw dENvdW50cy9ITTFfMS5TYWlsZmlzaC50eHQgLXAgOCAgLS1ub19iaWFzX2NvcnJlY3QgIjwvdmFs Pg0KCTwva2V5dmFsPg0KPC9jYXM6bWV0YWRhdGE+DQoKKioqKioqKioqKioqKioqKioqKioqKioq KioqKioqKioqKioqKioqKioqKioqKioqKioqKioqKioqClRISVMgRUxFQ1RST05JQyBNQUlMIE1F U1NBR0UgQU5EIEFOWSBBVFRBQ0hNRU5UIElTCkNPTkZJREVOVElBTCBBTkQgTUFZIENPTlRBSU4g TEVHQUxMWSBQUklWSUxFR0VECklORk9STUFUSU9OIElOVEVOREVEIE9OTFkgRk9SIFRIRSBVU0Ug T0YgVEhFIElORElWSURVQUwKT1IgSU5ESVZJRFVBTFMgTkFNRUQgQUJPVkUuCklmIHRoZSByZWFk ZXIgaXMgbm90IHRoZSBpbnRlbmRlZCByZWNpcGllbnQsIG9yIHRoZQplbXBsb3llZSBvciBhZ2Vu dCByZXNwb25zaWJsZSB0byBkZWxpdmVyIGl0IHRvIHRoZQppbnRlbmRlZCByZWNpcGllbnQsIHlv dSBhcmUgaGVyZWJ5IG5vdGlmaWVkIHRoYXQgYW55CmRpc3NlbWluYXRpb24sIGRpc3RyaWJ1dGlv biBvciBjb3B5aW5nIG9mIHRoaXMKY29tbXVuaWNhdGlvbiBpcyBzdHJpY3RseSBwcm9oaWJpdGVk LiBJZiB5b3UgaGF2ZQpyZWNlaXZlZCB0aGlzIGNvbW11bmljYXRpb24gaW4gZXJyb3IsIHBsZWFz ZSByZXBseSB0byB0aGUKc2VuZGVyIHRvIG5vdGlmeSB1cyBvZiB0aGUgZXJyb3IgYW5kIGRlbGV0 ZSB0aGUgb3JpZ2luYWwKbWVzc2FnZS4gVGhhbmsgWW91Lgo=