flink-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Fabian Hueske <fhue...@gmail.com>
Subject Re: UTF-16 support for TextInputFormat
Date Thu, 09 Aug 2018 08:04:05 GMT
Hi David,

Did you try to set the encoding on the TextInputFormat with

TextInputFormat tif = ...
tif.setCharsetName("UTF-16");

Best, Fabian

2018-08-08 17:45 GMT+02:00 David Dreyfus <dddreyfus@gmail.com>:

> Hello -
>
> It does not appear that Flink supports a charset encoding of "UTF-16". It
> particular, it doesn't appear that Flink consumes the Byte Order Mark (BOM)
> to establish whether a UTF-16 file is UTF-16LE or UTF-16BE. Are there any
> plans to enhance Flink to handle UTF-16 with BOM?
>
> Thank you,
> David
>

Mime
View raw message