Encodings¶
Dixer supports read and set encoding of connection types csv
, json
, xml
, ragged-right
and fixed-width
using the encoding
or encoding_var
keys.
The supported encodings are:
UTF-8
: It's the default, you can leave in blank theencoding
andencoding_var
keys if you want.UTF-16
: UTF-16 with BOMUTF-16LE
: UTF-16 Little-Endian without BOMUTF-16BE
: UTF-16 Big-Endian without BOMIBM037
: IBM Code Page 037IBM437
: IBM Code Page 437IBM850
: IBM Code Page 850IBM852
: IBM Code Page 852IBM855
: IBM Code Page 855IBM00858
: Windows Code Page 858IBM860
: IBM Code Page 860IBM862
: IBM Code Page 862IBM863
: IBM Code Page 863IBM865
: IBM Code Page 865IBM866
: IBM Code Page 866IBM1047
: IBM Code Page 1047IBM1140
: IBM Code Page 1140ISO-8859-1
: ISOLatin1ISO-8859-2
: ISOLatin2ISO-8859-3
: ISOLatin3ISO-8859-4
: ISOLatin4ISO-8859-5
: ISOLatinCyrillicISO-8859-6
: ISOLatinArabicISO-8859-6-E
: ISOLatinArabic explicit modeISO-8859-6-I
: ISOLatinArabic implicit modeISO-8859-7
: ISOLatinGreekISO-8859-8
: ISOLatinHebrewISO-8859-8-E
: ISOLatinHebrew explicit modeISO-8859-8-I
: ISOLatinHebrew implicit modeISO-8859-9
: ISOLatin5ISO-8859-10
: ISOLatin6ISO-8859-13
ISO-8859-14
ISO-8859-15
ISO-8859-16
KOI8-R
KOI8-U
macintosh
x-mac-cyrillic
windows-874
windows-1250
windows-1251
windows-1252
windows-1253
windows-1254
windows-1255
windows-1256
windows-1257
windows-1258
How encoding works?¶
Dixer works internally with UTF-8, so when a encoding is specified to a file, if is a source, Dixer transform supported characters in file to UTF-8, and for writing, transform the UTF-8 to speficied encoding. If a character doesn't match this will be ignored, for example, if you are writing a file in ISO-8859-13
and some column value is español
then the output result in destination file is espaol
.