Encodings
Dixer can read and write files in a specific encoding for the connection types csv, json, xml, ragged-right and fixed-width. Set the encoding with the encoding or encoding_var keys.
The supported encodings are:
- UTF-8: The default. You can leave the encoding and encoding_var keys blank if you want.
- UTF-16: UTF-16 with BOM
- UTF-16LE: UTF-16 Little-Endian without BOM
- UTF-16BE: UTF-16 Big-Endian without BOM
- IBM037: IBM Code Page 037
- IBM437: IBM Code Page 437
- IBM850: IBM Code Page 850
- IBM852: IBM Code Page 852
- IBM855: IBM Code Page 855
- IBM00858: Windows Code Page 858
- IBM860: IBM Code Page 860
- IBM862: IBM Code Page 862
- IBM863: IBM Code Page 863
- IBM865: IBM Code Page 865
- IBM866: IBM Code Page 866
- IBM1047: IBM Code Page 1047
- IBM1140: IBM Code Page 1140
- ISO-8859-1: ISOLatin1
- ISO-8859-2: ISOLatin2
- ISO-8859-3: ISOLatin3
- ISO-8859-4: ISOLatin4
- ISO-8859-5: ISOLatinCyrillic
- ISO-8859-6: ISOLatinArabic
- ISO-8859-6-E: ISOLatinArabic explicit mode
- ISO-8859-6-I: ISOLatinArabic implicit mode
- ISO-8859-7: ISOLatinGreek
- ISO-8859-8: ISOLatinHebrew
- ISO-8859-8-E: ISOLatinHebrew explicit mode
- ISO-8859-8-I: ISOLatinHebrew implicit mode
- ISO-8859-9: ISOLatin5
- ISO-8859-10: ISOLatin6
- ISO-8859-13
- ISO-8859-14
- ISO-8859-15
- ISO-8859-16
- KOI8-R
- KOI8-U
- macintosh
- x-mac-cyrillic
- windows-874
- windows-1250
- windows-1251
- windows-1252
- windows-1253
- windows-1254
- windows-1255
- windows-1256
- windows-1257
- windows-1258
How does encoding work?
Dixer works internally with UTF-8. When an encoding is specified for a file that is used as a source, Dixer transforms the supported characters in the file to UTF-8. When writing, it transforms the UTF-8 data to the specified encoding. A character that has no match in the target encoding is ignored. For example, if you are writing a file in ISO-8859-13 and a column value is español, the value written to the destination file is espaol, because ISO-8859-13 has no ñ.
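
The behaviour described above can be imitated with a minimal Python sketch. This is not Dixer's own code: the function names source_to_utf8 and utf8_to_destination are hypothetical, and Python's standard codecs with errors="ignore" are assumed to approximate how Dixer drops characters that do not match.

```python
# Illustrative only: Python's built-in codecs stand in for Dixer's converter.
# Both function names are hypothetical and are not part of Dixer.

def source_to_utf8(raw: bytes, encoding: str) -> bytes:
    """Reading a source: convert bytes in `encoding` to UTF-8."""
    # Bytes that are invalid in `encoding` are skipped.
    return raw.decode(encoding, errors="ignore").encode("utf-8")


def utf8_to_destination(data: bytes, encoding: str) -> bytes:
    """Writing a destination: convert UTF-8 bytes to `encoding`."""
    # Characters that `encoding` cannot represent are dropped.
    return data.decode("utf-8").encode(encoding, errors="ignore")


if __name__ == "__main__":
    value = "español".encode("utf-8")
    written = utf8_to_destination(value, "ISO-8859-13")
    print(written)  # b'espaol' (ñ does not exist in ISO-8859-13)

    read_back = source_to_utf8(b"espa\xf1ol", "ISO-8859-1")
    print(read_back.decode("utf-8"))  # español (0xF1 is ñ in ISO-8859-1)
```

Because unmatched characters are dropped without warning, it is worth checking that the destination encoding covers every character you expect in the data before choosing it.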