Character Encodings

List of well known character encodings

Character encodings represent mappings between bytes and text. Their usage occurs behind the scenes in computers, and most users are not even aware of it. They allow us to easily work with textual content, save it, and exchange it with others, without being bothered by how such data is transmitted through the network. Throughout history, many character encodings have been created. Below, you can find a list of well-known character encodings.

Character Encoding List

The character encoding table is shown below.

NameType

ASCII

7 bits

Big5

1 to 2 bytes (variable-width)

Big5-HKSCS

1 to 2 bytes (variable-width)

CP 437

1 byte

CP 850

1 byte

CP 852

1 byte

CP 865

1 byte

CP 866

1 byte

EUC-JP

1 to 3 bytes (variable-width)

EUC-KR

1 to 2 bytes (variable-width)

GB18030

1, 2, or 4 bytes (variable-width)

GB2312

1 to 2 bytes (variable-width)

GBK

1 to 2 bytes (variable-width)

ISO 8859-1

1 byte

ISO 8859-10

1 byte

ISO 8859-11

1 byte

ISO 8859-13

1 byte

ISO 8859-14

1 byte

ISO 8859-15

1 byte

ISO 8859-16

1 byte

ISO 8859-2

1 byte

ISO 8859-3

1 byte

ISO 8859-4

1 byte

ISO 8859-5

1 byte

ISO 8859-6

1 byte

ISO 8859-7

1 byte

ISO 8859-8

1 byte

ISO 8859-9

1 byte

KOI8-R

1 byte

KOI8-RU

1 byte

KOI8-T

1 byte

KOI8-U

1 byte

MacCyrillic

1 byte

MacRoman

1 byte

Shift-JIS

1 to 2 bytes (variable-width)

UTF-16

2 or 4 bytes (variable-width)

UTF-16 (Big Endian)

2 or 4 bytes (variable-width)

UTF-16 (Little Endian)

2 or 4 bytes (variable-width)

UTF-32

4 bytes

UTF-32 (Big Endian)

4 bytes

UTF-32 (Little Endian)

4 bytes

UTF-8

1 to 4 bytes (variable-width)

Windows 1250

1 byte

Windows 1251

1 byte

Windows 1252

1 byte

Windows 1253

1 byte

Windows 1254

1 byte

Windows 1255

1 byte

Windows 1256

1 byte

Windows 1257

1 byte

Tired of manually editing translation files?

Our platform streamlines software localization for you.

Copyrights 2024 © Localizely