3.1.5.1 Sending VT-UTF8 and VT100+ Requests

The original VT100 protocol, as specified in [VT100], uses the ASCII character set. The UTF-8 algorithm MUST map a Unicode character into a string of 8-bit bytes. The number of 8-bit bytes depends on the bit width of the Unicode character, as shown in the following table.

 Bit width

 UTF8 encoding

0 - 7

0xxxxxxx

8 - 11

110xxxxx

10xxxxxx

12 - 16

1110xxxx

10xxxxxx

10xxxxxx