3.2.5.1 Sending VT-UTF8 and VT100+ Requests

The original VT100 protocol, as specified in [VT100], uses the ASCII character set. The UTF-8 algorithm MUST map each Unicode character to a sequence of 8-bit bytes. The number of bytes depends on the bit width of the Unicode character (the number of significant bits in its code point), as shown in the following table.

 Bit width    UTF-8 encoding
 ---------    --------------------------
 0-7          0xxxxxxx
 8-11         110xxxxx 10xxxxxx
 12-16        1110xxxx 10xxxxxx 10xxxxxx
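
The mapping in the table above can be illustrated with a minimal sketch. The function name encode_utf8_16bit is hypothetical and is not defined by this protocol. The sketch assumes the input is an integer code point of at most 16 bits and covers only the three forms listed in the table. It does not reject the surrogate range (0xD800 to 0xDFFF), which a complete UTF-8 encoder would.

```python
def encode_utf8_16bit(code_point: int) -> bytes:
    """Encode a code point of at most 16 bits using the three forms in the table.

    Hypothetical helper for illustration only; not part of the specification.
    """
    if not 0 <= code_point <= 0xFFFF:
        raise ValueError("the table covers only code points of up to 16 bits")

    if code_point <= 0x7F:
        # 0-7 bits: 0xxxxxxx
        return bytes([code_point])

    if code_point <= 0x7FF:
        # 8-11 bits: 110xxxxx 10xxxxxx
        return bytes([
            0xC0 | (code_point >> 6),      # upper 5 bits
            0x80 | (code_point & 0x3F),    # lower 6 bits
        ])

    # 12-16 bits: 1110xxxx 10xxxxxx 10xxxxxx
    return bytes([
        0xE0 | (code_point >> 12),            # upper 4 bits
        0x80 | ((code_point >> 6) & 0x3F),    # middle 6 bits
        0x80 | (code_point & 0x3F),           # lower 6 bits
    ])
```

For example, under these assumptions U+0041 takes one byte (0x41), U+00E9 takes two bytes (0xC3 0xA9), and U+20AC takes three bytes (0xE2 0x82 0xAC).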