Code point ↔ UTF-8 conversion
First code point
|
Last code point
|
Byte 1
|
Byte 2
|
Byte 3
|
Byte 4
|
U+0000
|
U+007F
|
0xxxxxxx
|
|
U+0080
|
U+07FF
|
110xxxxx
|
10xxxxxx
|
|
U+0800
|
U+FFFF
|
1110xxxx
|
10xxxxxx
|
10xxxxxx
|
|
U+010000
|
[b]U+10FFFF
|
11110xxx
|
10xxxxxx
|
10xxxxxx
|
10xxxxxx
|
Source : Wikipedia