Question 1

What is UTF-8?

Accepted Answer

UTF-8 is a variable-width character encoding that uses 1 to 4 bytes per character. It is the dominant encoding for the web and is backward compatible with ASCII.

Question 2

How many bytes does a character use in UTF-8?

Accepted Answer

ASCII characters (U+0000 to U+007F) use 1 byte, Latin and similar scripts use 2 bytes, most Asian scripts use 3 bytes, and emoji/supplementary characters use 4 bytes.

UTF-8 Encode & Decode

How UTF-8 Encoding Works

Encode text as UTF-8 bytes

Display as hex bytes

FAQ

Related Tools

Unicode Encode & Decode

Hex Encode & Decode

Base64 Encode & Decode