UTF-8 Encode & Decode

How UTF-8 Encoding Works

Text is converted to its UTF-8 byte representation shown in hexadecimal.

  1. 1

    Encode text as UTF-8 bytes

    The input string is encoded using the UTF-8 character encoding standard.

  2. 2

    Display as hex bytes

    Each byte is displayed as a hexadecimal value (e.g., 0x48 0x65 0x6C 0x6C 0x6F).

FAQ

What is UTF-8?
UTF-8 is a variable-width character encoding that uses 1 to 4 bytes per character. It is the dominant encoding for the web and is backward compatible with ASCII.
How many bytes does a character use in UTF-8?
ASCII characters (U+0000 to U+007F) use 1 byte, Latin and similar scripts use 2 bytes, most Asian scripts use 3 bytes, and emoji/supplementary characters use 4 bytes.

Related Tools