Character Encoder

Encode any character to UTF-8 hex, HTML entity, CSS escape, JavaScript, Python, URL encoding, binary, and decimal. 17 formats per character with one-click copy.

Encoder
Presets:

How to Use

  1. 1
    Enter a character

    Type or paste any Unicode character into the input field. You can enter a single character, a codepoint like U+1F600, or a character name.

  2. 2
    View all encodings

    The tool instantly shows the character encoded in UTF-8, UTF-16, HTML entity, CSS escape, JavaScript escape, URL encoding, and other formats.

  3. 3
    Copy the encoding

    Click the copy button next to any encoding format to copy it to your clipboard for use in your code or document.

About

The Character Encoder is a developer tool that converts any Unicode character into every common encoding representation. Whether you need the UTF-8 byte sequence for a network protocol, an HTML entity for a web page, a CSS escape for a stylesheet, or a Python string literal, this tool gives you the exact syntax instantly.

Every character in the Unicode Standard (currently over 154,000 assigned characters) has a unique code point. However, the way that code point is represented differs across programming languages, markup languages, and transport protocols. UTF-8 uses variable-length byte sequences (1 to 4 bytes), HTML uses ampersand-based numeric references, CSS uses backslash-hex notation, and JavaScript/Python each have their own escape syntax.

This encoder handles the full Unicode range including supplementary plane characters such as emoji, historic scripts, and mathematical symbols. For characters above U+FFFF, it correctly produces JavaScript surrogate pairs, Python 8-digit escapes, and 4-byte UTF-8 sequences. All computation runs locally in your browser — your text is never sent to any server.

Common use cases include embedding special characters in HTML without relying on UTF-8 encoding, inserting symbols into CSS content properties, preparing URL-safe strings for APIs, debugging encoding issues in internationalized text, and converting between formats when porting code between programming languages.

FAQ

What encoding formats does the Character Encoder support?
The Character Encoder outputs 17 formats per character: Unicode code point (U+XXXX), UTF-8 hex bytes, HTML decimal entity, HTML hex entity, CSS escape, JavaScript escape, Python escape, URL encoding, binary representation, decimal value, UTF-16 code units, UTF-32, Java escape, Go escape, Rust escape, XML entity, and Base64.
Can I encode multiple characters at once?
Yes. Enter any string and the encoder breaks it down character by character, showing a full encoding table for each one. This is useful for encoding entire words or phrases across all formats simultaneously.
What is the difference between the Character Encoder and the Unicode Converter?
The Unicode Converter accepts input in many formats (U+hex, HTML entities, escape sequences) and resolves them to characters. The Character Encoder takes plain text input and produces all encoding representations, focusing on the encoding direction from character to code.
How does URL encoding work for Unicode characters?
URL encoding converts each byte of a character's UTF-8 representation into percent-encoded form. For example, the copyright symbol (U+00A9) becomes %C2%A9 because its UTF-8 encoding is two bytes: 0xC2 and 0xA9.
Does the Character Encoder handle emoji and supplementary characters?
Yes. The encoder fully supports characters outside the Basic Multilingual Plane, including emoji and historic scripts. It correctly shows surrogate pairs for JavaScript, 8-digit escapes for Python, and multi-byte sequences for UTF-8.