Emoji Stats

Explore statistics about the Unicode emoji set — category distribution, version growth, type breakdown.

Checker
3781
Total Emojis
10
Categories
16
Versions
67
Glossary Terms

Emojis by Category

😀 Smileys & Emotion
169
👋 People & Body
2261
🏻 Component
0
🐶 Animals & Nature
159
🍎 Food & Drink
131
🚗 Travel & Places
218
Activities
85
💡 Objects
264
❤️ Symbols
224
🏳️ Flags
270

Emojis by Type

1875
Skin Tone Variant
1384
Basic Emoji
262
Flag Sequence
248
ZWJ Sequence
12
Keycap Sequence

Emojis Added per Version

E0.6
+722 722
E0.7
+250 972
E1.0
+168 1140
E2.0
+281 1421
E3.0
+72 1493
E4.0
+2048 2389
E5.0
+239 2628
E11.0
+157 2785
E12.0
+230 3015
E12.1
+168 3183
E13.0
+117 3300
E13.1
+217 3517
E14.0
+112 3629
E15.0
+31 3660
E15.1
+118 3778
E16.0
+3 3781

How to Use

  1. 1
    Select a statistics category

    Choose from breakdowns by Unicode group, emoji version, modifier support, ZWJ sequence type, or platform coverage. Each view offers a different analytical lens on the Unicode emoji dataset.

  2. 2
    Explore charts and counts

    Review the distribution charts showing how emoji are distributed across groups, how many were added per Unicode version, and what percentage support skin tone modifiers. Hover over chart segments for exact counts and percentage breakdowns.

  3. 3
    Drill into specific subsets

    Click any chart segment or table row to filter the full emoji list to that subset, allowing you to browse all emojis matching the selected criterion alongside their codepoints and Unicode version data.

About

The Unicode emoji dataset is a richly structured corpus with multiple analytical dimensions. At the top level, emoji are grouped into nine categories in emoji-test.txt, but these groups are merely organizational conventions — the normative metadata lives in emoji-data.txt, which assigns each codepoint properties like Emoji, Emoji_Presentation, Emoji_Modifier, Emoji_Modifier_Base, and Emoji_Component. These properties govern how emoji behave in text processing, input methods, and rendering engines.

From a growth perspective, the emoji set has expanded from the original 722 emoji in Unicode 6.0 (2010) — largely imported from Japanese carrier encodings — to nearly 4,000 in Emoji 16.0. Each annual release cycle involves formal proposals submitted to the Unicode Emoji Subcommittee, review by the Unicode Technical Committee, and a beta period for public comment. Proposals are evaluated against criteria including expected usage frequency, distinctiveness from existing emoji, and breadth of appeal across cultures and geographies. Many proposed emoji are rejected on the grounds of excessive similarity to existing characters or insufficient evidence of widespread demand.

Sequence emoji — including skin tone modifier sequences, ZWJ sequences, keycap sequences, and flag sequences — account for a significant portion of the emoji catalog but share relatively few base codepoints. This architectural efficiency means that adding a new base emoji character can enable many derived sequences. Statistical analysis of the emoji dataset reveals patterns in design: the 'People & Body' group is the largest due to the combinatorial explosion of skin tone and gender variants, while 'Flags' forms a distinct cluster defined almost entirely by sequence mechanics rather than individual glyph design.

FAQ

How many total emoji are in Unicode Emoji 16.0?
Unicode Emoji 16.0 defines 3,953 fully-qualified emoji in the official emoji-test.txt file. This includes individual character emoji, keycap sequences, flag sequences, and ZWJ sequences. The raw number of codepoints used by all emoji is smaller, since sequences share codepoints — for example, all 26 flag sequences are built from just 26 Regional Indicator letters paired in combinations. The Unicode Consortium publishes updated counts with each release, and EmojiFYI tracks the full Emoji 16.0 dataset.
How many emoji support skin tone modifiers?
As of Emoji 16.0, over 100 individual emoji and hundreds of ZWJ sequences support Fitzpatrick skin tone modifiers. The five modifiers — U+1F3FB (Light), U+1F3FC (Medium-Light), U+1F3FD (Medium), U+1F3FE (Medium-Dark), and U+1F3FF (Dark) — are defined by the Unicode Consortium and are based on the Fitzpatrick scale used in dermatology. Only emoji in the 'Emoji_Modifier_Base' property category are eligible for modifier application, a property defined in Unicode's emoji-data.txt file.
How many new emoji are added each year?
The Unicode Consortium has been releasing one new Unicode (and emoji) version per year since Unicode 10.0 (2017). The number of new emoji per release has varied considerably: Unicode 6.0 (2010) added a large initial set of several hundred emoji from Japanese carrier sources, while recent releases like Emoji 15.1 (2023) added about 118 new sequences (mostly family/couple ZWJ variants) and Emoji 16.0 (2024) added around 8 new base emoji characters. The slower recent addition rate reflects both the maturing of the emoji set and higher scrutiny in the proposal process.
What percentage of emoji are flag sequences?
Flag sequences — formed by pairing two Regional Indicator Symbol letters encoding ISO 3166-1 alpha-2 country codes — number around 258 in Emoji 16.0, representing all UN-recognized countries and territories. This is approximately 6–7% of the total emoji count. Additionally, there are subdivision flag sequences (using tag characters, U+E0020–U+E007E) for certain regions like England, Scotland, and Wales, and a small number of other flag emoji (like the white flag U+1F3F3 and pirate flag ZWJ sequence). Flag rendering varies significantly — Apple and Google support most country flags, while Microsoft Windows renders many as ISO code text (e.g., 'US') rather than graphical flags.
How many emoji use the Zero Width Joiner (ZWJ)?
The emoji-zwj-sequences.txt file in Unicode Emoji 16.0 defines several hundred RGI ZWJ sequences. ZWJ sequences use U+200D (Zero Width Joiner) to combine existing emoji into new composite meanings — for example, woman (U+1F469) + ZWJ + laptop (U+1F4BB) = woman technologist (👩‍💻). The most complex ZWJ sequences are family emoji combining up to four components. Not all ZWJ combinations are RGI; only those explicitly listed in Unicode's ZWJ sequence file are considered standardized and broadly supported.