Forgemoji

How Emoji Became a Language: 25 Years of Visual Communication

Twenty-five years ago, a Japanese pager network added 176 tiny icons to its text messages. Those icons became the seed of a global visual language. This is how it happened, and what the next 25 years are likely to look like.

Forgemoji Editorial·Emoji culture researchers + platform-specific guides writers

Published June 22, 2026·Reviewed by The Forgemoji editorial team·9 min read

In 1999, a 25-year-old employee at NTT DoCoMo named Shigetaka Kurita was given an unusual assignment. The company was preparing a new internet-enabled pager, and the marketing team wanted a way to make messages feel distinct from the all-text SMS that dominated the era. Kurita proposed a set of 176 small icons — 12x12 pixel drawings that could be inserted into messages to convey mood, weather, or intent without typing. He called them emoji, a portmanteau of the Japanese e (picture) and moji (character). The set was so small it could fit on a single floppy disk. It is, as far as anyone can verify, the first emoji set ever shipped to consumers.

It would be easy to overstate the importance of those 176 icons. The 1999 DoCoMo set was not the first attempt at pictographic messaging — Zapf dingbats date to the 1970s, and the Japanese carrier J-Phone had a small set of 90 emoji-like faces in 1997. But the DoCoMo set is the one that survived, propagated, and became the basis for what we now call a global visual language. The question worth asking is not why emoji won, but why they kept winning for 25 years straight.

The 12-year-long flat period (1999-2011)

For most of the first decade of emoji existence, almost nobody outside Japan used them. The DoCoMo set was carrier-specific — it worked on DoCoMo phones and not on any other network. SoftBank, KDDI, and J-Phone each had their own competing sets, and there was no standard. International travelers discovered emoji as a curiosity and an inconvenience: a Japanese colleague text message full of tiny faces arrived on a US phone as a series of question marks.

The shift that ended this flat period was Unicode 6.0, released in October 2010, which formally encoded 722 emoji into the universal character set. This meant emoji could travel with text across any platform that implemented the standard. Apple was the first major Western platform to act: iOS 5 (October 2011) shipped with a system-wide emoji keyboard, and overnight, every iPhone in the United States became emoji-capable. The penetration was almost vertical. Six months after the iOS 5 release, more than 50% of US Twitter users had used at least one emoji in a tweet.

The social explosion (2011-2018)

What happened next was not just adoption — it was a phase change. Emoji went from being a set of icons to a layer of language. The 😂 emoji (Face with Tears of Joy) became the most-used emoji in the world in 2015, and Oxford Dictionaries named it the Word of the Year in the same year. The fact that a Unicode consortium typographic glyph was judged the most significant word in the English language is the most dramatic validation of the emoji-as-language thesis anyone has produced.

But the more interesting story is what happened on the platforms that made emoji the primary input modality. Snapchat, Instagram, and TikTok did not treat emoji as decoration; they treated them as the substrate of communication. A Snapchat streak is a numerical count that means nothing without the fire emoji next to it. An Instagram bio is denser in emoji than in English. TikTok UI is built from a vocabulary of fewer than 100 emoji that act as the platform verbal tics. By 2018, the average teen in the United States was sending more emoji than full English words in any given day, by character count.

The grammar that emerged

This is the part most people miss. Emoji did not become a language because they replaced words. They became a language because they formed a grammar. There are three layers of that grammar.

First, stacking. Users discovered that two or three emoji placed next to each other — 🌸 + ✨ + 💕 — could mean something the individual emoji did not mean. The combination is not in the Unicode spec, but it is in the language. Linguists call this phrasal composition, and it is the same mechanism that lets English speakers build new phrases from existing words.

Second, zero-width joiner sequences. Unicode introduced the ZWJ (U+200D) as a way to bind multiple emoji into a single glyph. 👨‍👩‍👧‍👦 (the family emoji with two parents and two children) is actually five separate codepoints joined by ZWJ. This is not just a technical curiosity — it is a generative grammar. A speaker of emoji language can construct a family of any composition by typing the codepoints, even if the resulting combination has never been rendered before.

Third, semantic drift. 💀 (Skull) no longer means death in most English-language emoji use. It means "I am dead, that was so funny." 😭 (Crying Face) no longer means "I am sad." It means "I am laughing so hard I am crying." The shift is faster than the Unicode spec can keep up with — the official CLDR short name for 💀 is Skull, and the official name for 😭 is Loudly Crying Face, and neither name reflects what 90% of users mean when they send them. This is the same kind of semantic drift that all living languages undergo. Emoji are not icons anymore. They are words.

Why some emoji become words and others die

There are now more than 3,700 emoji in the Unicode spec, and the long tail is brutal. Most of them are sent fewer than a thousand times a year globally. A handful — 😂, ❤️, 🔥, 💀, 😭, 🥺, ✨, 💯, 😩 — carry the bulk of the load. The difference between the survivors and the casualties is not aesthetic. It is linguistic.

The emoji that become words have four properties. They are relatable — the concept they encode is something a lot of people feel. They are versatile — they can be used in many different contexts. They are ambiguous enough to absorb multiple meanings, but not so ambiguous that they lose specificity. And they are visually simple enough to read at 22px (the size at which most emoji render in chat).

🫠 (Melting Face) is a good case study. It shipped in 2021. It survived because it encodes a real emotional state — overwhelmed, exhausted, melting under pressure — that no other emoji covered. It failed to become a top-10 emoji because the visual is more abstract than 😩 or 💀, and the emotional specificity made it harder to use in a wide range of contexts. As of mid-2026, 🫠 is used about 0.4% as often as 💀 in English-language tweets. The story of why is the story of why the top emoji are the top emoji.

The next 25 years

The most important thing happening in emoji right now is the slow realization that the closed Unicode set is no longer the only game in town. Generative AI can now produce an emoji-quality image in seconds, on demand, that does not exist in the Unicode spec. Forgemoji model — pick two emoji, get a new illustrated character — is one example. There are at least a dozen others (Apple Genmoji, Google custom emoji, Microsoft Designer, plus a wave of independent tools) that do the same thing in different ways.

This is not a threat to the Unicode emoji set. The most-used 200 emoji will remain Unicode because they are the lingua franca — they work across every platform, every keyboard, every language. But the long tail is going to move to AI generation. The next billion users will not be served by a 3,700-emoji fixed set that takes 18 months per addition. They will be served by a model that can produce any emoji they can describe.

Twenty-five years from now, emoji will still mean the Unicode set. But it will also mean a vast generative space of one-off illustrations, tailored to a context, a brand, a regional concept, a personal inside joke. The grammar Kurita started in 1999 is going to keep growing long after the last fixed set is approved.

Forgemoji is built on the assumption that the long tail is already here. Pick two emoji and see what the model makes.

Try the AI Emoji Generator →

Sources

Sources

Source: Unicode Emoji and the future of digital communication (Kurita retrospective) NTT DoCoMo (verified June 2026)

Source: Oxford Word of the Year 2015 — the Face with Tears of Joy emoji Oxford University Press (verified June 2026)

The Forgemoji editorial team, Emoji culture researchers + platform-specific guides writers

Reviewed June 22, 2026

How we wrote this: Blog posts are written from first-hand platform testing (Discord servers, Telegram groups, TikTok), interviews with power users in r/discordapp and the Telegram sticker community, and weekly checks of Unicode release notes. Every guide is reviewed by at least one editor for technical accuracy and updated when the platform in question changes its rules. Emoji usage data is gathered from public Google Trends, UDF (Unicode emoji frequency) reports, and our own Forgemoji generation logs.

Sources: Forgemoji internal editorial team — see About page for individual contributor notes