Computers deal in numbers, not letters. Unicode is the universal standard that maps every character in every human language (plus emojis) to a unique number called a "Code Point".
But finding these characters is hard. How do you type the "Copyright" symbol ©? Or the "Zero Width Space"? The Unicode Character Lookup is a search engine for all 149,186 characters in the Unicode standard.
What Can You Find?
1. Invisible Characters
These are characters that render no width but affect text processing.
- Zero Width Space (U+200B): Used to break long words without adding a hyphen.
- Right-to-Left Mark (U+200F): Essential for mixing Arabic/Hebrew with English text properly.
2. Look-Alike Characters (Homoglyphs)
Hackers use these for phishing. The Cyrillic "а" (U+0430) looks identical to Latin "a" (U+0061), but they are totally different. Our tool reveals the underlying code point so you can tell them apart.
3. Emojis
Search for "Cat" and find every feline Code Point: 🐈 (U+1F408), 😸 (U+1F638), 😹 (U+1F639). Get their Hex code to use in CSS or HTML.
Formats Provided
For every character, we provide:
- Character: The visual glyph.
- Name: E.g., "GREEK SMALL LETTER ALPHA".
- Code Point:
U+03B1. - HTML Entity:
αorα. - CSS Code:
B1. - JavaScript:
α.
The Unicode Planes
Unicode is divided into "Planes".
- Basic Multilingual Plane (BMP): Contains standard text for almost all modern languages (0-FFFF).
- Supplementary Planes: Where the Emojis and ancient hieroglyphs live. These require 4 bytes in UTF-8 (Surrogate Pairs in JS).