Unicode Reference
Browse Unicode code points, hex codes and characters. Click a character to copy.
Back to all tools on ToolForge
Characters
| U+ | Dec | Char | U+ | Dec | Char |
|---|
About Unicode Reference
This Unicode reference shows code points in U+hex and decimal with their characters. Useful for symbol lookups, copy-paste and encoding reference.
Unicode Block Reference
| Block | Range | Description |
|---|---|---|
| Basic Latin | U+0000-007F | ASCII characters, control codes |
| Latin-1 Supplement | U+0080-00FF | Western European letters |
| General Punctuation | U+2000-206F | Spaces, dashes, quotes |
| Arrows | U+2190-21FF | Directional arrows |
| Mathematical Operators | U+2200-22FF | Math symbols |
| Box Drawing | U+2500-257F | Line drawing characters |
| Emoji | U+1F600-1F64F | Emoticons |
Frequently Asked Questions
- What is a Unicode code point?
- A code point is a numeric value assigned to each character. Written as U+XXXX where X is hexadecimal. For example, U+0041 is Latin capital letter A.
- What is the difference between UTF-8 and UTF-16?
- UTF-8 uses 1-4 bytes per code point and is ASCII-compatible. UTF-16 uses 2-4 bytes and is common in Windows and Java. Both encode the same Unicode characters.
- What are surrogate pairs?
- Surrogate pairs combine two 16-bit values to represent code points above U+FFFF. Used in UTF-16 for characters like emoji (U+1F600).
- Why do some Unicode characters display as boxes?
- Missing font glyphs cause box display. The character exists in Unicode but your font or system lacks the visual representation.
- What is Unicode normalization?
- Normalization ensures equivalent sequences have the same binary form. NFC composes characters, NFD decomposes them. Important for string comparison.
- What is the valid Unicode range?
- Valid code points range from U+0000 to U+10FFFF. Some ranges are reserved, private use, or non-characters.