Unicode Reference

Browse Unicode code points, hex codes and characters. Click a character to copy.

Back to all tools on ToolForge

More in Developer Tools

Characters

U+DecCharU+DecChar

About Unicode Reference

This Unicode reference shows code points in U+hex and decimal with their characters. Useful for symbol lookups, copy-paste and encoding reference.

Unicode Block Reference

BlockRangeDescription
Basic LatinU+0000-007FASCII characters, control codes
Latin-1 SupplementU+0080-00FFWestern European letters
General PunctuationU+2000-206FSpaces, dashes, quotes
ArrowsU+2190-21FFDirectional arrows
Mathematical OperatorsU+2200-22FFMath symbols
Box DrawingU+2500-257FLine drawing characters
EmojiU+1F600-1F64FEmoticons

Frequently Asked Questions

What is a Unicode code point?
A code point is a numeric value assigned to each character. Written as U+XXXX where X is hexadecimal. For example, U+0041 is Latin capital letter A.
What is the difference between UTF-8 and UTF-16?
UTF-8 uses 1-4 bytes per code point and is ASCII-compatible. UTF-16 uses 2-4 bytes and is common in Windows and Java. Both encode the same Unicode characters.
What are surrogate pairs?
Surrogate pairs combine two 16-bit values to represent code points above U+FFFF. Used in UTF-16 for characters like emoji (U+1F600).
Why do some Unicode characters display as boxes?
Missing font glyphs cause box display. The character exists in Unicode but your font or system lacks the visual representation.
What is Unicode normalization?
Normalization ensures equivalent sequences have the same binary form. NFC composes characters, NFD decomposes them. Important for string comparison.
What is the valid Unicode range?
Valid code points range from U+0000 to U+10FFFF. Some ranges are reserved, private use, or non-characters.