List of Unicode characters
This is a list of Unicode characters.
An HTML or XML numeric character reference refers to a character by its Universal Character Set/Unicode code point, and uses the formatorwhere nnnn is the cod...
List of Unicode characters - Wikipedia
C0 Controls and Basic Latin
The Basic Latin (or C0 Controls and Basic Latin) Unicode block is the first block of the Unicode standard, and the only block which is encoded in one byte in UTF-8. The block contains all the letters ...
C1 Controls and Latin-1 Supplement
The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second Unicode block in the Unicode standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) - FF (U+00FF). Contro...
Latin Extended-A
Latin Extended-A is a block of the Unicode Standard.It encodes Latin letters from the Latin ISO character sets other than Latin-1 (which is already encoded in the Latin-1 Supplement block) and also le...
Latin Extended-A - Wikipedia
Latin Extended-B
Latin Extended-B is a block (0180-024F) of the Unicode Standard. It has been included since version 1.0, where it was only allocated to the code points U+0180..U+01FF and contained 113 characters. Dur...
Latin Extended-B - Wikipedia
IPA extensions (Unicode block)
IPA Extensions is a block (0250–02AF) of the Unicode standard that contains full size letters used in the International Phonetic Alphabet (IPA). Both modern and historical characters are included, as ...
Combining character
In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks (i...
Unicode and HTML for the Hebrew alphabet
The Unicode and HTML for the Hebrew alphabet are found in the following tables. The Unicode Hebrew block extends from U+0590 to U+05FF and from U+FB1D to U+FB4F. It includes letters, ligatures, combin...
Unicode and HTML for the Hebrew alphabet - Wikipedia
N'Ko alphabet
N'Ko (ߒߞߏ) is both a script devised by Solomana Kante in 1949, as a writing system for the Manding languages of West Africa, and the name of the literary language itself written in the script. The ter...
Samaritan script
The Samaritan alphabet is used by the Samaritans for religious writings, including the Samaritan Pentateuch, writings in Samaritan Hebrew, and for commentaries and translations in Samaritan Aramaic an...
Samaritan script - Wikipedia
Indic Unicode
In Unicode, many of the Brahmic scripts (or Indic scripts) are encoded. As of Unicode version 6.3 the following scripts have been encoded:
Thai script
Thai script (Thai: อักษรไทย; rtgs: akson thai;  [ʔàksɔ̌ːn tʰāj]  listen) is used to write the Thai language and other languages in Thailand. It has 44 consonant letters (Thai: พ...
Thai script - Wikipedia
Lao script
The Lao alphabet, Akson Lao (Lao: ອັກສອນລາວ [ʔáksɔ̌ːn láːw]), is the main script used to write the Lao language and other minority languages in Laos. It is ultimately of Indic origin, the alphabet inc...
Tibetan script
The Tibetan alphabet is an abugida of Indic origin used to write the Tibetan language as well as Dzongkha, the Sikkimese language, Ladakhi, and sometimes Balti. The printed form of the alphabet is cal...
Tibetan script - Wikipedia
Burmese script
The Burmese script (Burmese: မြန်မာအက္ခရာ; MLCTS: mranma akkha.ra; [mjəmà ʔɛʔkʰəjà]) is an abugida in the Brahmic family, used for writing Burmese. It is an adaptation of the Old Mon script or the Py...
Georgian alphabet
The Georgian scripts are the three writing systems used to write the Georgian language: Asomtavruli, Nuskhuri and Mkhedruli. Their letters are equivalent, sharing the same names and alphabetical order...
Georgian alphabet - Wikipedia
List of Hangul Jamo
This is the list of hangul jamo (Korean alphabet letters or characters which represent consonants and vowels) including obsolete ones. This list contains information of location in Unicode mapping."Ha...
List of Hangul Jamo - Wikipedia
Ge'ez script
Ge'ez (ግዕዝ Gəʿəz), (also known as Ethiopic) is a script used as an abugida (syllable alphabet) for several languages of Ethiopia and Eritrea. It originated as an abjad (consonant-only alphabet) and wa...
Ge'ez script - Wikipedia
Cherokee syllabary
The Cherokee syllabary is a syllabary invented by Sequoyah to write the Cherokee language in the late 1810s and early 1820s. His creation of the syllabary is particularly noteworthy in that he could n...
Cherokee syllabary - Wikipedia
Unified Canadian Aboriginal Syllabics (Unicode block)
Unified Canadian Aboriginal Syllabics is a Unicode block containing characters for writing Inuktitut, Carrier, several dialects of Cree, and Canadian Athabascan languages. Additions for some Cree dial...
Limbu script
The Limbu script is used to write the Limbu language. The Limbu script is an abugida derived from the Tibetan script.
According to traditional histories, the Limbu script was first invented in the...
Limbu script - Wikipedia
Latin extended additional
Latin Extended Additional is a block of the Unicode standard.The characters in this block are mostly precomposed combinations of Latin letters with one or more general diacritical marks. There are als...
Latin extended additional - Wikipedia
Greek alphabet
The Greek alphabet is the script that has been used to write the Greek language since the 8th century BC. It was derived from the earlier Phoenician alphabet, and was the first alphabetic script to ha...
Greek alphabet - Wikipedia
Miscellaneous Symbols
Miscellaneous Symbols is a Unicode block (U+2600–U+26FF) containing glyphs representing concepts from a variety of categories, astrological, astronomical, chess, dice, musical notation, political symb...
Miscellaneous Symbols - Wikipedia
Braille Unicode block
In Unicode, braille is represented in a block called Braille Patterns (U+2800..U+28FF). The block contains all 256 possible patterns of an 8-dot braille cell, thereby including the complete 6-dot cell...
Braille Unicode block - Wikipedia
CJK Unified Ideographs
The Chinese, Japanese and Korean (CJK) scripts share a common background. In the process called Han unification the common (shared) characters were identified, and named "CJK Unified Ideographs". Unic...
CJK Unified Ideographs - Wikipedia
Yi syllables (Unicode block)
Yi Syllables is a Unicode block containing the characters of the Liangshan Standard Yi script for writing the Nuosu, or Yi, language.Note that the name for U+A015 is a misnomer, as the character is ac...
Sylheti Nagari
Sylheti Nagari or Syloti Nagri (Silôṭi Nagôri) is the original script used for writing the Sylheti language. It is an almost extinct script, this is because the Sylheti Language itself was reduced to ...
Sylheti Nagari - Wikipedia
Linear B
Linear B is a syllabic script that was used for writing Mycenaean Greek, the earliest attested form of Greek. The script predates the Greek alphabet by several centuries. The oldest Mycenaean writing ...
Linear B - Wikipedia
Aegean numerals
Aegean numbers was the numeral system used by the Minoan and Mycenaean civilizations. They are attested in several Aegean scripts (Linear A, Linear B). They may have survived in the Cypro-Minoan scri...
Phaistos Disc
The Phaistos Disc (also spelled Phaistos Disk, Phaestos Disc) is a disk of fired clay from the Minoan palace of Phaistos on the Greek island of Crete, possibly dating to the middle or late Minoan Bron...
Phaistos Disc - Wikipedia